Kick off your first run in minutes using the Example Project that’s created automatically when you sign up. You can also browse the run that was kicked off for you and explore how testsets, prompts, and metrics fit together.
1

Open the Example Project and click Kickoff Run

After your organization is created, navigate to the Example Project’s Records page. Click the Kickoff Run button in the top right to open the Kickoff Run modal.

Kickoff Run button on the Records page.

2

Kick off your first run

In the modal, you can keep the Testset, System, and Metrics that are selected by default. Click “Kickoff run” to create the run and automatically evaluate the system.

Kickoff Run modal.
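If you prefer to script this step instead of using the modal, the sketch below shows what the same kickoff could look like as a single API call. The endpoint URL, payload keys, and IDs are hypothetical placeholders (this page only documents the UI flow); consult Scorecard’s API reference for the real routes.

```python
import requests

# Hypothetical sketch: kick off a run against the Example Project.
# URL, payload keys, and IDs below are illustrative placeholders,
# not Scorecard's documented API.
resp = requests.post(
    "https://api.example-scorecard-host/v1/runs",  # placeholder URL
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "testset_id": "ts_example",   # the Tone Testset
        "system_id": "sys_example",   # the default-selected System
        "metric_ids": ["m_correctness", "m_human_tone_check"],
    },
)
resp.raise_for_status()
print(resp.json())  # the new run's ID and status
```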

3

View results

After your run starts and scoring completes, open the results to see per-record scores, distributions, and explanations.

Run details with scores and aggregates.

Click the Run Again button in the top right corner to iterate with a different prompt version, model, or metric set.
Click any record on the Records page to view its testcase inputs, outputs, and score explanations.
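To make the aggregates concrete, here is a minimal, self-contained Python sketch that rolls a handful of made-up per-record Correctness scores into the kind of mean and distribution the run details page displays:

```python
from collections import Counter
from statistics import mean

# Toy illustration of the run-level aggregates: per-record
# Correctness scores (1-5 scale) summarized as a mean and a
# score distribution. The scores here are made up.
scores = [4, 5, 3, 4, 5, 4]

print(f"mean: {mean(scores):.2f}")               # mean: 4.17
print(f"distribution: {dict(Counter(scores))}")  # {4: 3, 5: 2, 3: 1}
```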
4

Browse the Example Project

Learn how the sample data is organized:
  • Tone Testset: inputs original and tone; expected output idealRewritten.
  • Prompt versions for Tone: already configured to run on Scorecard Cloud with a low temperature for consistency.
  • Metrics: Correctness (AI-scored, 1-5 scale) and Human Tone Check (human-scored, Boolean).

Testsets overview.

Open a Testset to see its schema and Testcases. Click a testcase row to view its inputs and expected outputs.

A sample Testset with Testcases.
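As a mental model, a testcase in the Tone Testset pairs the two inputs with one expected output. The dict below is illustrative only: the field names come from the schema described above, but the sample values and the exact storage format are assumptions.

```python
# Illustrative only: a Tone Testset testcase as a plain Python dict.
# Field names (original, tone, idealRewritten) match the Example
# Project's schema; the values are invented samples.
testcase = {
    "inputs": {
        "original": "Your order is late.",  # hypothetical sample text
        "tone": "friendly",
    },
    "expected": {
        "idealRewritten": "Thanks for your patience -- your order is on its way!",
    },
}
```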

Next, browse Prompts. Use “View” to open a prompt and review its messages and model settings.

Prompts list.

Inside a prompt version, you can see the template (with Jinja-style variables) and the model configuration.

Prompt version with template and model settings.
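Because the templates use Jinja-style variables, you can reproduce the substitution locally with the jinja2 package. The template text below is a hypothetical stand-in for the Tone prompt, not its actual wording:

```python
from jinja2 import Template

# Hypothetical template mirroring the Tone prompt's shape; the real
# template lives in the prompt version editor.
template = Template(
    "Rewrite the following text in a {{ tone }} tone:\n\n{{ original }}"
)

print(template.render(original="Our servers are down again.", tone="apologetic"))
```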

Finally, explore Metrics to learn how scoring works. Each metric has guidelines, an evaluation type, and an output type.

Metrics configured for the Example Project.
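For reference, the two Example Project metrics can be pictured as small config records like the sketch below. The names, scales, and types match the list in step 4; the keys and guideline wording are illustrative, not Scorecard’s actual schema.

```python
# Sketch of the Example Project's two metrics as config records.
# Keys and guideline text are illustrative assumptions.
metrics = [
    {
        "name": "Correctness",
        "evaluation_type": "AI",     # scored by an evaluator model
        "output_type": "int",        # 1-5 scale
        "range": (1, 5),
        "guidelines": "Does the rewrite preserve the original meaning?",  # hypothetical wording
    },
    {
        "name": "Human Tone Check",
        "evaluation_type": "Human",  # scored by a human reviewer
        "output_type": "bool",       # pass/fail
    },
]
```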

Where to go next

  • Read about creating and managing Testsets in Testsets
  • Dive deeper into running evaluations in Runs & Results
  • Explore interactive prompt iteration in the Playground
  • Define and reuse evaluation criteria with Metrics
That’s it — you’ve seen Scorecard in action and how example data flows through prompts, runs, and metrics. Have fun iterating!