Get Started With Evaluations

Start Here ↓

You can only evaluate a model if you have a snapshot of your dataset.

Create an New Evaluation

Browse and Choose Metrics

Check your results

Once your evaluation is done you can check your results. Image of checking your results

You can click on each metric to organize the results.To get more details on each datapoint, click on the percentage under the model name. Image of checking your results

The results will look something like this: Image of checking your details

Click Model Results Tab to see additional details of the evaluation based on the model and metric.

Here are Some Results to Keep in Mind:

Average Score: The average score of the evaluation for the model.

Model Name: The name of the model the evaluation was run on.

System Prompt: The system prompt used for the evaluation.

User Message: The user message from the dataset.

Original Assistant Message: The original assistant message from the dataset.

Predicted Assistant Message: The predicted assistant message from the model.

Model Score: The score of the model chosen for the evaluation.

Score Reason: The reasoning behind the score.

Get started

Projects 🚀

Datasets 🗃️

Fine-Tuning 🛠️

Inference 🏃‍♂️

Agentic Evaluations 📈

User Guides 📚

Resources 🧰

Get Started With Evaluations

Start Here ↓

Here are Some Results to Keep in Mind:

Other Options

Bring Your Own Evaluation

Evaluation Metrics

Fine-Tuned Model → Playground

Get started

Projects 🚀

Datasets 🗃️

Fine-Tuning 🛠️

Inference 🏃‍♂️

Agentic Evaluations 📈

User Guides 📚

Resources 🧰

Documentation Index

​Start Here ↓

​Here are Some Results to Keep in Mind:

​Other Options

Bring Your Own Evaluation

Evaluation Metrics

Fine-Tuned Model → Playground

Start Here ↓

Here are Some Results to Keep in Mind:

Other Options