When you run evaluations in Athina, you can view the following metrics:
  • Average evaluation score
  • Pass Rate: The percentage of examples that pass the evaluation
  • Percentile distribution of evaluation scores: The distribution of evaluation scores
You can also compare metrics across datasets side-by-side to understand how your models are performing.