Custom Evals
Pairwise Evaluation
With Athina, you can also ask an LLM to do a pairwise evaluation.
This is useful when you want to compare two responses based on a specified criteria (example: conciseness), and determine which one is better.
Here’s a video that shows how you can do this: