With Athina, you can also ask an LLM to do a pairwise evaluation.

This is useful when you want to compare two responses against a specified criterion (for example, conciseness) and determine which one is better.
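
To make the idea concrete, here is a minimal sketch of what a pairwise evaluation involves: build a prompt that shows both responses and the criterion, ask a judge model for a verdict, and normalize the answer. This is not Athina's API. The names `build_pairwise_prompt`, `parse_verdict`, `pairwise_evaluate`, and the `judge` callable are hypothetical and used only for illustration. In practice, `judge` would wrap a call to whichever LLM you use.

```python
from typing import Callable

# Hypothetical prompt template; the wording is an assumption, not Athina's prompt.
PROMPT_TEMPLATE = (
    "You are comparing two responses to the same query.\n"
    "Criterion: {criterion}\n\n"
    "Query: {query}\n\n"
    "Response A:\n{response_a}\n\n"
    "Response B:\n{response_b}\n\n"
    "Answer with exactly one of: A, B, TIE."
)

VALID_VERDICTS = {"A", "B", "TIE"}


def build_pairwise_prompt(
    query: str, response_a: str, response_b: str, criterion: str
) -> str:
    """Fill the template with the query, both responses, and the criterion."""
    return PROMPT_TEMPLATE.format(
        criterion=criterion,
        query=query,
        response_a=response_a,
        response_b=response_b,
    )


def parse_verdict(raw: str) -> str:
    """Normalize the judge's raw text to A, B, TIE, or INVALID."""
    verdict = raw.strip().upper()
    return verdict if verdict in VALID_VERDICTS else "INVALID"


def pairwise_evaluate(
    judge: Callable[[str], str],
    query: str,
    response_a: str,
    response_b: str,
    criterion: str,
) -> str:
    """Ask the judge which response better satisfies the criterion."""
    prompt = build_pairwise_prompt(query, response_a, response_b, criterion)
    return parse_verdict(judge(prompt))


if __name__ == "__main__":
    # Stand-in judge that always answers "B"; replace with a real LLM call.
    def stub_judge(prompt: str) -> str:
        return " b\n"

    result = pairwise_evaluate(
        stub_judge,
        query="What is the capital of France?",
        response_a="The capital of France, a country in Europe, is Paris.",
        response_b="Paris.",
        criterion="conciseness",
    )
    print(result)
```

Passing the judge in as a callable keeps the comparison logic independent of any particular model provider. Constraining the answer to a small set of labels makes the verdict easy to validate.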

Here’s a video that shows how you can do this: