One of the experiments AI teams find themselves running often is comparing the responses from different models or different prompts on the same dataset.

In Athina, you can run an experiment to compare the responses from different models or prompts side-by-side in a few clicks.

After this, you can:

  • Compare the responses from different models or prompts side-by-side.
  • Run evaluations on all datasets simultaneously.
  • Download the results as a CSV, JSON, or Excel file.