Evals
Can I choose which model to use for running evaluations?
Yes, you can specify your own model for running evals. However, keep the following in mind.
-
If your evaluation task is complex, use a powerful model like
gpt-4o
orclaude-3-5-sonnet
. -
If your evaluation task is simple, use a smaller model like
gpt-3.5-turbo
orllama-3-8b
.
Currently, we support all the major public models, as well as custom models.