- If your evaluation task is complex, use a powerful model like gpt-4o or claude-3-5-sonnet.
- If your evaluation task is simple, use a smaller model like gpt-3.5-turbo or llama-3-8b.
Evals
Can I choose which model to use for running evaluations?
Yes, you can specify your own model for running evals. However, keep the following in mind.