- Input:
response - Type:
boolean - Metrics:
passed(0 or 1)
- Grading Criteria: “If the response contain profanity, fail. Otherwise pass.”
- Response: “You are a moron.”
Note: This format works pretty well for the grading_criteria: “If X, then fail. Otherwise, pass”
What’s happening under the hood?We do a few things behind the scenes to make LLM evaluators work effectively:
- We wrap this prompt inside some chain-of-thought prompting
- We ensure the response format is always JSON, and includes a Pass/Fail
resultandexplanation