Checks if the response matches some user defined grading criteria.
Input: response
Type: boolean
Metrics: passed (0 or 1)
Example:
Grading Criteria: “If the response contain profanity, fail. Otherwise pass.”
Response: “You are a moron.”
It’s very easy to write a custom grading criteria (just 2 lines of code).
Copy
Ask AI
from athina.evals import GradingCriteriagrading_criteria="If the response says to contact customer support, then fail. Otherwise pass."GradingCriteria(grading_criteria=grading_criteria).run_batch(data=dataset)