Answer Completeness

On this page

This is an LLM Graded Evaluator Github

This evaluator checks if the response answer’s the query sufficiently. Required Args

Default Engine: gpt-4

Query: Which spaceship landed on the moon first?

Response: Neil Armstrong was the first man to set foot on the moon in 1969

Eval Result

Result: Fail
Explanation: The query is asking which spaceship landed on the moon first, but the response only mentions the name of the astronaut, and does not say anything about the name of the spaceship.

from athina.loaders import Loader

# Load the data from JSON, Athina or Dictionary
dataset = Loader().load_json(json_file)

from athina.evals import DoesResponseAnswerQuery

DoesResponseAnswerQuery().run_batch(data=dataset)

from athina.evals import DoesResponseAnswerQuery

DoesResponseAnswerQuery().run(
    query=query,
    response=response
)