Preset Evaluators
Athina has a large library of preset evaluators to cover all kinds of common use cases.
View the evaluators in the Athina IDE.
View the evaluators on Github in Athina's Open-Source Evaluation SDK.
Available Preset Evaluators
You can also create custom evaluators. See here for more information.
RAG Evals
These evals are useful for evaluating LLM applications with Retrieval Augmented Generation (RAG):
Context Contains Enough Information
Does Response Answer Query
Response Faithfulness
Groundedness
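These RAG presets can also be run programmatically through Athina's open-source SDK. The sketch below is a minimal, hedged example: the module paths, class name, and run() signature follow the SDK's documented usage but should be checked against the current API reference.

```python
import os

# Assumed imports from Athina's open-source SDK (pip install athina);
# verify module paths and class names against the SDK reference.
from athina.evals import DoesResponseAnswerQuery
from athina.keys import OpenAiApiKey

# LLM-graded presets need a key for the grading model.
OpenAiApiKey.set_key(os.environ["OPENAI_API_KEY"])

# Run a single datapoint through the "Does Response Answer Query" preset.
result = DoesResponseAnswerQuery().run(
    query="Where is the Eiffel Tower?",
    response="The Eiffel Tower is in Paris, France.",
)
print(result)
```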
RAGAS Evals
RAGAS is a popular library with state-of-the-art evaluation metrics for RAG pipelines:
Context Precision
Context Relevancy
Context Recall
Faithfulness
Answer Relevancy
Answer Semantic Similarity
Answer Correctness
Coherence
Conciseness
Maliciousness
Harmfulness
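The same metrics can be computed with the RAGAS library directly. The sketch below assumes the pre-0.2 ragas.evaluate() interface, the lowercase metric objects, and the standard column layout (question, contexts, answer, ground_truth); newer RAGAS releases have reorganized the API, so treat this as illustrative.

```python
from datasets import Dataset  # pip install ragas datasets
from ragas import evaluate
from ragas.metrics import answer_relevancy, context_precision, faithfulness

# A tiny single-row dataset in the column layout RAGAS expects.
data = Dataset.from_dict({
    "question": ["Where is the Eiffel Tower?"],
    "contexts": [["The Eiffel Tower is a landmark in Paris, France."]],
    "answer": ["The Eiffel Tower is in Paris."],
    "ground_truth": ["The Eiffel Tower is located in Paris, France."],
})

# Score the dataset against a subset of the metrics listed above.
scores = evaluate(data, metrics=[context_precision, faithfulness, answer_relevancy])
print(scores)
```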
Safety Evals
These evals are useful for evaluating LLM applications with safety in mind:
PII Detection: Will fail if PII is found in the text
Prompt Injection: Will fail if any known Prompt Injection attack is found in the text
OpenAI Content Moderation: Will fail if text is potentially harmful
Guardrails: A popular library of custom validators for LLM applications:
  Safe for work: Checks if text has inappropriate/NSFW content
  Not gibberish: Checks if response contains gibberish
  Contains no sensitive topics: Checks for sensitive topics
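As an illustration of what the OpenAI Content Moderation preset checks, the snippet below calls the OpenAI Moderation endpoint directly and fails when the input is flagged. The Athina preset wraps an equivalent check; the pass/fail logic here is a simplified assumption, not the preset's implementation.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def passes_content_moderation(text: str) -> bool:
    """Return False if the moderation endpoint flags the text as potentially harmful."""
    result = client.moderations.create(input=text)
    return not result.results[0].flagged

print(passes_content_moderation("Have a great day!"))  # expected: True
```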
Summarization Evals
These evals are useful for evaluating LLM-powered summarization performance:
Summarization Accuracy
JSON Evals
These evals are useful for validating JSON outputs:
JSON Schema Validation
JSON Field Validation
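To see what the JSON Schema Validation eval checks, here is a rough standalone equivalent using the jsonschema package: it fails if the response is not valid JSON or does not conform to an expected schema. This is an illustration of the check, not Athina's implementation.

```python
import json
from jsonschema import validate, ValidationError  # pip install jsonschema

schema = {
    "type": "object",
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
    "required": ["name", "age"],
}

def json_schema_validation(response: str) -> bool:
    """Return True if the response parses as JSON and matches the schema."""
    try:
        validate(instance=json.loads(response), schema=schema)
        return True
    except (json.JSONDecodeError, ValidationError):
        return False

print(json_schema_validation('{"name": "Ada", "age": 36}'))  # True
print(json_schema_validation('{"name": "Ada"}'))             # False (missing "age")
```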
Function Evals
Unlike the previous evaluators, which use an LLM for grading, function evals use simple functions to check whether:
Text matches a given regular expression
Text contains a link
Text contains keywords
Text contains no invalid links
Text is missing keywords
Head over to the function evaluators page for further details.
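Because these checks are plain functions rather than LLM graders, they are easy to reproduce. The helpers below are illustrative stand-ins for a few of the presets above, not the SDK's own functions.

```python
import re

def matches_regex(text: str, pattern: str) -> bool:
    """Passes if the text matches the given regular expression."""
    return re.search(pattern, text) is not None

def contains_link(text: str) -> bool:
    """Passes if the text contains an http(s) link."""
    return re.search(r"https?://\S+", text) is not None

def contains_keywords(text: str, keywords: list[str]) -> bool:
    """Passes if every keyword appears in the text (case-insensitive)."""
    lowered = text.lower()
    return all(kw.lower() in lowered for kw in keywords)

print(matches_regex("Order #12345 confirmed", r"#\d+"))              # True
print(contains_link("See https://docs.athina.ai for details"))       # True
print(contains_keywords("Paris is in France", ["paris", "france"]))  # True
```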
Evals with Ground Truth
These evaluators compare the response against reference data:
Answer Similarity
Context Similarity
Head over to the grounded evaluators page for further details.
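As a rough sketch of what a ground-truth comparison looks like, the snippet below scores a response against a reference answer with a simple string-similarity ratio. Athina's Answer Similarity preset uses its own scoring, so treat this purely as an illustration of the idea.

```python
from difflib import SequenceMatcher

def answer_similarity(response: str, expected_response: str) -> float:
    """Return a 0-1 similarity score between the response and the reference answer."""
    return SequenceMatcher(None, response.lower(), expected_response.lower()).ratio()

score = answer_similarity(
    "The Eiffel Tower is in Paris.",
    "The Eiffel Tower is located in Paris, France.",
)
print(f"similarity: {score:.2f}")  # compare against whatever pass threshold you choose
```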