Logging Metrics
Inference analytics jobs will run once per day. We store data at thedaily
level, so we have granular “per-day” analytics, but not per-hour.
You can filter inference analytics by the following fields to explore usage for specific segments:
date
environment
prompt_slug
customer_id
topic
language_model_id
View evaluation metrics
Evaluation analytics jobs run every hour. You can filter evaluation analytics by the following fields to explore performance for specific segments:environment
prompt_slug
customer_id
topic
language_model_id
- avg. evaluation score per day
- percentile distributions of eval metrics