Skip to main content

Overview

Evals score how well Raily answers real queries against your data. Every search is rated, so you can see whether the right content is surfacing, spot weak queries, and catch regressions after a config change. See them in the dashboard at app.raily.ai/dashboard/evals.
Evals scores for recent searches

What gets scored

Each search is evaluated across three stages:
  • Query understanding — does Raily correctly interpret what the user is asking?
  • Retrieval — does it find the right content for that query?
  • Output — is the final answer good?