Documentation Index
Fetch the complete documentation index at: https://docs.latitude.so/llms.txt
Use this file to discover all available pages before exploring further.
Evaluation Alignment
Alignment measures how closely evaluations match human judgment. It answers: Can you trust this monitor to represent the issue your team cares about?Why Alignment Matters
Evaluations are useful only when they agree with the way your team reviews real traces. Without alignment tracking:- You may not notice that an evaluation is too strict or too lenient.
- You may miss drift as your agent, users, or product behavior changes.
- You may keep monitoring an issue with outdated examples.
How Alignment Works
Alignment is computed when an evaluation and a human annotation score the same trace. Latitude compares their verdicts and uses the result as feedback for the monitor. The goal is not just to display a metric; it is to keep the evaluation close to the latest human-reviewed examples of the issue.Viewing Alignment
Each evaluation detail page shows alignment information when enough human-reviewed traces are available. Use it to see whether the evaluation still matches reviewer expectations and where it may be drifting.Alignment and Evaluation Generation
When you generate an evaluation from an issue:- Latitude collects examples from annotations, issue-linked scores, and trace context.
- Latitude creates a monitor for the issue pattern.
- The evaluation is compared against known examples.
- The monitor is attached to the issue.
Automatic Realignment
Once an evaluation exists, Latitude can realign it as new annotations, flagger matches, evaluation results, and custom scores arrive. This keeps the monitor matched to the issue as production traffic evolves.Improving Alignment
When an evaluation appears misaligned:- Review traces where the evaluation and human review disagree.
- Add annotations with specific feedback.
- Confirm the issue contains representative examples of the behavior you want to track.
- Let the new signal improve future realignment.
Next Steps
- Annotations: How human review produces alignment signal
- Flaggers: Automatic annotators that contribute signal
- Search: Build cohorts of traces to annotate
- Issues: How failed evaluations become trackable issues