The core workflow
Connect telemetry (Observe)
Send traces from your agent to Latitude. Each interaction becomes a trace of spans (LLM calls, tool calls, retrieval, and more), and multi-turn conversations group into sessions. Send a
userId and sessionId so you can also break activity down per user and review reliability, errors, and latency per tool.If you have not connected your app yet, follow Start tracing.Find what matters (Understand)
Use Search to find conversations by meaning, exact text, or metadata filters: frustrated users, tool loops, hallucinations, failed workflows, or anything specific to your product. Behaviours goes further, automatically clustering your sessions into the topics users actually hit, so you discover patterns without writing a query.
Annotate what is good or bad (Understand)
Open traces from search results, behaviours, or the trace list and leave annotations. A thumbs-down with clear feedback tells Latitude this behaviour is worth tracking. Flaggers also annotate common failure categories automatically, such as frustration, refusal, jailbreaking, tool errors, and empty responses.
Let Latitude group failures into signals (Understand)
Triage and monitor (Refine)
Triage signals on the Signals page: set priority, inspect example traces, and resolve noise. Monitors watch a signal, a saved search, a tool, or your raw traffic and open an incident when something needs attention, notifying you in-app, by email, or in Slack. Generate evaluations to keep scoring live traffic for the same failure.
Fix and prevent regressions (Refine)
Fix the underlying behaviour in your code, prompts, tools, retrieval, or product flow. Turn the failing traces into a dataset and regression test the fix, using the MCP server to drive it from your coding agent so the failure cannot quietly return. Resolve the signal once it is fixed, and the regressed monitor tells you if it comes back. Repeat the loop as new production traffic arrives.
What to focus on first
If you are setting up Latitude for the first time:- Connect tracing for one production agent.
- Add
userIdandsessionIdso traces group by user and conversation. - Search for one failure mode your team already cares about, or browse Behaviours to see what stands out.
- Annotate representative traces with specific feedback.
- Watch the Signals page for grouped patterns, and let the built-in monitors notify you.
- Generate evaluations for the signals you want to score continuously.
Work with agents and self-host
- MCP: manage your workspace straight from Claude, Cursor, and other agents through the MCP server.
- Self-hosting: run Latitude in your own infrastructure, from a single host to a full cluster.