Core Concepts
Latitude is organized around a small set of concepts, grouped by the loop you work in: Observe what your agent did, Understand what matters, and Refine with monitoring and tests.Projects
A project is the main workspace for one agent, application, or AI feature. Projects keep production data and reliability workflows scoped to the right product boundary. Project-scoped data includes:- spans, traces, and sessions
- users and tools
- search, saved searches, and behaviours
- scores from annotations, evaluations, flaggers, and custom checks
- signals grouped from failed scores
- evaluations and monitors
- project settings such as monitoring behaviour
Spans
A span is the smallest unit of captured work. It represents one operation inside an agent run, such as an LLM call, tool call, retrieval step, HTTP request, or custom instrumented operation. Spans can capture:- input and output content when available
- start time, end time, and duration
- success or error status
- tags and metadata
- provider, model, token usage, and cost for LLM calls
Traces
A trace is one complete interaction from start to finish, composed of one or more spans. For a chat agent, a trace commonly represents a single user turn and the agent work needed to respond. Traces are the main unit used across Latitude:- the trace detail view shows the full execution path
- search returns matching traces
- annotations, scores, and evaluations attach to traces
- flaggers run on completed traces
- failed scores on traces feed signal discovery
Sessions
A session groups related traces into a multi-turn conversation or workflow. Traces always exist; sessions exist when your application sends a stable session id with telemetry. Use sessions when multiple traces belong to the same user conversation, support ticket, workflow, or agent task. For example, each chat turn can be one trace while the whole conversation shares one session.Users
A user is an end user of your agent, identified by theuserId you send with telemetry. The Users view groups everything one user did: their sessions, errors, cost, and activity over time, along with the signals affecting them. Use it to see who is impacted by a problem and who is most active.
Tools
A tool is a function your agent can call. Latitude discovers every tool from your telemetry and tracks per-tool usage, error rate, and latency in the Tools view, so you can see which tools are heavily used, failing, or never called.Search
Search finds traces by meaning, exact text, metadata filters, or a combination of all three. It is the discovery layer for behaviours your team wants to inspect, review, and track. Saved searches let teams bookmark useful cohorts and review them repeatedly. Use search to find traces such as:- users showing frustration
- failed payments or onboarding problems
- tool loops and tool call errors
- hallucinated or incomplete answers
- traces from a specific user, model, release, environment, tag, or metadata value
Behaviours
A behaviour is a topic Latitude automatically discovers by clustering your sessions by meaning. Behaviours surface what your users are actually doing, with trends and outcome metrics, so you find patterns you did not know to search for. Where search starts from a query, behaviours start from a map of your traffic.Scores
A score is Latitude’s common verdict model for trace quality. Scores can come from human annotations, built-in flaggers, automated evaluations, or custom checks submitted through the API. Scores answer questions such as:- Did this trace pass or fail a quality check?
- What feedback explains the verdict?
- Which source produced the verdict?
- Should this failure contribute to signal discovery?
Signals
A signal is a recurring failure pattern Latitude tracks. Instead of leaving failures as isolated examples, Latitude groups similar failed scores into named signals with example traces, trends, affected users, lifecycle states, and linked evaluations. Signals created from negative human annotations are also called issues; the word is still used in that narrower context. Signals help teams:- triage new and escalating production problems
- inspect representative traces for the failure pattern
- decide whether a pattern is real, noise, resolved, or ignored
- generate evaluations that monitor the same failure mode on future traces
- catch regressions after a fix ships
Evaluations
An evaluation is an automated monitor that checks traces for a specific behaviour or quality criterion. Evaluations run on completed traces and produce scores, so their results feed the same analytics and signal discovery workflows as annotations, flaggers, and custom checks. Evaluations help teams:- monitor known failure modes on live traffic
- measure whether a fix reduced or eliminated a behaviour
- catch regressions after a signal is resolved
- turn important signals into ongoing monitors
- compare automated judgments with human annotations through alignment metrics
Monitors
A monitor watches a target, such as a signal, a saved search, a tool, or your raw traffic, and opens an incident when a condition is met. When an incident opens, Latitude notifies you in-app, by email, or in Slack. Every project comes with system monitors that watch the signal lifecycle, and you can create your own.How the concepts fit together
- A project receives telemetry from your agent.
- Agent operations arrive as spans.
- Related spans form completed traces, and related traces group into sessions.
- Search and Behaviours help you find the traces and topics that matter; Users and Tools break the same activity down by end user and by tool.
- Annotations, flaggers, evaluations, and custom checks create scores on traces.
- Failed scores cluster into signals you can triage, monitor, and fix. Signals from negative annotations are called issues.
- Monitors watch signals (and searches, tools, or raw traffic) and open incidents that alert you when something needs attention.
Next steps
- Start tracing: send your first spans and traces to Latitude.
- Observability: inspect traces, sessions, users, and tools.
- Search and Behaviours: find the traffic that matters.
- Signals: understand how failed scores become tracked signals.
- Monitors: get alerted when something needs attention.