Ollama

Overview

This guide shows you how to integrate Latitude Telemetry into an application that uses Ollama for local model inference.

You’ll keep calling Ollama exactly as you do today. Telemetry simply observes and enriches those calls.

Auto-instrumentation for Ollama is available in the Python SDK. For other languages, send traces via the OpenTelemetry exporter.

Requirements

A Latitude account and API key
A Latitude project slug
A project that uses the Ollama SDK (ollama)
A running Ollama server (set OLLAMA_HOST if it is not the default http://localhost:11434)

Steps

Install

pip install latitude-telemetry ollama

uv add latitude-telemetry ollama

poetry add latitude-telemetry ollama

Initialize and use

import ollama

from latitude_telemetry import Latitude, capture

latitude = Latitude(
    api_key="your-api-key",
    project="your-project-slug",
    instrumentations={"ollama": ollama},
)

def generate_reply():
    response = ollama.chat(
        model="llama3.2",
        messages=[{"role": "user", "content": "Hello"}],
    )
    return response["message"]["content"]

capture("generate-reply", generate_reply)

latitude.shutdown()

Seeing Your Traces

Once connected, traces appear automatically in Latitude:

Open your project in the Latitude dashboard
Each execution shows input/output messages, model, token usage, latency, and errors

Mistral AI Cohere

⌘I

Overview

Getting Started

Observe

Understand

Refine

Security and Compliance

Deployment

Development

More

Overview

Requirements

Steps

Seeing Your Traces

​Overview

​Requirements

​Steps

​Seeing Your Traces

Overview

Requirements

Steps

Seeing Your Traces