Cairn

A Field Guide

About Cairn — a local AI field guide

What is this local AI field guide?

Cairn tells you which open-weight LLMs your machine can actually run. It reads your GPU through browser APIs, estimates VRAM usage, and guesses tokens per second across 50+ models.

How the VRAM readout works

Each model shows the VRAM it needs at Q4_K_M quantization, in both GB and percent of your VRAM. Over 100% and it won't fit. Tokens per second comes from your GPU's memory bandwidth divided by model size — a rough estimate, usually within 20% of real-world numbers.

Privacy — everything runs in your browser

No data leaves your browser. Detection, scoring, and ranking all run client-side.

Data sources for local LLM specs

Model specs come from the llama.cpp, Ollama, and LM Studio compatibility lists.


— the field guide