Hand us a teacher API and a task spec. We distill it into a 20–50M model that runs on CPU, edge, or browser with zero API dependency at inference.
A real distillation, in our language. Each step maps to something concrete in the engine.
Install the CLI and distill your first Spirit. It takes about 30 minutes on a single GPU and roughly $0.30 in teacher API calls.
# Install
pip install distillarium[gemini]

# Distill (uses GOOGLE_API_KEY from env)
distillery distill recipes/needle.tool-calling-v1.yaml

# Inspect your local Cellar
distillery cellar

# Re-taste against fresh held-out data
distillery taste spirits/needle.pt --mash held_out.jsonl

# Bottle for deployment (ONNX, GGUF, WASM)
distillery bottle spirits/needle.pt --format onnx
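The distill, taste, and bottle steps above can be chained into one script. The following is a minimal sketch, not part of the CLI: the helper names `require_key`, `step`, and `run_pipeline` and the `DRY_RUN` variable are hypothetical, while the `distillery` commands and paths are copied from the quickstart. It checks for the teacher API key up front so a missing key fails immediately instead of partway into a run.

```shell
#!/bin/sh
# Hypothetical wrapper around the quickstart commands (not shipped with the CLI).

# Fail early if the teacher API key is missing from the environment.
require_key() {
  if [ -z "${GOOGLE_API_KEY:-}" ]; then
    echo "GOOGLE_API_KEY is not set" >&2
    return 1
  fi
}

# Run one command, or only print it when DRY_RUN=1 (assumed convention).
step() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Distill, re-taste, then bottle; stop at the first failing step.
run_pipeline() {
  require_key || return 1
  step distillery distill recipes/needle.tool-calling-v1.yaml || return 1
  step distillery taste spirits/needle.pt --mash held_out.jsonl || return 1
  step distillery bottle spirits/needle.pt --format onnx || return 1
}
```

To use it, source the file and call `run_pipeline`. Setting `DRY_RUN=1` first prints the three commands without running them, which is a cheap way to check paths before spending teacher API budget.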
Browse and fork community Spirits. Every one ships with full Tasting Notes.
A Spirit is good for any task you'd otherwise hit an API for over and over. Fork a recipe, swap the catalog, distill.
needle.tool-calling
privacy.pii-guard
routing.intent
data.sql-parse (waitlist)
data.receipt-ner (waitlist)

Every bottled Spirit is a single self-contained artifact. Each part has a direct meaning in ML terms.
Recipe. Sealed, signed config that produced this Spirit. You can re-distill from it byte-for-byte.
Inference shape. Tokenizer, vocab size, max sequence length.
Bottling format. PyTorch, ONNX, or GGUF wrapper around the weights.
Trained weights. The actual model parameters. Darker = higher proof.
Tasting Notes. Name, vintage, batch, headline proof. Auto-generated, honest.

Every term means something concrete. Each maps to an ML concept, and that's deliberate.
The Distillery is open-source, MIT-licensed, and built solo. If you find it useful, here are three ways to help.
This is not a SaaS. The CLI is free, runs locally, uses your own teacher API key, and produces models you own. Nothing reports back to us. The "Cellar" is just a public showcase you can opt into.
Built on top of the Research Radar pipeline, an autonomous research-to-product system. The Spirit you're looking at (Needle) started life as a Show HN paper that the Radar surfaced last Tuesday.