~/distillery setup → hermes-agent
Configure Hermes to route Claude requests through The Distillery via the OpenAI adapter.
Overview
Hermes sends requests to `http://localhost:3080/v1/chat/completions`. The Distillery records token counts, distils context, then forwards to Anthropic. Your Anthropic API key is set in Hermes directly; The Distillery never sees it.
Why agents save more
A 10-step agentic session sends context forward on every step. At 40% distillation that's 20k tokens saved per run; multiply by daily runs and savings compound fast.
Prerequisites
You have The Distillery installed and running locally on port 3080. See the main install guide if you have not done this yet.
$ configure
step 01 / 02Set the OPENAI_BASE_URL environment variable before starting Hermes:
export OPENAI_BASE_URL=http://localhost:3080/v1
hermes startset OPENAI_BASE_URL=http://localhost:3080/v1
hermes start$ verify
step 02 / 02thedistillery stats$ troubleshoot
Evaluating other Claude Code cost-reduction approaches alongside Hermes? See how The Distillery compares to Claude Code Router (model-routing approach), Compresr (YC W26), or browse alternatives to Claude Code Router for the full landscape.
Pricing breakdown: thedistillery.com/pricing.