Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
Updated 2026-06-28 16:56:21 +00:00
🪓 Small always-on Go daemon that fronts one Ollama target — turns it into a queued, observable job endpoint (model-swap serialization, job IDs, progress webhooks). Speaks native Ollama on the wire, so it's a drop-in target for any Ollama client.
Updated 2026-06-27 00:33:42 +00:00