steve/foreman

Fork 0

T

steve bf005867b2

CI / Build & Test (push) Successful in 10m26s

Details

CI / Tidy (push) Successful in 9m34s

Details

CI / Publish Docker Image (push) Successful in 20s

Details

chore: gitignore local agent tooling dirs

Ignore .claude/ and .claude-mpm/ — local agent scratch dirs, not project files.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-26 20:33:39 -04:00

.gitea/workflows

ci: push container image to gitea registry on success

2026-05-23 19:36:43 -04:00

client

feat: add FOREMAN_KEEP_ALIVE config for worker model residency

2026-05-23 21:29:37 -04:00

cmd/foreman

feat: add FOREMAN_KEEP_ALIVE config for worker model residency

2026-05-23 21:29:37 -04:00

docs

docs: land prior ADR + prompt updates

2026-06-26 20:33:39 -04:00

internal

feat: add FOREMAN_KEEP_ALIVE config for worker model residency

2026-05-23 21:29:37 -04:00

prompts

docs: land prior ADR + prompt updates

2026-06-26 20:33:39 -04:00

scripts

chore: add deployment docs, model script, and finalize env config

2026-05-23 18:43:10 -04:00

.env.example

feat: add FOREMAN_KEEP_ALIVE config for worker model residency

2026-05-23 21:29:37 -04:00

.gitignore

chore: gitignore local agent tooling dirs

2026-06-26 20:33:39 -04:00

CLAUDE.md

docs: MIT license + public-readiness framing

2026-06-26 20:30:52 -04:00

Dockerfile

chore: add deployment docs, model script, and finalize env config

2026-05-23 18:43:10 -04:00

go.mod

feat: add durable queue, single worker, and drain-by-model scheduling

2026-05-23 18:29:32 -04:00

go.sum

feat: add durable queue, single worker, and drain-by-model scheduling

2026-05-23 18:29:32 -04:00

LICENSE

docs: MIT license + public-readiness framing

2026-06-26 20:30:52 -04:00

progress.md

chore: add deployment docs, model script, and finalize env config

2026-05-23 18:43:10 -04:00

README.md

docs: MIT license + public-readiness framing

2026-06-26 20:30:52 -04:00

README.md

foreman

🪓 A small, always-on Go daemon that fronts one Ollama target. It turns a single Ollama instance into a queued, observable job endpoint: it polls the target's installed models, serializes work through the target (managing model swaps), assigns every job an ID, and reports progress via webhooks.

On the wire it speaks native Ollama, so it doubles as a drop-in target for any Ollama client — including majordomo (via its ollama.Foreman(url, token) preset) and, through that, gadfly. Point a client at the foreman URL instead of the raw Ollama and you get queuing + model-swap serialization for free.

This is a public, vibe-coded project (built largely by an AI agent). It runs the author's homelab but is intentionally generic — one daemon, one target, one queue. Treat the homelab specifics in the docs as illustrative, and don't oversell it: it's a deliberately small queue in front of Ollama, not a distributed scheduler.

Quickstart

# Set the required Ollama target URL
export FOREMAN_OLLAMA_URL=http://mac.tail:11434

# Run directly
go run ./cmd/foreman serve

# Or build and run
go build -o foreman ./cmd/foreman
./foreman serve

Docker

docker build -t foreman .
docker run -e FOREMAN_OLLAMA_URL=http://mac.tail:11434 -p 8080:8080 foreman

Configuration

All configuration is via environment variables, namespaced under FOREMAN_*. See .env.example for the full list.

Variable	Default	Description
`FOREMAN_ADDR`	`:8080`	Listen address
`FOREMAN_OLLAMA_URL`	(required)	Ollama target base URL
`FOREMAN_OLLAMA_TOKEN`	(empty)	Bearer token sent to the target
`FOREMAN_TOKEN`	(empty)	Bearer token callers must present
`FOREMAN_EMBED_MODEL`	(empty)	Always-resident embedder model
`FOREMAN_DB_PATH`	`foreman.db`	SQLite database path
`FOREMAN_POLL_INTERVAL`	`30s`	Target model poll interval
`FOREMAN_WEBHOOK_SECRET`	(empty)	HMAC key for webhook signing

Health check

curl http://localhost:8080/healthz
# {"status":"ok","degraded":false}

Architecture

See docs/adr/ for design decisions. Key points:

One daemon per Ollama target (ADR-0001)
SQLite-backed durable job queue in WAL mode (ADR-0008)
Single worker loop with drain-by-model scheduling (ADR-0009)
Native Ollama passthrough + async /jobs surface (ADR-0003, ADR-0004)
Embeddings bypass the queue entirely (ADR-0013)

License

Description

🪓 Small always-on Go daemon that fronts one Ollama target — turns it into a queued, observable job endpoint (model-swap serialization, job IDs, progress webhooks). Speaks native Ollama on the wire, so it's a drop-in target for any Ollama client.

Readme MIT 244 KiB