steve
96c612e707
feat(llamaswap): add llama-swap provider + canonical imagegen interface
...
CI / Tidy (pull_request) Successful in 9m25s
CI / Build & Test (pull_request) Successful in 10m15s
Add provider/llamaswap, a tailored provider for llama-swap (the model-swapping
proxy over llama.cpp / stable-diffusion.cpp). Its chat path delegates to
provider/openai at {base}/v1 — no duplicated wire client (ADR-0007) — with
legacy max_tokens, a Bearer no-key placeholder for keyless local instances, and
a timeout-free client so cold model swaps rely on context deadlines. The
"tailored" surface is concrete management methods (ListModels / Running /
Unload) that don't belong on the canonical llm.Provider interface. The
llama-swap:// DSN scheme builds an http base URL (local-first); a no-URL
built-in errors clearly on use, mirroring foreman.
Add imagegen, a new canonical text-to-image interface separate from llm
(Request/Result/Model/Provider; Image = llm.ImagePart so generated images feed
straight back into chat). First backend is llama-swap via OpenAI
/v1/images/generations (b64_json, bytes-only). Re-exported from the root. v1 is
txt2img only.
Hermetic httptest coverage for chat delegation, management endpoints, image
decode, and scheme wiring. ADR-0015 + ADR-0016, README support matrix +
image-gen section, CLAUDE.md package map, and progress.md updated in the same
commit.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-27 15:01:54 -04:00
Steve Dudenhoeffer
3e81fbd540
docs: public-readiness — vibe-coded disclosure + genericize internal hosts
...
CI / Tidy (push) Successful in 9m39s
CI / Build & Test (push) Successful in 10m21s
- README + CLAUDE.md: upfront "this is a vibe-coded project" disclosure for
going public.
- Replace internal LAN hostnames (*.orgrimmar.dudenhoeffer.casa) with
example.com across README, ADR-0004, the envproviders example, and env_test.go
(assertions updated together; suite still green). Token was already a
"change-me" placeholder, not a real secret.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-25 19:25:58 -04:00
steve
0147a79d18
feat: conversion-driven extensions — resolvers, DefineTool, hooks, ops controls
...
CI / Tidy (push) Successful in 9m31s
CI / Build & Test (push) Successful in 10m13s
Phase 9a (ADR-0014): Registry.RegisterResolver for dynamic tiers;
DefineTool[Args] typed tools; Usage cache/reasoning detail fields wired
through anthropic/openai/google; WithPromptCaching (Anthropic
cache_control); agent supervision hooks (WithMaxStepsFunc, WithSteer,
WithCompactor, WithToolErrorLimits + ErrToolLoop); health
Bench/Unbench/Snapshot; ChainConfig.Observer failover events.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 13:30:06 +02:00
steve
97513141dc
docs: examples for every hard requirement + mort migration blueprint
...
Phase 7: nine runnable examples/ programs (parse, failover chains with
trailing alias, tiers, LLM_* env providers, multimodal, tool loop,
Generate[T], agent, skills); docs/mort-migration.md mapping mort's
go-llm/go-agentkit usage onto majordomo APIs with the planned additive
library extensions and conversion order; README finalized with the
complete matrix.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 13:17:20 +02:00
steve
76ecf0e49e
feat: skills — additive instruction+tool bundles, clock + calc examples
...
Phase 6: skill.New constructor satisfying the agent.Skill contract;
instruction-only skills; ordered additive composition; skill/clock
(injectable-clock time tools) and skill/calc (recursive-descent arithmetic
evaluator) as ready-made examples with full test suites incl. an
agent-loop round trip. ADR-0013; README skills section + matrix synced.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 13:13:07 +02:00
steve
7dab4112ff
feat: agent run loop, Generate[T], reflect-derived schemas
...
Phase 5:
- agent/: model + system prompt + toolboxes composition; bounded
tool-dispatch loop (default 10 steps); panic-proof tool execution;
unknown-tool and duplicate-name handling; history continuation; step
observers; partial results on ErrMaxSteps/errors (ADR-0012)
- llm.SchemaFor[T]: strict-compatible JSON schemas from Go types
(nullable pointers, description/enum tags, recursion rejected)
- majordomo.Generate[T]: typed structured output with fence-stripping
decode and model-naming errors
- README agents/structured-output sections + matrix synced
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 13:10:18 +02:00
steve
1ca607906d
feat: Google (Gemini) provider on the official Gen AI SDK
...
Phase 4: provider/google on google.golang.org/genai v1.59.0 — lazy cached
client, FunctionResponse tool loop, raw-JSON-schema tools and structured
output, ThinkingLevel reasoning mapping, iter.Pull2 streaming, hermetic
httptest suite via HTTPOptions.BaseURL. Registry wires google + gemini
schemes to the real client; stub machinery deleted (all built-ins real).
ADR-0011; README matrix + CLAUDE.md synced.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 13:04:28 +02:00
steve
043249e0e1
feat: OpenAI, Anthropic, and native-Ollama providers + media pipeline
...
Phase 3:
- provider/openai: Chat Completions for OpenAI + compat endpoints (SSE
streaming with by-index tool-call assembly, response_format json_schema,
legacy max_tokens option, reasoning_effort)
- provider/anthropic: Messages API (tool_use/tool_result, GA structured
output via output_config.format, full SSE event parser, 529 transient)
- provider/ollama: one native /api/chat client behind the ollama,
ollama-cloud, and foreman built-ins (presets; NDJSON streaming tolerant
of foreman's buffered single-object responses; object tool arguments;
format-schema structured output; think mapping)
- media/: capability normalization (sniff, downscale, transcode, byte
ladder, ErrUnsupported), wired into the chain executor per target with
penalty-free advance past incapable elements
- registry: real provider + scheme wiring, WithHTTPClient option, required
env-foreman TLS chat round-trip test
- ADR-0009 multimodal strategy, ADR-0010 tools/structured mapping; README
matrix + CLAUDE.md synced
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 12:58:08 +02:00
steve
dcd004289f
feat: foundations — canonical types, Parse grammar, env DSNs, health, chains
...
Phase 1 of the majordomo build:
- llm/ canonical contract (messages, parts, tools, capabilities, streaming,
Model/Provider, error classification)
- health/ clock-injected tracker (threshold bench, exponential capped
cooldown, reset-on-success)
- root Registry + Parse (verbatim model ids, inline recursive alias
expansion with cycle detection, chain dedup), LLM_* env-DSN providers
(go-llm parity: lazy fallback + eager LoadEnv), health-aware chain
executor behind the Model interface
- provider/fake scriptable test provider; hermetic test suite incl. the
trailing-thinking chain and foreman:// env loading
- ADRs 0001-0008, CLAUDE.md, README (honest matrix), CI workflow,
docs/phase-1-design.md
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com >
2026-06-10 12:35:34 +02:00