T

executus CI / test (pull_request) Successful in 45s

Details

fix(run): address gadfly review of the checkpoint PR

Real findings from the consensus review (44 raw; heavy devstral noise):

- finalizeCheckpoint is now fired from the top-of-Run defer, so it runs on
  EVERY exit: a panic, an early build-error return (before the run loop), AND
  normal completion. Previously an early return on a recovered run left its
  durable record unfinalized → boot recovery would retry it forever on a
  persistent build error. (opus + glm)
- Removed the dead ActivePhase field from run.RunCheckpointState +
  run.ResumeState (and the battery RunCheckpoint) — phase recovery is
  boundary-granular (skip completed phases; the interrupted phase re-runs from
  its start), so ActivePhase was never written nor read. Docs across
  ports/checkpoint/phases now state this plainly (5-model consensus that the
  field + docs over-promised mid-phase resume).
- CheckpointerFactory.Begin error is now logged (WARN) before degrading to
  non-durable, per the documented contract (was silently swallowed). (4 models)
- finalizeCheckpoint logs Complete/Fail errors (was silent).
- Resume phase-skip now keys off a SEPARATE resumeSkip set, not the live
  outputs map — a fresh run with two same-named phases no longer skips the
  second (the outputs map fills as phases run). (opus:max) + regression test.
- Removed the dead checkpoint.factory.now field (never set). (opus + glm)
- Fixed the stale phaseDeps doc (the step observer moved out of sharedOpts to
  per-path). Hoisted the resume guard to a local; dropped the wasted acc
  allocation on the resume path; documented that Save throttling is the
  Checkpointer's responsibility and the accumulated transcript is pre-compaction
  (host size-caps it).

Note (carried from the PR): classifyCheckpointOutcome keys shutdown on
run.ErrShutdown; mort stamps its own runengine.ErrShutdown — the mort wiring PR
aliases them so errors.Is matches.

New test: duplicate phase names both run on a fresh run. Full ./... green.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-29 16:34:42 -04:00

.gitea/workflows

chore: repin gadfly reusable to @5007597 (structured findings + consensus + inline review)

2026-06-28 22:13:24 -04:00

audit

fix: address verified gadfly P4/#4 findings (audit/budget/persona)

2026-06-27 00:12:19 -04:00

budget

fix: address verified gadfly P4/#4 findings (audit/budget/persona)

2026-06-27 00:12:19 -04:00

checkpoint

fix(run): address gadfly review of the checkpoint PR

2026-06-29 16:34:42 -04:00

compact

fix: address verified gadfly P2 findings (9 real of 18)

2026-06-27 02:02:21 +00:00

config

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

contrib/store

P4b: skill noun + contrib/store (SQLite for budget/persona/skill/audit)

2026-06-27 00:15:00 -04:00

critic

run: critic parity — fuller RecordStep + cause-carrying Kill (distinct status)

2026-06-27 16:35:13 -04:00

deliver

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

dispatchguard

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

examples

fix: address verified gadfly P5 findings (canary robustness)

2026-06-27 00:34:01 -04:00

fanout

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

identity

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

lane

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

llmmeta

fix: address gadfly P1 review (3 low-risk findings)

2026-06-27 02:02:21 +00:00

model

fix: address verified gadfly P2 findings (9 real of 18)

2026-06-27 02:02:21 +00:00

pendingattach

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

persona

fix: address verified gadfly P4/#4 findings (audit/budget/persona)

2026-06-27 00:12:19 -04:00

run

fix(run): address gadfly review of the checkpoint PR

2026-06-29 16:34:42 -04:00

schedule

P4c: remaining batteries — checkpoint + schedule + critic

2026-06-27 00:15:32 -04:00

skill

P4b: skill noun + contrib/store (SQLite for budget/persona/skill/audit)

2026-06-27 00:15:00 -04:00

tool

fix(run): harden input-file staging per gadfly #18 validation pass

2026-06-28 14:08:57 -04:00

tools

fix: address verified gadfly P3 review (3-cloud fleet)

2026-06-27 00:11:54 -04:00

.gitignore

P0: stand up executus harness module above majordomo

2026-06-26 19:18:37 -04:00

CLAUDE.md

C0b: wire Critic + Delivery into run.Executor

2026-06-27 10:00:05 -04:00

go.mod

P4b: skill noun + contrib/store (SQLite for budget/persona/skill/audit)

2026-06-27 00:15:00 -04:00

go.sum

chore: go mod tidy (add missing go.sum entry; CI tidiness gate)

2026-06-27 14:53:58 -04:00

README.md

fix(run): harden input-file staging per gadfly #18 validation pass

2026-06-28 14:08:57 -04:00

README.md

executus

⚠️ This project is vibe-coded. executus is written almost entirely by an AI coding agent (Claude), with a human steering at the design and review level rather than typing the code. That's a deliberate choice, stated up front — the same way gadfly is. Read the code before you depend on it, pin a version, and file issues if something looks off. It is offered as-is.

A batteries-included base for building LLM agent harnesses in Go. Import it, do a little wiring, and you have agentic capabilities: a bounded run loop, a tool registry with a suite of common tools, context compaction, config-driven model tiering and failover, structured output, and parallel fan-out — with sensible defaults so a brand-new project is agentic with almost no setup, and pluggable seams so a serious host can swap in its own storage, config, delivery, and tools.

executus sits strictly above majordomo — the lean LLM substrate (agent loop, canonical llm types, providers, media normalization, model parsing / failover / tiering). majordomo stays the substrate; executus is the opinionated, batteries-included layer on top. executus requires no changes to majordomo.

Status

Early. Being extracted, phase by phase, from the agent layer of mort (a Discord bot) — mort and gadfly are the first two consumers (heavy and light). See CLAUDE.md for the architecture and the extraction roadmap (P0–P6).

Available today:

run/ — executus is runnable. run.Executor ties model resolution, the tool registry, majordomo's agent loop, context compaction, run-bounding, and step/audit instrumentation into one Run(ctx, RunnableAgent, inv) Result, with every host concern behind a nil-safe run.Ports (Audit/Budget/Critic/ Checkpointer/PaletteSource/Delivery/InputFiles). See examples/minimal.
model/ — config-driven tier resolution + failover over majordomo, with pluggable UsageSink/TraceSink and GenerateWith[T] structured output.
tool/ — the tool registry + 3-stage permission model + SSRF guard.
compact/ — the per-run context compactor.
lane/ — bounded worker pool with fair-share queueing (run- and provider-concurrency).
fanout/ — programmatic N×M swarm with bounded global + per-key concurrency.
config/, deliver/, identity/ — host seams (config / output / identity), each with a shipped default.
dispatchguard/, pendingattach/ — run-safety primitives.
examples/reviewer — a gadfly-shaped PR reviewer on the core only (env-config model fleet → fanout N×M swarm → model.GenerateWith[T] structured findings → consolidation), the light-tier canary; CI asserts it pulls in no battery.

Design

Two tiers in one module (go.mod = majordomo + stdlib only):

Core — everything a light host needs to be agentic: run loop, tool registry + common tools, model resolution, compaction, lanes, fan-out, structured output. No persistence, no scheduling.
Batteries (opt-in sibling packages) — persona/agent nouns, saved skills, audit, run-critic, scheduling, budgets, checkpointing. Each is nil-safe and ships a default, so you add only what you use.

Persistence that needs a real database lives in a separate nested module (contrib/store, pure-Go SQLite) so the core never drags in a DB driver — a static-binary host (gadfly) stays static.

License

TBD.

README.md Unescape Escape

executus

Status

Design

License

README.md