go-llm

Author	SHA1	Message	Date
steveandClaude Opus 4.6	34b2e29019	feat(v2/anthropic): apply cache_control markers from CacheHints buildRequest now tracks a source-index → built-message-index mapping during the role-merge pass, then uses the mapping to attach cache_control: {type: ephemeral} markers at the positions indicated by Request.CacheHints. The last tool, the last system part, and the last non-system message each get a marker when the corresponding hint is set. Covers the merge-induced index drift that would otherwise cause the breakpoint to land on the wrong content block when consecutive same-role source messages are combined into a single Anthropic message with multiple content blocks. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:33:25 +00:00
steveandClaude Opus 4.6	4c6dfb9058	test(v2/anthropic): drop placeholder import sentinel from cache_test.go Removes the blank-assign workaround that was only needed because the anth import was being kept alive for Task 5's use. Task 5 will bring the import back when it actually references anth.CacheControlTypeEphemeral. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:29:14 +00:00
steveandClaude Opus 4.6	a6b5544674	refactor(v2/anthropic): use MultiSystem for system prompts Switches buildRequest to emit anthReq.MultiSystem instead of anthReq.System whenever a system message is present. Upstream's MarshalJSON prefers MultiSystem when non-empty, so the wire format is unchanged for requests without cache_control. This refactor is a prerequisite for attaching cache_control markers to system parts in the next commit. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:26:55 +00:00
steveandClaude Opus 4.6	01b18dcf32	test(v2): cover empty-messages and disabled-but-non-nil cacheConfig edges Adds two boundary tests suggested by code review: - TestBuildProviderRequest_CachingEnabled_EmptyMessages: verifies that caching with an empty message list still emits a CacheHints with LastCacheableMessageIndex=-1, not a spurious breakpoint. - TestBuildProviderRequest_CachingNonNilButDisabled: verifies that an explicitly-disabled cacheConfig (non-nil, enabled=false) produces nil CacheHints, exercising the &&-guard branch that the previous "disabled" test left untested. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:24:55 +00:00
steveandClaude Opus 4.6	4b401fcc0d	feat(v2): populate CacheHints on provider.Request when caching enabled CI / Lint (push) Successful in 9m36s Details CI / Root Module (push) Successful in 10m55s Details CI / V2 Module (push) Successful in 11m14s Details buildProviderRequest now computes cache-breakpoint positions automatically when the WithPromptCaching() option is set. It places up to 3 hints: tools, system, and the index of the last non-system message. Providers that don't support caching (OpenAI, Google) ignore the field. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:22:00 +00:00
steveandClaude Opus 4.6	c4fe0026a2	feat(v2): add WithPromptCaching() request option CI / Lint (push) Failing after 2m2s Details CI / V2 Module (push) Failing after 2m3s Details CI / Root Module (push) Has been cancelled Details Introduces an opt-in RequestOption that callers can pass to enable automatic prompt-caching markers. The option populates a cacheConfig on requestConfig but has no effect yet — plumbing through to provider.Request and on to the Anthropic provider lands in subsequent commits. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:17:55 +00:00
steveandClaude Opus 4.6	b4bf73136a	feat(v2/provider): add CacheHints to Request for prompt caching Adds an optional CacheHints field on provider.Request that carries cache-breakpoint placement directives from the public llm package down to individual provider implementations. Anthropic will consume these in a follow-up commit; OpenAI and Google ignore them. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-04-09 19:14:44 +00:00
steveandClaude Opus 4.6	5b687839b2	feat: comprehensive token usage tracking for V2 CI / Lint (pull_request) Successful in 10m18s Details CI / Root Module (pull_request) Successful in 11m4s Details CI / V2 Module (pull_request) Successful in 11m5s Details Add provider-specific usage details, fix streaming usage, and return usage from all high-level APIs (Chat.Send, Generate[T], Agent.Run). Breaking changes: - Chat.Send/SendMessage/SendWithImages now return (string, Usage, error) - Generate[T]/GenerateWith[T] now return (T, Usage, error) - Agent.Run/RunMessages now return (string, *Usage, error) New features: - Usage.Details map for provider-specific token breakdowns (reasoning, cached, audio, thoughts tokens) - OpenAI streaming now captures usage via StreamOptions.IncludeUsage - Google streaming now captures UsageMetadata from final chunk - UsageTracker.Details() for accumulated detail totals - ModelPricing and PricingRegistry for cost computation Closes #2 Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-03-02 04:33:18 +00:00
steveandClaude Opus 4.6	7e1705c385	feat: add audio input support to v2 providers CI / Lint (push) Successful in 9m37s Details CI / Root Module (push) Successful in 10m53s Details CI / V2 Module (push) Successful in 11m9s Details Add Audio struct alongside Image for sending audio attachments to multimodal LLMs. OpenAI uses input_audio content parts (wav/mp3), Google Gemini uses genai.NewPartFromBytes, and Anthropic skips audio gracefully since it's not supported. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-08 21:00:56 -05:00
steveandClaude Opus 4.6	fc2218b5fe	Add comprehensive test suite for sandbox package (78 tests) CI / Lint (push) Successful in 9m35s Details CI / V2 Module (push) Successful in 10m39s Details CI / Root Module (push) Successful in 11m2s Details Expanded from 22 basic tests to 78 tests covering error injection, task polling, IP discovery, context cancellation, HTTP error codes, concurrent access, SSH lifecycle, and request verification. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-08 01:10:59 -05:00
steveandClaude Opus 4.6	23c9068022	Add sandbox package for isolated Linux containers via Proxmox LXC CI / V2 Module (push) Successful in 11m46s Details CI / Root Module (push) Successful in 11m50s Details CI / Lint (push) Successful in 9m28s Details Provides a complete lifecycle manager for ephemeral sandbox environments: - ProxmoxClient: thin REST wrapper for container CRUD, IP discovery, internet toggle - SSHExecutor: persistent SSH/SFTP for command execution and file transfer - Manager/Sandbox: high-level orchestrator tying Proxmox + SSH together - 22 unit tests with mock Proxmox HTTP server - Proxmox setup & hardening guide (docs/sandbox-setup.md) Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-08 00:47:45 -05:00
steveandClaude Opus 4.6	87ec56a2be	Add agent sub-package for composable LLM agents CI / Lint (push) Successful in 9m46s Details CI / V2 Module (push) Successful in 12m5s Details CI / Root Module (push) Successful in 12m6s Details Introduces v2/agent with a minimal API: Agent, New(), Run(), and AsTool(). Agents wrap a model + system prompt + tools. AsTool() turns an agent into a llm.Tool, enabling parent agents to delegate to sub-agents through the normal tool-call loop — no channels, pools, or orchestration needed. Also exports NewClient(provider.Provider) for custom provider integration. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-07 23:17:19 -05:00
steveandClaude Opus 4.6	be572a76f4	Add structured output support with Generate[T] and GenerateWith[T] CI / Lint (push) Successful in 9m35s Details CI / V2 Module (push) Successful in 11m43s Details CI / Root Module (push) Successful in 11m53s Details Generic functions that use the "hidden tool" technique to force models to return structured JSON matching a Go struct's schema, replacing the verbose "tool as structured output" pattern. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-07 22:36:33 -05:00
steveandClaude Opus 4.6	6a7eeef619	Add comprehensive test suite for v2 module with mock provider CI / Lint (push) Successful in 9m36s Details CI / V2 Module (push) Successful in 11m33s Details CI / Root Module (push) Successful in 11m35s Details Cover all core library logic (Client, Model, Chat, middleware, streaming, message conversion, request building) using a configurable mock provider that avoids real API calls. ~50 tests across 7 files. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-07 22:00:49 -05:00
steveandClaude Opus 4.6	9e288954f2	Add transcription API to v2 module CI / Lint (push) Failing after 5m0s Details CI / Root Module (push) Failing after 5m3s Details CI / V2 Module (push) Successful in 10m48s Details Migrate speech-to-text transcription types and OpenAI transcriber implementation from v1. Types are defined in provider/ to avoid import cycles and re-exported via type aliases from the root package. Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-07 20:24:20 -05:00
steveandClaude Opus 4.6	a4cb4baab5	Add go-llm v2: redesigned API for simpler LLM abstraction v2 is a new Go module (v2/) with a dramatically simpler API: - Unified Message type (no more Input marker interface) - Define[T] for ergonomic tool creation with standard context.Context - Chat session with automatic tool-call loop (agent loop) - Streaming via pull-based StreamReader - MCP one-call connect (MCPStdioServer, MCPHTTPServer, MCPSSEServer) - Middleware support (logging, retry, timeout, usage tracking) - Decoupled JSON Schema (map[string]any, no provider coupling) - Sample tools: WebSearch, Browser, Exec, ReadFile, WriteFile, HTTP - Providers: OpenAI, Anthropic, Google (all with streaming) Co-Authored-By: Claude Opus 4.6 <[email protected]>	2026-02-07 20:00:08 -05:00