llama-swap

Author	SHA1	Message	Date
Benson Wong	b5fde8eb6d	proxy,ui-svelte: add request/response capturing (#508) Save request and response headers and bodies that pass through llama-swap in memory. - captureBuffer added to configuration. Captures are enabled by default. - 5MB of memory is allocated for request/response captures in a ring buffer. Setting captureBuffer to 0 will disable captures. - UI elements to view captured data added to Activity page. Includes some QOL features like JSON formatting and recombining SSE chat streams - capture saving is done at the byte level and has minimal impact on llama-swap performance Fixes #464 Ref #503	2026-02-07 15:40:01 -08:00
Benson Wong	cdea7d16bd	proxy/config: skip env macros in YAML comment lines (#496) Fix a bug where ${env.macro_not_exist} in comments would trigger a non-substituted macro error. Fixes #495	2026-01-30 20:10:29 -08:00
Benson Wong	4e850c2834	config: refactor macro substitution in configuration (#470) This commit simplifies substitution of environment variables into the configuration. There was a lot of repetitive code substituting ${env.VAR_NAME} into different fields after the configuration was parsed into a config.Config. This refactor uses a string substitution of env vars into the YAML config before it is fully parsed. This eliminates a lot of logic while maintaining backwards compatibility.	2026-01-18 21:52:34 -08:00
Benson Wong	75fced579e	config: support macros in peer apiKey and filters (#469) * config: support environment variable macros in peer apiKeys Add ${env.VAR_NAME} substitution for peer apiKey fields, consistent with existing env macro support for model fields and global apiKeys. - Add env macro substitution for peers.{name}.apiKey in LoadConfigFromReader - Add tests for peer apiKey env substitution - Update config.example.yaml to show env macro usage * config: support macros in peer apiKey and filters Extend macro substitution to peer configuration fields: - peers.{name}.apiKey supports both global macros and env macros - peers.{name}.filters.stripParams supports both macro types - peers.{name}.filters.setParams supports both macro types Also renamed validateMetadataForUnknownMacros to validateNestedForUnknownMacros for reuse across model metadata and peer filters validation.	2026-01-16 23:10:50 -08:00
Benson Wong	8f2137c72b	config: support environment variable macros in apiKeys (#467) Add substituteEnvMacros support for apiKeys configuration field, allowing API keys to be loaded from environment variables using the ${env.VAR_NAME} syntax. - Apply env macro substitution before validation - Add tests for env macro substitution in apiKeys	2026-01-16 22:41:14 -08:00
Benson Wong	124007cc98	config: add environment variable macros (#466) * config: add environment variable macros Add support for ${env.VAR_NAME} syntax to pull values from system environment variables during config loading. - env macros processed before regular macros (allows macros to reference env vars) - works in cmd, cmdStop, proxy, checkEndpoint, filters.stripParams, metadata - returns error if env var is not set - add comprehensive tests fixes #462 * docs: add env macro example to config.example.yaml	2026-01-16 22:25:20 -08:00
Benson Wong	eb5bfff0b0	proxy: unify filtering for local models and peers This unifies the filtering capabilities for models and peers: - stripParams: removes params from the request - setParams: sets params in the request. Fixes #453	2026-01-15 18:59:43 -08:00
Benson Wong	22e098ac8b	Add Peer Model Support (#438) This PR allows a single llama-swap to be the central proxy for models served by other inference servers. The peer servers can be another llama-swap or any API that supports the /v1/* inference endpoint. Updates: #433, #299 Closes: #296	2025-12-27 20:18:06 -08:00
Benson Wong	53b32f3601	proxy: add API key support (#436) Add configuration support for API keys that are enforced by llama-swap. Keys are stripped before sending them to upstream servers. Updates: #433, #50 and #251	2025-12-23 23:39:33 -08:00
Benson Wong	565c44766d	config,proxy: add new configuration logToStdout (#432) The new logToStdout option controls what is logged to stdout. The default has been changed to just the proxy logs, which contain swap and http request logs. There are four supported settings: none, proxy, upstream, both. The "both" setting is the legacy setting where everything was spewed to stdout.	2025-12-21 22:23:31 -08:00
Ryan Steed	3acace810f	proxy: add configurable logging timestamp format (#401) Introduces a new configuration option, logTimeFormat, that allows customizing the timestamp in log messages using Go's built-in time format constants. The default remains no timestamp.	2025-11-16 10:21:59 -08:00
Ryan Steed	554d29e87d	feat: enhance model listing to include aliases (#400) Introduce includeAliasesInList as a new configuration setting (default false) that includes aliases in /v1/models. Fixes #399	2025-11-15 14:35:26 -08:00
Benson Wong	a89b803d4a	Stream loading state when swapping models (#371) Swapping models can take a long time and leave a lot of silence while the model is loading. Rather than silently load the model in the background, this PR allows llama-swap to send status updates in the reasoning_content of a streaming chat response. Fixes: #366	2025-10-29 00:09:39 -07:00
David Wen Riccardi-Zhu	d58a8b85bf	Refactor to use httputil.ReverseProxy (#342) * Refactor to use httputil.ReverseProxy Refactor manual HTTP proxying logic in Process.ProxyRequest to use the standard library's httputil.ReverseProxy. * Refactor TestProcess_ForceStopWithKill test Update to handle behavior with httputil.ReverseProxy. * Fix gin interface conversion panic	2025-10-13 16:47:04 -07:00
Benson Wong	00b738cd0f	Add Macro-In-Macro Support (#337) Add full macro-in-macro support so any user-defined macro can contain another one as long as it was previously declared in the configuration file. Fixes #336 Supersedes #335	2025-10-06 22:57:15 -07:00
Benson Wong	70930e4e91	proxy: add support for user-defined metadata in model configs (#333) Changes: - add Metadata key to ModelConfig - include metadata in /v1/models under meta.llamaswap key - add recursive macro substitution into Metadata - change macros at global and model level to be any scalar type Note: This is the first mostly AI-generated change to llama-swap. See #333 for notes about the workflow and approach to AI going forward.	2025-10-04 19:56:41 -07:00
Benson Wong	1f6179110c	proxy/config: add model level macros (#330) * proxy/config: add model level macros Add macros to model configuration. Model macros override macros that are defined at the global configuration level. They follow the same naming and value rules as the global macros. * proxy/config: fix bug with macro reserved name checking The PORT reserved name was not properly checked * proxy/config: add tests around model.filters.stripParams - add check that model.filters.stripParams has no invalid macros - renamed strip_params to stripParams for camel case consistency - add legacy code compatibility so model.filters.strip_params continues to work * proxy/config: add duplicate removal to model.filters.stripParams * clean up some doc nits	2025-09-28 23:32:52 -07:00
Benson Wong	216c40b951	proxy/config: create config package and migrate configuration (#329) * proxy/config: create config package and migrate configuration The configuration is becoming more complex as llama-swap adds more advanced features. This commit moves config to its own package so it can be developed independently of the proxy package. Additionally, enforcing a public API for a configuration will allow downstream usage to be more decoupled.	2025-09-28 16:50:06 -07:00

18 Commits
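
Several of the commits above (124007cc98, 4e850c2834, cdea7d16bd) describe substituting ${env.VAR_NAME} macros into the raw YAML text before it is parsed, returning an error when a variable is not set, and skipping YAML comment lines. The following Go sketch illustrates that behavior under stated assumptions; it is not llama-swap's actual code. The function name substituteEnv, the lookup parameter, the regular expression for valid variable names, and the rule that only full-line comments are skipped are all hypothetical choices made for illustration.

```go
package main

import (
	"fmt"
	"os"
	"regexp"
	"strings"
)

// envMacroRe matches ${env.VAR_NAME}. The set of characters allowed in
// VAR_NAME is an assumption, not taken from llama-swap.
var envMacroRe = regexp.MustCompile(`\$\{env\.([A-Za-z_][A-Za-z0-9_]*)\}`)

// substituteEnv is a hypothetical helper: it replaces env macros in raw YAML
// text line by line, leaves full-line comments untouched, and fails on the
// first macro whose variable cannot be resolved by lookup.
func substituteEnv(yamlText string, lookup func(string) (string, bool)) (string, error) {
	lines := strings.Split(yamlText, "\n")
	for i, line := range lines {
		// Skip lines that are entirely a YAML comment.
		if strings.HasPrefix(strings.TrimSpace(line), "#") {
			continue
		}
		missing := ""
		lines[i] = envMacroRe.ReplaceAllStringFunc(line, func(m string) string {
			name := envMacroRe.FindStringSubmatch(m)[1]
			val, ok := lookup(name)
			if !ok {
				if missing == "" {
					missing = name
				}
				return m // leave the macro in place; the error is reported below
			}
			return val
		})
		if missing != "" {
			return "", fmt.Errorf("line %d: environment variable %q is not set", i+1, missing)
		}
	}
	return strings.Join(lines, "\n"), nil
}

func main() {
	os.Setenv("DEMO_API_KEY", "sk-demo")
	cfg := "# key comes from ${env.NOT_SET}\napiKeys:\n  - ${env.DEMO_API_KEY}\n"
	out, err := substituteEnv(cfg, os.LookupEnv)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Print(out)
}
```

In this sketch the macro inside the comment line is left as written and does not raise an error, while the macro under apiKeys is replaced with the value of DEMO_API_KEY. Trailing comments on the same line as a value are not detected here; handling them correctly would require awareness of YAML quoting, which this example deliberately leaves out.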