-
f6cf9f5844
proxy: Refactor tests (#660)
Benson Wong
2026-04-16 22:47:42 -07:00
-
121fd93ad8
Makefile: restore linux arm64 targets
Benson Wong
2026-04-14 22:05:39 -07:00
-
17233e9278
docs: update configuration.md for matrix
v202
Benson Wong
2026-04-14 22:01:03 -07:00
-
4866d16c3e
README.md: update to use matrix instead of groups
Benson Wong
2026-04-14 21:57:49 -07:00
-
35193f82f1
proxy: add swap matrix with solver-based model swapping (#646)
Benson Wong
2026-04-14 21:55:30 -07:00
-
40e39f7a86
ui-svelte: fix security issues (#649)
Benson Wong
2026-04-12 16:21:31 -07:00
-
a9d840ffd7
proxy,proxy/config: restore timeouts to pre PR 619 (#648)
v201
Benson Wong
2026-04-11 20:42:13 -07:00
-
7b2b82777f
docker/unified: derive rootless image from root container (#644)
Benson Wong
2026-04-10 22:59:54 -07:00
-
d87f0ce2c5
docker/unified: publish rootless image variant (#630)
v200
Benson Wong
2026-04-07 03:05:53 -07:00
-
06bc6a614c
proxy: preserve wall-clock duration in metrics (#629)
Leoy
2026-04-07 16:52:41 +08:00
-
a37b4866d8
proxy: add configurable HTTP timeouts for models and peers (#619)
Ron M
2026-04-06 04:30:27 -07:00
-
981910d734
ci: validate config.example.yaml against config-schema.json (#627)
Benson Wong
2026-04-05 15:17:57 +08:00
-
a185efe37e
docker: make CMAKE_CUDA_ARCHITECTURES configurable via build arg (#625)
Benson Wong
2026-04-04 08:49:59 +08:00
-
1dd1aadf93
docker/unified: add ik_llama.cpp to CUDA container (#620)
Benson Wong
2026-04-03 15:16:30 +08:00
-
955900972a
add /sdapi to list of supported endpoints
Benson Wong
2026-04-01 12:01:38 +08:00
-
c2c8cfaf81
docker/unified: build llama.cpp with static libraries (#616)
Benson Wong
2026-04-01 03:38:07 +08:00
-
1e440770ea
ci: fix matrix exclude for scheduled docker workflow (#610)
Benson Wong
2026-03-29 20:04:28 +09:00
-
c794273c83
docker/unified,.github: fix unified build (#606)
Benson Wong
2026-03-27 10:31:12 +09:00
-
6574a52cbb
build(deps): bump picomatch from 4.0.3 to 4.0.4 in /ui-svelte (#605)
dependabot[bot]
2026-03-26 22:28:24 +09:00
-
8fabc75634
docker/unified: vulkan build fixes (#600)
v199
Benson Wong
2026-03-25 23:26:13 +09:00
-
e5e7391b6d
.github,docker/unified: include vulkan build (#599)
Benson Wong
2026-03-25 06:58:28 +09:00
-
2c282dccad
.github,docker/unified: improve caching and fix bugs (#598)
Benson Wong
2026-03-23 22:24:40 +09:00
-
916d13f5bd
.github/workflows,docker/unified: add cuda based unified container (#597)
Benson Wong
2026-03-22 21:11:54 +09:00
-
a3725e7d09
Update go.mod to 1.26.1 (#593)
Benson Wong
2026-03-20 16:09:58 +09:00
-
15bd55d3a9
proxy, ui-svelte: add /sdapi/v1 endpoint support (#587)
Benson Wong
2026-03-19 22:08:31 +09:00
-
c3c258a55d
proxy: fix metrics capture for v1/responses (#586)
v198
Benson Wong
2026-03-13 16:50:12 -07:00
-
29a38fde0d
ui-svelte: upgrade to vite 8 (#585)
Benson Wong
2026-03-13 08:45:59 -07:00
-
d569681daa
Change model sorting to natural order (#582)
tesuri
2026-03-12 23:49:34 +09:00
-
24efdb76b1
config: add macro support for name and description fields (#578)
Benson Wong
2026-03-10 08:27:05 -07:00
-
cc77139ff8
proxy,proxy/config: add global TTL feature (#554)
v197
Benson Wong
2026-03-01 21:02:12 -08:00
-
390a35bf93
ui-svelte: add copy button to markdown code blocks (#537)
Benson Wong
2026-03-01 09:48:56 -08:00
-
181f71ca11
.github,docker: add cuda13 architecture support (#551)
pdscomp
2026-03-01 12:37:08 -05:00
-
49546e2cf2
ui: fix text size svg
v196
Benson Wong
2026-02-27 23:47:28 -08:00
-
2c078964f4
Update README with additional images
Benson Wong
2026-02-27 23:45:40 -08:00
-
175bb36fb1
Revise README description for clarity and detail
Benson Wong
2026-02-27 23:42:40 -08:00
-
aedb640471
Enhance web UI section in README
Benson Wong
2026-02-27 23:40:31 -08:00
-
2f377f6dc6
ui: add OGG audio format support to transcription playground (#544)
Benson Wong
2026-02-26 19:48:19 -08:00
-
64e4c79fc3
ui: add Rerank tab to playground (#536)
v195
Benson Wong
2026-02-21 21:59:14 -08:00
-
19fb5f35e9
proxy: implement setParamsByID filter (#535)
v194
Benson Wong
2026-02-19 22:21:10 -08:00
-
b45102bde8
ui: smart auto-scroll in LogPanel (#530)
v193
Benson Wong
2026-02-18 19:47:37 -08:00
-
1688bdd1e9
proxy, ui: add pending requests count to the main dashboard (#516)
v192
Brian Mendonca
2026-02-16 09:41:15 -08:00
-
d33d51fa75
.coderabbit.yaml,AGENTS.md: small tweaks
v191
Benson Wong
2026-02-15 21:31:30 -08:00
-
e3bf065574
ui: persist playground state across route navigation (#525)
Benson Wong
2026-02-15 21:30:52 -08:00
-
3e52144058
ui-svelte: incremental rendering of chat messages in the Playground (#520)
Benson Wong
2026-02-15 11:00:44 -08:00
-
d5e52d7d00
build: disable provenance attestations in container builds (#523)
Benson Wong
2026-02-14 10:23:08 -08:00
-
17e5263a76
.github/workflows: fix expired token in publishing images (#522)
Benson Wong
2026-02-14 10:06:05 -08:00
-
8d6d949ec3
proxy: support timings for /infill from llama-server (#510)
Benson Wong
2026-02-07 17:16:27 -08:00
-
4b4ee70154
proxy: support timings for /infill from llama-server
v190
Benson Wong
2026-02-07 16:54:02 -08:00
-
-
b5fde8eb6d
proxy,ui-svelte: add request/response capturing (#508)
Benson Wong
2026-02-07 15:40:01 -08:00
-
7eef5defb8
docs: add stable-diffusion.cpp references (#506)
Nuno
2026-02-05 05:20:39 +01:00
-
bc01e6f539
build: add stable-diffusion server to musa and vulkan container images (#504)
v189
Benson Wong
2026-02-01 16:17:26 -08:00
-
0462e3dc3f
Reorganize UI controls and improve form interactions (#500)
Benson Wong
2026-02-01 15:18:22 -08:00
-
7b20fc011b
Add path filters to CI workflows and create UI test workflow (#501)
Benson Wong
2026-02-01 15:11:49 -08:00
-
20738f3623
proxy,ui-svelte: replace old UI with svelte+playground
v188
Benson Wong
2026-01-31 22:49:13 -08:00
-
cdea7d16bd
proxy/config: skip env macros in YAML comment lines (#496)
Benson Wong
2026-01-30 20:10:29 -08:00
-
5de387dbf9
ui: fix node-tar vulnerability
v187
Benson Wong
2026-01-28 21:40:18 -08:00
-
6f8e7ccb57
.github/workflows: switch release.yml to build ui-svelte
Benson Wong
2026-01-28 21:39:06 -08:00
-
4384315b44
ui-svelte: add Svelte port of React UI (#487)
Benson Wong
2026-01-28 21:37:29 -08:00
-
6439ab1515
ui: add peer:true in package-lock.json
v186
Benson Wong
2026-01-22 08:43:36 -08:00
-
f94226122c
build(deps-dev): bump tar from 7.5.3 to 7.5.6 in /ui (#477)
dependabot[bot]
2026-01-21 22:55:02 -08:00
-
7493618fdc
Add count_tokens api proxying (#476)
Ryan Voots
2026-01-20 12:34:42 -05:00
-
205efd40a1
proxy: extend /running endpoint with additional process data (#474)
Benson Wong
2026-01-19 17:37:00 -08:00
-
14207f8492
ui: npm security update
Benson Wong
2026-01-18 21:56:32 -08:00
-
4e850c2834
config: refactor macro substitution in configuration (#470)
Benson Wong
2026-01-18 21:52:34 -08:00
-
75fced579e
config: support macros in peer apiKey and filters (#469)
v185
Benson Wong
2026-01-16 23:10:50 -08:00
-
b73f367f22
config-schema.json,config.example.yaml: Update examples and schema
v184
Benson Wong
2026-01-16 22:43:25 -08:00
-
8f2137c72b
config: support environment variable macros in apiKeys (#467)
Benson Wong
2026-01-16 22:41:14 -08:00
-
124007cc98
config: add environment variable macros (#466)
Benson Wong
2026-01-16 22:25:20 -08:00
-
eb5bfff0b0
proxy: unify filtering for local models and peers
Benson Wong
2026-01-15 18:59:43 -08:00
-
3edb180c08
ci: free up disk space before ROCm container build (#460)
Benson Wong
2026-01-14 22:03:42 -08:00
-
66d555e625
Improve container build reliability (#457)
Benson Wong
2026-01-10 22:14:33 -08:00
-
4f863fd9fc
CLAUDE.md: tweak instructions
Benson Wong
2026-01-09 21:42:06 -08:00
-
267c030457
ui: update react-router-dom to 7.12.0 (#456)
v183
Benson Wong
2026-01-08 16:13:09 -08:00
-
c19309fe7e
CLAUDE.md: small instruction tweaks
Benson Wong
2026-01-07 21:34:23 -08:00
-
4413881b2d
proxy: actually add /v1/responses endpoint (#449)
v182
Benson Wong
2026-01-01 13:35:45 -08:00
-
8df5e8563b
proxy: add /v1/responses and /v1/audio/voices endpoints (#448)
v181
Benson Wong
2026-01-01 12:52:12 -08:00
-
7931212d3e
proxy: add v1/images/edits API endpoint (#447)
Benson Wong
2026-01-01 12:43:06 -08:00
-
3dc36032fb
proxy: skip very slow tests in -short test mode (#446)
Benson Wong
2025-12-31 14:08:56 -08:00
-
addb98646f
proxy: add support for basic authorization (#445)
Benson Wong
2025-12-31 13:42:35 -08:00
-
37d74efc2d
proxy: add /v1/images/generations (#443)
v180
Benson Wong
2025-12-30 21:04:58 -08:00
-
22e098ac8b
Add Peer Model Support (#438)
v179
Benson Wong
2025-12-27 20:18:06 -08:00
-
9864f9f517
.coderabbit.yaml: disable annoying features
Benson Wong
2025-12-23 23:52:48 -08:00
-
53b32f3601
proxy: add API key support (#436)
Benson Wong
2025-12-23 23:39:33 -08:00
-
565c44766d
config,proxy: add new configuration logToStdout (#432)
v178
Benson Wong
2025-12-21 22:23:31 -08:00
-
e6a9e210ba
proxy: fix path bug in /logs/stream/{model_id} (#431)
Benson Wong
2025-12-21 21:47:14 -08:00
-
d3f329f924
proxy: Improve logging performance and allow separate log streaming (#421)
v177
Benson Wong
2025-12-18 21:49:25 -08:00
-
98879b38c1
docker: add /app to $PATH (#424)
Benson Wong
2025-12-06 22:58:29 -08:00
-
7b3b0f5eae
move header images around [skip ci]
Benson Wong
2025-12-02 19:40:42 -08:00
-
021ccceef1
README: update hero image
Benson Wong
2025-12-02 19:37:03 -08:00
-
f03871c50a
Update README.md
Benson Wong
2025-12-02 19:03:01 -08:00
-
dc00d17abe
docs: add documentation for non-root container images and security considerations (#416)
Ryan Steed
2025-12-02 16:52:26 +00:00
-
dea98733c3
proxy: extract metrics for v1/messages (#419)
v176
Benson Wong
2025-11-29 23:51:20 -08:00
-
bccce5fa19
go.mod,ui/package-lock.json: dependency and security updates (#418)
v175
Benson Wong
2025-11-29 22:27:22 -08:00
-
c968da1b73
proxy: add support for anthropic v1/messages api (#417)
Benson Wong
2025-11-29 22:09:07 -08:00
-
a883d68d4f
feat: Add support for custom llama.cpp base image and forked llama-swap repositories (#396)
Ryan Steed
2025-11-30 04:59:15 +00:00
-
b1dec8b735
docker: build both root and non-root container images (#412)
v174
Ryan Steed
2025-11-25 18:44:13 +00:00
-
06523d8c1e
feat: add platform-specific process attributes support (#411)
Nikesh Parajuli
2025-11-25 00:39:56 -05:00
-
86e9b93c37
proxy,ui: add version endpoint and display version info in UI (#395)
v173
Ryan Steed
2025-11-17 18:43:47 +00:00
-
3acace810f
proxy: add configurable logging timestamp format (#401)
Ryan Steed
2025-11-16 18:21:59 +00:00
-
554d29e87d
feat: enhance model listing to include aliases (#400)
Ryan Steed
2025-11-15 22:35:26 +00:00