Commit Graph

  • 0292c90ca1 ci: copy ui-svelte/.npmrc before npm ci in fork-cuda build main steve 2026-06-28 12:56:21 -04:00
  • 617c7dc6b9 ci: add Gitea workflow to build fork CUDA image steve 2026-06-28 12:48:48 -04:00
  • 542b79dacf internal/router/scheduler: add serial scheduler, default on this fork steve 2026-06-28 12:17:32 -04:00
  • d567fa78cb npm audit fix claude/ui-svelte-shading-migration-w30ta6 Benson Wong 2026-06-28 04:38:45 +00:00
  • 187f1ae27a ui: fix logs tab height and column toggle dropdown Benson Wong 2026-06-28 04:36:56 +00:00
  • 0ae56b1eb9 ui: convert chat settings panel to a dialog Benson Wong 2026-06-28 04:22:01 +00:00
  • e46cbeb2bf ui: refocus message input after chat generation completes Benson Wong 2026-06-28 04:16:23 +00:00
  • a0578f0007 ui: reorganize sidebar and add Settings page Benson Wong 2026-06-28 03:53:14 +00:00
  • d207a059a4 ui: enable pagination on Activity page and fix table reactivity Benson Wong 2026-06-28 03:43:55 +00:00
  • 040ee1e284 ui: convert ActivityTable to shadcn-svelte data-table Benson Wong 2026-06-28 03:26:24 +00:00
  • 82cad1b84e ui: add ModelsDash route, clickable sidebar headings, and dialog tweaks Benson Wong 2026-06-28 03:04:04 +00:00
  • 55c3678906 ui: extract shared ActivityTable and split ModelDetail into components Benson Wong 2026-06-28 02:27:05 +00:00
  • 8b5a62d92a ui-svelte: big convert to shadcn components Benson Wong 2026-06-28 01:53:19 +00:00
  • d1e4c8ee77 ui tweaks Benson Wong 2026-06-28 01:21:40 +00:00
  • 11f8afead8 ui: add collapsible Models section to sidebar Benson Wong 2026-06-27 23:54:18 +00:00
  • 749819ef47 ui: consolidate playground nav into sidebar Benson Wong 2026-06-27 16:46:10 +00:00
  • 0ab9e74333 ui: finish shadcn migration and remove legacy shim Claude 2026-06-27 12:10:56 +00:00
  • b20be6dcd1 ui: convert Image, Speech, Audio interfaces to shadcn buttons Claude 2026-06-27 12:05:19 +00:00
  • fc24722258 ui: migrate Rerank and normalize remaining views to shadcn tokens Claude 2026-06-27 12:01:19 +00:00
  • 2b087dffb1 ui: migrate ChatMessage to shadcn tokens Claude 2026-06-27 11:58:24 +00:00
  • 746c083a87 ui: migrate chat playground and stats to shadcn Claude 2026-06-27 11:56:31 +00:00
  • 8dd91e99e8 ui: migrate Activity, Logs views to shadcn Claude 2026-06-27 11:52:11 +00:00
  • 136dcdc25f ui: migrate Models panel and Playground to shadcn Claude 2026-06-27 11:49:16 +00:00
  • 767b8015fa ui: replace top navbar with shadcn sidebar layout Claude 2026-06-27 11:46:30 +00:00
  • f0144a2361 ui: add shadcn-svelte foundation and theming Claude 2026-06-27 11:42:43 +00:00
  • 0a25b3bd31 AGENTS.md: small tweaks Benson Wong 2026-06-25 20:31:48 -07:00
  • 32bc781326 internal/config,watcher: add -config-dir (#873) v230 Benson Wong 2026-06-24 20:48:51 -07:00
  • 316ad63f76 config,server: add upstream.ignorePaths (#869) v229 Benson Wong 2026-06-21 13:49:53 -07:00
  • e37077a963 feat: hide performance menu item if disabled (#832) g2mt 2026-06-21 13:38:29 -07:00
  • eff9b60434 server: capture failed (non-200) LLM requests (#862) Benson Wong 2026-06-20 11:50:35 -07:00
  • 9bcddad91b internal/server,ui: add new Acitivty page column - Drafted (#859) Wojciech 2026-06-19 05:55:02 +02:00
  • a15e47922c proxy: meter /upstream requests via metrics middleware (#858) v228 Benson Wong 2026-06-17 17:38:52 -07:00
  • 0ab214d1c8 perf: add vendor-agnostic GPU monitoring for Windows (experimental) (#779) v227 George 2026-06-17 04:49:09 +00:00
  • d07b063ab6 internal/server,shared: support request metadata (#850) Benson Wong 2026-06-16 21:44:55 -07:00
  • 826210dac9 .coderabbit.yaml: disable unit_tests Benson Wong 2026-06-16 10:10:17 -07:00
  • 090bb4623c CodeRabbit Generated Unit Tests: Generate unit tests for PR changes coderabbitai/utg/6cf1317 coderabbitai[bot] 2026-06-16 12:47:43 +00:00
  • 6cf1317341 schedule,shared: move concurrency 429 limits into scheduler code (#849) Benson Wong 2026-06-15 22:35:12 -07:00
  • 8e84b2ec4f README.md: add macports install option to README (#848) Wojciech 2026-06-16 00:58:24 +02:00
  • ed77385d08 ui: improve manual model load and cancel (#847) v226 Benson Wong 2026-06-14 13:38:10 -07:00
  • 92b90447e8 Model capabilities 734 (#842) v225 Benson Wong 2026-06-13 23:23:19 -07:00
  • 62aea0e83d internal/router,server,shared: refactor auth, libs (#839) Benson Wong 2026-06-13 10:19:04 -07:00
  • 8c660dcb90 main: gofmt Benson Wong 2026-06-11 22:16:39 -07:00
  • f6877b8175 main: show message when listening on network (#836) Benson Wong 2026-06-11 22:15:14 -07:00
  • 9b3a33d7b9 Implement new scheduler (#823) Benson Wong 2026-06-10 20:34:25 -07:00
  • 0cfe5a6639 Makefile,internal: fix websocket regression and other small things (#830) v224 Benson Wong 2026-06-09 21:37:53 -07:00
  • 44e1501e81 internal/process,server: fix unload regression (#828) Benson Wong 2026-06-09 20:49:58 -07:00
  • 46cea36bc2 proxy: remove legacy code. Thanks champ 🫡 (#822) Benson Wong 2026-06-06 21:00:30 -07:00
  • ccfba0df28 docker: fix arm64 cpu image downloading amd64 llama-swap binary (#819) Benson Wong 2026-06-04 14:26:21 -07:00
  • ddfae90b19 Change cron schedule for container builds Benson Wong 2026-06-04 11:00:43 -07:00
  • 29d3d9ba20 perf: add macOS GPU monitoring via mactop and ioreg (#816) v223 Benson Wong 2026-06-03 21:51:03 -07:00
  • 9be9a87fa0 internal/process: improve windows shutdown behaviour (#808) v222 Benson Wong 2026-06-01 00:45:30 -07:00
  • 6ea551362e process,router: make model shutdown and load-streaming robust v221 Benson Wong 2026-05-31 10:11:12 -07:00
  • 03d58e53fa Add load testing tool to the UI (#805) v220 Benson Wong 2026-05-30 17:04:30 -07:00
  • c790d0ee03 fix: update the concurrency middleware to respond with a JSON payload (#798) Luiszzzor 2026-05-30 08:59:32 +02:00
  • 4ca9c478a2 Makefile,internal/server: various release tweaks v219 Benson Wong 2026-05-29 15:27:08 -07:00
  • 146a9eab24 ui-svelte: update build directory (#801) Benson Wong 2026-05-29 14:45:05 -07:00
  • 02e015fa49 Introduce new routing backend (#790) v218 Benson Wong 2026-05-28 21:47:01 -07:00
  • 63bc266395 Add new power draw column header for rocm-smi monitoring (#788) Cr4xy 2026-05-25 20:36:16 +02:00
  • 636b53e70f Improve rocm-smi performance monitoring (#775) v217 Cr4xy 2026-05-21 02:59:49 +02:00
  • 59cd3b690d Added Windows performance monitoring using nvidia-smi (#773) gatkisson 2026-05-18 21:02:03 +03:00
  • 5d1e62d224 Disable auto review feature in coderabbit config Benson Wong 2026-05-18 10:40:21 -07:00
  • dbb869d019 Increase inactivity thresholds for stale issues Benson Wong 2026-05-17 22:52:58 -07:00
  • 26bb17e57e config.example.yaml: Improve matrix vs groups info Benson Wong 2026-05-17 15:59:25 -07:00
  • 2982dd3d40 ui-svelte: update link to performance discussion thread v216 Benson Wong 2026-05-17 11:45:56 -07:00
  • 79dc87f881 Add ROCm stats via rocm-smi (#767) v215 knguyen298 2026-05-17 09:58:26 -05:00
  • b2fcc2daa1 ui-svelte: fix cached tokens total counting -1 sentinel (#760) v214 krzychdre 2026-05-15 23:42:44 +02:00
  • 6a9c4efc8f fix: use --loop instead of -loop for nvidia-smi (driver 540+ compat) (#759) cdwaage 2026-05-15 13:20:29 -07:00
  • 0c813e44d1 ui-svelte: package updates v213 Benson Wong 2026-05-14 21:56:04 -07:00
  • fe71e8a6ea proxy,ui-svelte: improve support for v1/messages and v1/responses (#758) Benson Wong 2026-05-14 21:53:57 -07:00
  • aac7b8745a ci: set go-version-file in release workflow v212 Benson Wong 2026-05-13 22:12:02 -07:00
  • 4e606feff0 ci: fix workflow bugs in release and go-ci Benson Wong 2026-05-13 21:48:27 -07:00
  • a4b91e08cf Changes and fixes before the release (docs/small tweaks) (#750) Benson Wong 2026-05-13 21:18:19 -07:00
  • 3e3646f9f9 perf: ignore LACT devices reporting zero VRAM (#753) David Soušek 2026-05-13 19:03:54 +02:00
  • a01afe261b ci: use manifest-aware cleanup action for multi-arch :cpu (#751) rhtenhove 2026-05-13 03:04:46 +02:00
  • 174e8562aa Multi arch cpu (#746) rhtenhove 2026-05-12 06:03:48 +02:00
  • 085b54bc88 proxy: fix data race in /running endpoint and typo in error message (#748) Abdulazez A. 2026-05-11 22:49:18 +03:00
  • 2be3416baa ui: add auto theme switch mode based on system theme (#741) bankjaneo 2026-05-10 10:22:18 +07:00
  • 7e3e94a08a proxy,ui: add performance monitoring with Prometheus metrics (#743) Benson Wong 2026-05-09 13:29:22 -07:00
  • e261745c66 proxy: add versionless API endpoint (#733) Wim Vander Schelden 2026-05-03 22:47:38 +02:00
  • 11b7913287 llama-swap.go: remove debounce, replace fmt.Printlns (#731) Benson Wong 2026-05-02 16:28:53 -07:00
  • c79114d40a proxy: fix logger not checking matrix for processes v211 Marcus 2026-05-01 16:43:20 -07:00
  • 430166d5eb proxy: fix zero duration for non streaming responses (#723) v210 Benson Wong 2026-04-30 19:51:28 -07:00
  • 5b4beaceef fix: ?no-history flag and improve /logs monitoring docs (#721) Marcus 2026-04-30 00:50:36 -07:00
  • fd3c28ffc5 Refactor Activity Page (#710) v209 Benson Wong 2026-04-28 20:33:03 -07:00
  • a846c4f18c config: remove hard cap on macro length (#718) Quentin Machu 2026-04-28 16:32:54 -04:00
  • 5bae33a769 ui-svelte: default theme to user preferred color scheme (#712) Marcus 2026-04-27 06:44:22 -07:00
  • 8f4ff01f93 ui-svelte: make it easier to toggle panels in logs view Benson Wong 2026-04-26 22:12:43 -07:00
  • e8d4384cd2 ui-svelte: support reasoning and reasoning_content (#708) v208 Benson Wong 2026-04-26 13:11:48 -07:00
  • ce28485be2 ui-svelte: add prompt processing histogram (#705) Benson Wong 2026-04-25 16:13:07 -07:00
  • 3cd7837b1f fix: support architecture-specific download URLs in install script (#698) Damir 2026-04-24 03:05:33 +02:00
  • 0b31ccacc1 ui-svelte: fix histogram calculation (#695) v207 v206 Benson Wong 2026-04-22 23:42:39 -07:00
  • 5938dbee8f Push unified docker images on scheduled runs (#694) Bryan Gahagan 2026-04-22 23:46:51 -04:00
  • 66639e83f7 proxy: replace fsnotify with stat-poll watcher and add SIGHUP reload (#685) v205 Benson Wong 2026-04-21 23:21:48 -07:00
  • 625b296720 docker/unified: add uv via pip install (#681) Benson Wong 2026-04-20 20:55:51 -07:00
  • 231e62291c proxy: fix matrix race and process stop bug (#677) v204 Benson Wong 2026-04-20 00:21:11 -07:00
  • 57ac666598 .github/workflows: tweak push ghcr conditional (#676) Benson Wong 2026-04-19 13:56:26 -07:00
  • 69728301f5 .github/workflows: add toggle for pushing unified images to github (#672) Benson Wong 2026-04-19 10:10:48 -07:00
  • c176fa70f1 docker/unified: add spirv-headers to fix vulkan build (#669) Benson Wong 2026-04-18 12:18:10 -07:00
  • 5e3c646829 proxy: compress captures with zstd (#668) v203 Benson Wong 2026-04-17 23:29:37 -07:00
  • c3f0d43e6e proxy: fix race conditions during swap (#667) Benson Wong 2026-04-17 21:23:17 -07:00