llama-swap

Files

T

pdscomp 181f71ca11 .github,docker: add cuda13 architecture support (#551 )

Add `cuda13` as a supported build architecture, targeting the
`ghcr.io/ggml-org/llama.cpp:server-cuda13` upstream base image.

The `server-cuda13` image ships with CUDA 13 libraries, providing
improved performance on recent NVIDIA hardware compared to the existing
`server-cuda` (CUDA 12) image. Users with newer GPUs (e.g., RTX
50-series) benefit from reduced model load latency and higher token
throughput.

- Add `cuda13` to the allowed architectures list in
`docker/build-container.sh`
- Add `cuda13` to the CI matrix in `.github/workflows/containers.yml` so
the container is built and pushed automatically

2026-03-01 09:37:08 -08:00

closeinactive.yml

Reduce stale time for issues

2025-04-29 21:16:34 -07:00

config-schema.yml

Add configuration file JSON schema (#393 )

2025-11-08 15:04:14 -08:00

containers.yml

.github,docker: add cuda13 architecture support (#551 )

2026-03-01 09:37:08 -08:00

go-ci-windows.yml

Add path filters to CI workflows and create UI test workflow (#501 )