llama-swap

pdscomp 181f71ca11 .github,docker: add cuda13 architecture support (#551)

Add `cuda13` as a supported build architecture, targeting the
`ghcr.io/ggml-org/llama.cpp:server-cuda13` upstream base image.

The `server-cuda13` image ships with CUDA 13 libraries, providing
improved performance on recent NVIDIA hardware compared to the existing
`server-cuda` (CUDA 12) image. Users with newer GPUs (e.g., RTX
50-series) benefit from reduced model load latency and higher token
throughput.

- Add `cuda13` to the allowed architectures list in
`docker/build-container.sh`
- Add `cuda13` to the CI matrix in `.github/workflows/containers.yml` so
the container is built and pushed automatically
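
As a rough illustration of the allow-list change described above, the sketch below validates an architecture name and maps it to an upstream base image tag. It is not the actual contents of `docker/build-container.sh`: the names `ALLOWED_ARCHS` and `base_image_for` are hypothetical, and the list holds only the two architectures this commit message mentions.

```shell
#!/bin/sh
# Illustrative sketch only; NOT the real docker/build-container.sh.
# ALLOWED_ARCHS and base_image_for are hypothetical names chosen for this example.
# Only the two architectures named in the commit message are listed here.
ALLOWED_ARCHS="cuda cuda13"

# Print the upstream base image for a supported architecture.
# Returns 1 and writes an error to stderr if the architecture is not allowed.
base_image_for() {
    arch="$1"
    for allowed in $ALLOWED_ARCHS; do
        if [ "$arch" = "$allowed" ]; then
            echo "ghcr.io/ggml-org/llama.cpp:server-${arch}"
            return 0
        fi
    done
    echo "error: unsupported architecture '${arch}'" >&2
    return 1
}
```

With this shape, supporting a new architecture means adding one word to the list, and any name not on the list is rejected before a build starts.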

2026-03-01 09:37:08 -08:00

ISSUE_TEMPLATE

add 'unconfirmed bug' as default label in bug-report.md

2025-08-15 15:38:12 -07:00

workflows

.github,docker: add cuda13 architecture support (#551)

2026-03-01 09:37:08 -08:00