llama-swap

steve/llama-swap

Fork 0

Commit Graph

Author	SHA1	Message	Date
Ron M	a37b4866d8	proxy: add configurable HTTP timeouts for models and peers (#619 ) Add configurable HTTP timeout settings to both models and peers to support installations that requires longer timeouts than the current hardcoded defaults. Closes #618	2026-04-06 19:30:27 +08:00
Benson Wong	22e098ac8b	Add Peer Model Support (#438 ) This PR allows a single llama-swap to be the central proxy for models served by other inference servers. The peer servers can be another llama-swap or any API that supports the /v1/* inference endpoint. Updates: #433, #299 Closes: #296	2025-12-27 20:18:06 -08:00

Author

SHA1

Message

Date

Ron M

a37b4866d8

proxy: add configurable HTTP timeouts for models and peers (#619 )

Add configurable HTTP timeout settings to both models and peers to support installations that requires longer timeouts than the current hardcoded defaults.

Closes #618

2026-04-06 19:30:27 +08:00

Benson Wong

22e098ac8b

Add Peer Model Support (#438 )

This PR allows a single llama-swap to be the central proxy for models served by other inference servers. The peer servers can be another llama-swap or any API that supports the /v1/* inference endpoint.

Updates: #433, #299
Closes: #296

2025-12-27 20:18:06 -08:00

2 Commits