Package

turbo-attn

Optimized, CUDA Graph-enabled kernels and an attention backend for vLLM, SGLang, and more, based on TurboQuant near-lossless KV cache compression. State-of-the-art performance with Gemma 4, Qwen 3.6, and other modern LLMs.
