vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
| Name | Type | Version | Platform | Labels | Updated | Size |
|---|---|---|---|---|---|---|
| linux-64/vllm-0.9.2-cuda128_py312had04d17_201.tar.bz2 | conda | 0.9.2 | linux-64 | main | 12 days ago | 288.46 MB |
| linux-64/vllm-0.9.2-cuda128_py311had04d17_201.tar.bz2 | conda | 0.9.2 | linux-64 | main | 12 days ago | 288.8 MB |
| linux-64/vllm-0.9.2-cpu_py311hc34d672_0.tar.bz2 | conda | 0.9.2 | linux-64 | main | Apr 27, 2026, 07:24 PM | 13.7 MB |
| linux-64/vllm-0.9.2-cpu_py312hc34d672_0.tar.bz2 | conda | 0.9.2 | linux-64 | main | Apr 27, 2026, 07:24 PM | 13.46 MB |
| linux-64/vllm-0.9.2-cpu_py310hc34d672_0.tar.bz2 | conda | 0.9.2 | linux-64 | main | Apr 27, 2026, 07:24 PM | 12.08 MB |
| linux-aarch64/vllm-0.9.2-cpu_py312h299015d_0.tar.bz2 | conda | 0.9.2 | linux-aarch64 | main | Apr 27, 2026, 07:20 PM | 7.3 MB |
| linux-aarch64/vllm-0.9.2-cpu_py311h299015d_0.tar.bz2 | conda | 0.9.2 | linux-aarch64 | main | Apr 27, 2026, 07:20 PM | 7.56 MB |
| linux-aarch64/vllm-0.9.2-cpu_py310h299015d_0.tar.bz2 | conda | 0.9.2 | linux-aarch64 | main | Apr 27, 2026, 07:20 PM | 5.91 MB |