vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
| Name | Type | Version | Platform | Labels | Updated | Size | Downloads | Actions |
|---|
linux-64/vllm-0.8.1-py312h2bc3f7f_0.tar.bz2 | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:26 PM | 125.31 MB | 5 | |
linux-64/vllm-0.8.1-py312h2bc3f7f_0.conda | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:26 PM | 87.63 MB | 26 | |
linux-64/vllm-0.8.1-py311h2bc3f7f_0.tar.bz2 | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:26 PM | 125.32 MB | 5 | |
linux-64/vllm-0.8.1-py311h2bc3f7f_0.conda | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:25 PM | 87.72 MB | 20 | |
linux-64/vllm-0.8.1-py310h2bc3f7f_0.tar.bz2 | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:25 PM | 124.39 MB | 5 | |
linux-64/vllm-0.8.1-py310h2bc3f7f_0.conda | conda | 0.8.1 | linux-64 | main | Jul 30, 2025, 04:25 PM | 86.77 MB | 25 |