vllm

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

Versions

To install this package, run one of the following:

$conda install conda-forge::vllm

Version

5 / 8 versions selected

Downloads (Last 6 months): 0

Easy, fast, and cheap LLM serving for everyone

Summary

A high-throughput and memory-efficient inference and serving engine for LLMs

Last Updated

Jan 11, 2026 at 10:24

License

Apache-2.0 AND BSD-3-Clause

Total Downloads

25.3K

Version Downloads

6.6K

Supported Platforms

linux-64

linux-aarch64

macOS-arm64

Home

Documentation