vllm

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

Versions

To install this package, run one of the following:

$conda install conda-forge::vllm

Version

5 / 8 versions selected

Downloads (Last 6 months): 0

Easy, fast, and cheap LLM serving for everyone

Summary

A high-throughput and memory-efficient inference and serving engine for LLMs

Last Updated

Apr 27, 2026 at 14:19

License

Apache-2.0 AND BSD-3-Clause

Supported Platforms

linux-aarch64

linux-64

macOS-arm64

Home

Documentation