vllm

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

Versions

To install this package, run one of the following:

$conda install services::vllm

Version

1 / 8 versions selected

Downloads (Last 6 months): 0

Summary

A high-throughput and memory-efficient inference and serving engine for LLMs

Last Updated

Jul 30, 2025 at 16:25

License

Apache-2.0

Total Downloads

104

Version Downloads

104

Supported Platforms

linux-64