llama-cpp-python
Python bindings for the llama.cpp library
Python bindings for the llama.cpp library
To install this package, run one of the following:
Python bindings for llama.cpp, providing a simple Python interface for inference with Large Language Models (LLMs) using the llama.cpp backend. Supports CPU and GPU acceleration with external llama.cpp library.
Summary
Python bindings for the llama.cpp library
Last Updated
Apr 2, 2026 at 20:55
License
MIT
Supported Platforms
GitHub Repository
https://github.com/abetlen/llama-cpp-pythonDocumentation
https://llama-cpp-python.readthedocs.io