Python bindings for the llama.cpp library
Python bindings for llama.cpp, providing a simple Python interface for inference with Large Language Models (LLMs) using the llama.cpp backend. Supports CPU and GPU acceleration with external llama.cpp library.