LLM inference in C/C++
conda install main::libllama
Inference of Meta's LLaMA model (and others) in pure C/C++