LLM inference in C/C++
Install with conda:
conda install anaconda::libllama
Inference of Meta's LLaMA model (and others) in pure C/C++