A lightweight CLI tool and OpenAI-compatible server for querying multiple Large Language Model (LLM) providers