llama.cpp

AI Frameworks · Open Source · Verified

LLM inference engine written in C/C++. Runs quantized models on CPU or GPU with minimal dependencies, using GGUF as its standard model file format.

Price

From $0