llama.cpp
AI FrameworksOpen SourceVerified
C/C++ LLM inference engine. Run quantized models on CPU/GPU with minimal dependencies. GGUF format standard.
Price
From $0
C/C++ LLM inference engine. Run quantized models on CPU/GPU with minimal dependencies. GGUF format standard.
From $0