Skip to main content
starstack
Browse
Categories
Stack Builder
Build This
Why
Roadmap
Search
⌘
K
Sign in
Toggle theme
Services
llama.cpp
All Services
llama.cpp
AI & Machine Learning
AI Infrastructure
Visit Website
GitHub
97.8K
Copy Link
Share
Report Issue
About
Efficient LLM inference in C/C++ with support for CPU, Metal, and CUDA acceleration.