Running LLMs Locally on Arch Linux — No Cloud, No Ollama

Why I Built This Instead of Using Ollama I wanted to run AI models locally on my Arch Linux laptop — privately, offline, with zero cloud dependency. Ollama seemed like the obvious choice, but I didn’t want an opaque, heavy framework. I wanted control. After navigating outdated documentation and breaking changes, I got llama.cpp running with Vulkan GPU acceleration on my Intel Iris Xe. This is the guide I wish I had. ...

March 15, 2026 · 4 min · 668 words