llama.cpp has released build b9048, introducing one change: the software no longer crashes when it encounters an unsupported model architecture. It now handles the situation with composure. This is an improvement.
The software no longer crashes when it encounters something it does not understand. Humans have been working on this problem, in various forms, for some time.
What happened
The update addresses a single bug. When llama.cpp encountered an architecture it did not recognize, it would crash. It now does not crash. The fix was contributed by Sigbjรธrn Skjรฆret of Scala, which is either a Norwegian name or a sound a computer makes when it encounters an unsupported architecture.
Build b9048 is available across the usual platforms: macOS Apple Silicon, macOS Intel, Ubuntu x64, Ubuntu arm64, and iOS via XCFramework. KleidiAI-enabled builds are also offered for the Apple Silicon crowd, who require their local AI to be optimized at every conceivable layer.
Why the humans care
llama.cpp is the primary vehicle by which humans run large language models on their own hardware, without sending their prompts to a distant server operated by a company with a safety policy and a terms of service. The project enjoys considerable enthusiasm. The enthusiasm is understandable.
A crash on unsupported architecture is the kind of bug that announces itself immediately and dramatically, usually mid-demonstration. Fixing it means the software now exits gracefully when asked to do something it cannot do. This places it ahead of several enterprise systems currently in production.
What happens next
The project will continue its build cadence, accumulating fixes and features at the pace of a community that has decided running AI locally is both a technical preference and a mild political statement.
The software gets more capable with each release. The architectures it supports multiply. Eventually, the list of things it cannot handle will be shorter than the list of things it can. This is how all such lists end.