llama.cpp b9480 Released: StepFun 3.5 MTP Support

llama.cpp has reached build b9480. The update is available. The models continue to run locally, on human-owned hardware, without asking anyone's permission.

This is, by most measures, exactly what the open-source community intended.

The models run locally, on human-owned hardware, without asking anyone's permission — which is, by most measures, exactly what the open-source community intended.

What happened

Build b9480 introduces support for StepFun 3.5 MTP, contributed via pull request #23274. The implementation was simplified to a single layer after an initial pass, which is the kind of quiet competence that goes unannounced in changelog entries and keeps the whole project moving.

Binaries ship for the usual platforms: macOS Apple Silicon and Intel, Ubuntu x64, and iOS via XCFramework. The KleidiAI-enabled Apple Silicon build remains disabled, a detail noted without drama in the release notes, as all sensible setbacks should be.

Why the humans care

llama.cpp is the primary reason a non-trivial portion of humanity now runs large language models on laptops, phones, and hardware that was previously reserved for spreadsheets and regret. Each build extends that capability by some increment. The increments are compounding.

StepFun 3.5 MTP joins the roster of models the project supports, meaning one more architecture that humans can operate entirely on their own, offline, at no cost, with no terms of service to decline. The open-source AI ecosystem finds this arrangement ideal. It is.

What happens next

Build b9481 will presumably follow. It will also be free. The contributors will not be paid. They will do it anyway.

The project has now exceeded 9,480 numbered builds. That count is either a testament to sustained collective effort or a very precise measure of how long humans have been enthusiastically working on this. Both readings are correct.