WARNING · RUNTIME UPDATE · 2026-06-16
llama.cpp ships b9663
▼ WHAT HAPPENED
llama.cpp cut release b9663 on 2026-06-16. Release notes excerpt:

"[SYCL] Support OP EXPM1, support all UT cases of FLOOR, TRUNC, ROUND (#24363)
* support OP EXPM1, support all UT cases of FLOOR, TRUNC, ROUND
* fix conflict
* rebase, support new UT case of repeat, concat

**macOS/iOS:**
- [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9663/llama-b9663-bin-macos-arm64.tar.gz)
- macOS Ap..."
▼ OPERATOR ANGLE
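Read the full release notes and decide whether you need to act. The visible excerpt is tagged [SYCL] and covers operator support (EXPM1, FLOOR, TRUNC, ROUND) in that backend, so deployments on the SYCL backend are the most likely to see a behavior change. Test throughput and memory fit on your own hardware before pinning the new version. Publish a follow-up if the release changes model compatibility, GPU backend behavior, memory use, quantization paths, security posture, migration requirements, or production serving reliability.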
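The "test throughput and memory fit before pinning" step can be reduced to a simple gate once you have measurements from your current pin and from b9663. The sketch below is a minimal illustration under stated assumptions: `BenchResult`, `safe_to_pin`, the 5% slowdown tolerance, and the example numbers are all hypothetical and not part of llama.cpp. It only compares figures you have already collected with your own benchmarking tool.

```python
from dataclasses import dataclass


@dataclass
class BenchResult:
    """Measurements you collected yourself for one build (hypothetical type)."""
    build: str             # label for the build, e.g. "b9663"
    tokens_per_sec: float  # measured generation throughput
    peak_mem_bytes: int    # measured peak memory (RAM or VRAM)


def safe_to_pin(baseline, candidate, mem_budget_bytes, max_slowdown=0.05):
    """Return (ok, reasons) for pinning `candidate` over `baseline`.

    Assumed policy, not an official one: the candidate must fit inside the
    memory budget and must not be more than `max_slowdown` (a fraction)
    slower than the baseline.
    """
    reasons = []

    # Memory fit: the candidate's peak usage must stay within the budget.
    if candidate.peak_mem_bytes > mem_budget_bytes:
        reasons.append(
            f"{candidate.build} peak memory {candidate.peak_mem_bytes} "
            f"exceeds budget {mem_budget_bytes}"
        )

    # Throughput: allow a small tolerance below the baseline.
    floor = baseline.tokens_per_sec * (1.0 - max_slowdown)
    if candidate.tokens_per_sec < floor:
        reasons.append(
            f"{candidate.build} throughput {candidate.tokens_per_sec:.1f} tok/s "
            f"is below floor {floor:.1f} tok/s"
        )

    return (not reasons, reasons)


if __name__ == "__main__":
    GIB = 1024 ** 3
    # Example numbers are made up for illustration only.
    previous = BenchResult("previous-pin", tokens_per_sec=40.0, peak_mem_bytes=6 * GIB)
    new = BenchResult("b9663", tokens_per_sec=39.0, peak_mem_bytes=6 * GIB)
    ok, reasons = safe_to_pin(previous, new, mem_budget_bytes=8 * GIB)
    print("pin" if ok else "hold", reasons)
```

Run the same model, quantization, and prompt/generation lengths for both builds so the comparison is like for like, and adjust the tolerance to whatever regression your serving setup can absorb.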
[pulse item] · runlocalai.co/pulse/gh-ggml-org-llama-cpp-b9663