← /pulse/gh-ggml-org-llama-cpp-b9674
WARNINGRUNTIME UPDATE·2026-06-17

llama.cpp ships b9674

▼ WHAT HAPPENED

llama.cpp cut release b9674 on 2026-06-17. Release notes excerpt: "SYCL: fix use-after-free bug with async memcpy in MoE prefill (#24676) * SYCL: fix a bug with async memcpy * make mmid_row_mapping_host persistent * comment on stream->wait * Apply suggestion from @sanmai * Apply suggestion from @sanmai * Apply suggestion from @sanmai **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/rel..."

▼ OPERATOR ANGLE

Read the release notes and decide whether operators need to act. test throughput and memory fit before pinning the new version. Publish if this changes model compatibility, GPU backend behavior, memory use, quantization paths, security posture, migration requirements, or production serving reliability.
[pulse item] · runlocalai.co/pulse/gh-ggml-org-llama-cpp-b9674