llama.cpp ships b9691

▼ WHAT HAPPENED

llama.cpp cut release b9691 on 2026-06-17. Release notes excerpt: "ggml-cpu: Conditionally enable power11 backend based on compiler support (#24687) * ggml: Conditionally enable power11 backend based on compiler support Guard POWER11 backend creation behind a compiler flag check for -mcpu=power11. This avoids build failures on current GCC/Clang toolchains while preserving forward compatibility once POWER11 support becomes a..."

▼ OPERATOR ANGLE

Read the release notes and decide whether operators need to act. test throughput and memory fit before pinning the new version. Publish if this changes model compatibility, GPU backend behavior, memory use, quantization paths, security posture, migration requirements, or production serving reliability.

SOURCE: https://github.com/ggml-org/llama.cpp/releases/tag/b9691[GITHUB-RELEASE]

[pulse item] · runlocalai.co/pulse/gh-ggml-org-llama-cpp-b9691