llama.cpp ships b9656

▼ WHAT HAPPENED

llama.cpp cut release b9656 on 2026-06-15. Release notes excerpt: "chat: harden peg-native tool call parsing (#24329) * chat: harden peg-native tool call parsing accept an optional leading type: function field in build_json_tools_flat_keys so openai style tool calls parse on templates whose serialization opens on the name field. return a clean error and log the unparsed fragment on a final peg parse failure instead of throw..."

▼ OPERATOR ANGLE

Read the release notes and decide whether operators need to act. test throughput and memory fit before pinning the new version. Publish if this changes model compatibility, GPU backend behavior, memory use, quantization paths, security posture, migration requirements, or production serving reliability.

SOURCE: https://github.com/ggml-org/llama.cpp/releases/tag/b9656[GITHUB-RELEASE]

[pulse item] · runlocalai.co/pulse/gh-ggml-org-llama-cpp-b9656