Skip to content

Upgrade llama.cpp from b9495 to b9543#211

Merged
bernardladenthin merged 1 commit into
mainfrom
claude/blissful-meitner-f8Hr6
Jun 6, 2026
Merged

Upgrade llama.cpp from b9495 to b9543#211
bernardladenthin merged 1 commit into
mainfrom
claude/blissful-meitner-f8Hr6

Conversation

@bernardladenthin
Copy link
Copy Markdown
Owner

Summary

  • Bumps llama.cpp dependency from b9495 to b9543 (48 commits)
  • Updates CMakeLists.txt, README.md, and CLAUDE.md with new version pin
  • Comprehensive audit confirms zero breaking changes to project source code

Details

The b9495→b9543 range includes significant upstream refactoring across multiple subsystems:

hparams refactor: n_layer field split into n_layer_all (raw count) and n_layer() member function (effective count). Sibling rename nextn_predict_layersn_layer_nextn. New deepstack_mapping_arr for per-layer deepstack injection. Project does not reference any of these symbols directly (verified via grep).

State-seq API simplification: Removed LLAMA_STATE_SEQ_FLAGS_ON_DEVICE flag; on-device is now default for partial saves/loads. Project does not call llama_state_seq_* functions directly.

MTMD subsystem refactor: clip_image_u8/clip_image_f32 switched from public POD fields to private members with getter/setter API. Several public helpers removed (clip_embd_nbytes, clip_build_img_from_pixels, etc.). New Granite4 Vision projector type with QFormer-window deepstack. Project does not link MTMD TUs into JNI build; vision API surfaces through mtmd-helper.h with zero breaking references.

New imatrix loader library: Extracted shared imatrix-loading logic into common/imatrix-loader.{h,cpp}. Automatically picked up by JNI build via existing target_link_libraries(jllama PRIVATE common). Project does not use imatrix loading from Java.

Server endpoints & chat parser: New token-counting endpoints. process_mtmd_prompt() gained optional bool is_placeholder parameter (defaulted, so existing call sites compile unchanged). LFM2/2.5 chat parser unified with new tool_list_tokens flag.

Backend numerical work: Per-backend optimizations (CUDA, RISC-V, WebAssembly, Metal, OpenCL, SYCL, Vulkan, WebGPU, KleidiAI). All internal; no API surface change.

Python conversion tools: New Granite4VisionMmprojModel, Gemma4 fix-ups, new GGUF metadata keys for deepstack/vision projectors.

Test plan

  • Audit confirms zero grep matches in src/main/cpp/ for any renamed/removed symbols
  • Only project-visible signature change (process_mtmd_prompt parameter) is defaulted
  • All breaking changes absorbed inside upstream-compiled translation units
  • CI will verify clean build with new version pin

Related issues / PRs

Follows the llama.cpp upgrade process documented in CLAUDE.md.

Checklist

  • I have read CONTRIBUTING.md and CODE_OF_CONDUCT.md
  • My commits follow Conventional Commits
  • No security-sensitive changes

https://claude.ai/code/session_0162Zk8UWHYrRYjbaU4QPtt7

Audit: zero project sources reference any renamed or removed symbols
(hparams.n_layer, nextn_predict_layers, LLAMA_STATE_SEQ_FLAGS_ON_DEVICE,
clip_image_u8/f32 field access, removed clip helpers, new
mtmd_helper_bitmap placeholder parameter). All breaking changes are
absorbed inside upstream-compiled translation units.

https://claude.ai/code/session_0162Zk8UWHYrRYjbaU4QPtt7
@bernardladenthin bernardladenthin merged commit 09c9ee9 into main Jun 6, 2026
4 of 9 checks passed
@bernardladenthin bernardladenthin deleted the claude/blissful-meitner-f8Hr6 branch June 6, 2026 21:23
@sonarqubecloud
Copy link
Copy Markdown

sonarqubecloud Bot commented Jun 6, 2026

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants