ggml-cpu : Add Power12 MMA+ INT8 matmul kernels by shalinib-ibm · Pull Request #24 · shalinib-ibm/llama.cpp

shalinib-ibm · 2026-05-06T15:15:39Z

Implement int8 matmul kernels using Power12 MMA+ builtins, leveraging 1024-bit dense math registers.
Refactor the code to separate the Power10 and Power12 kernel implementations for readability.
Add a standalone test for int8 matmul with configurable dimensions (m, n, k). This can be used for functional validation.

Example: ./llama-bench-matmult -t 10 -i 1 -m 48 -n 20 -k 1
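For functional validation, an optimized int8 kernel is typically compared against a plain scalar reference. The block below is a minimal sketch of such a reference, not the test added by this PR: the names `ref_matmul_int8` and `count_mismatches` are hypothetical, the matrices are assumed to be dense row-major int8 with int32 accumulation (ggml's quantized block layouts are not modeled), and reading m, n, k as the output rows, output columns, and shared inner dimension is an assumption about the `-m`, `-n`, `-k` flags.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical scalar reference: C (m x n, int32) = A (m x k, int8) * B (k x n, int8).
// All matrices are stored row-major. Products are widened to int32 before
// accumulation so that int8 * int8 cannot overflow.
std::vector<int32_t> ref_matmul_int8(const std::vector<int8_t>& a,
                                     const std::vector<int8_t>& b,
                                     std::size_t m, std::size_t n, std::size_t k) {
    std::vector<int32_t> c(m * n, 0);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            int32_t acc = 0;
            for (std::size_t p = 0; p < k; ++p) {
                acc += static_cast<int32_t>(a[i * k + p]) *
                       static_cast<int32_t>(b[p * n + j]);
            }
            c[i * n + j] = acc;
        }
    }
    return c;
}

// Hypothetical helper: counts the elements that differ between the reference
// result and the result of the kernel under test. A size difference is counted
// as that many extra mismatches.
std::size_t count_mismatches(const std::vector<int32_t>& expected,
                             const std::vector<int32_t>& actual) {
    const std::size_t common = std::min(expected.size(), actual.size());
    std::size_t bad = std::max(expected.size(), actual.size()) - common;
    for (std::size_t i = 0; i < common; ++i) {
        if (expected[i] != actual[i]) {
            ++bad;
        }
    }
    return bad;
}
```

A validation run under these assumptions would fill A and B with known values, compute the reference with `ref_matmul_int8`, run the kernel under test on the same inputs, and require `count_mismatches` to return 0. Because integer arithmetic is exact, no tolerance is needed for the comparison.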

Overview

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

- Implement int8 matmul kernels using Power12 MMA+ builtins, leveraging 1024-bit dense math registers.
- Refactor code to separate Power10 and Power12 implementations with zero runtime overhead.
- Add a standalone test for int8 matmul with configurable dimensions (m, n, k). This can be used for functional validation.

Example: ./llama-bench-matmult -t 10 -i 1 -m 48 -n 20 -k 1

Signed-off-by: Bodapati Shalini Salomi <bodapatishalinisalomi@Bodapatis-MacBook-Pro.local>

shalinib-ibm force-pushed the int8_mmaplus_p12_kernels branch from fd5d5aa to 6ad0a75 Compare May 7, 2026 15:17

shalinib-ibm wants to merge 1 commit into master from int8_mmaplus_p12_kernels
