Skip to content

ggml-cpu : Add Power12 MMA+ INT8 matmul kernels#24

Open
shalinib-ibm wants to merge 1 commit into
masterfrom
int8_mmaplus_p12_kernels
Open

ggml-cpu : Add Power12 MMA+ INT8 matmul kernels#24
shalinib-ibm wants to merge 1 commit into
masterfrom
int8_mmaplus_p12_kernels

Conversation

@shalinib-ibm
Copy link
Copy Markdown
Owner

  • Implement int8 matmul kernels using Power12 MMA+ builtins, leveraging 1024-bit dense math registers.

  • Refactor code to separate Power10 and Power12 implementations of kernels for readability

  • Add a standalone test for int8 matmul with configurable dimensions (m, n, k). This can be used for functional validation.

    Example: ./llama-bench-matmult -t 10 -i 1 -m 48 -n 20 -k 1

Overview

Additional information

Requirements

- Implement int8 matmul kernels using Power12 MMA+ builtins,
  leveraging 128-bit dense math registers.

- Refactor code to separate Power10 and Power12 implementations
  with zero runtime overhead.

- Add a standalone test for int8 matmul with configurable
  dimensions (m, n, k). This can be used for functional validation.

  Example:
    ./llama-bench-matmult -t 10 -i 1 -m 48 -n 20 -k 1

Signed-off-by: Bodapati Shalini Salomi <bodapatishalinisalomi@Bodapatis-MacBook-Pro.local>
@shalinib-ibm shalinib-ibm force-pushed the int8_mmaplus_p12_kernels branch from fd5d5aa to 6ad0a75 Compare May 7, 2026 15:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant