Add some AITER kernel routing for ROCm #46268

Draft
Abdennacer-Badaoui wants to merge 3 commits into huggingface:main from Abdennacer-Badaoui:some-rocm-kernels
Abdennacer-Badaoui (Member) commented May 28, 2026

Routes ROCm to AITER Triton kernels on AMD GPUs:

Waiting on kernels-community PR #890, which makes the aiter-flash-attn Hub artifact ship its per-arch MHA autotune configs. Without those configs, the kernel raises FileNotFoundError on first call.
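For reviewers who want to see the failure mode concretely, here is a minimal sketch of how a caller could guard against it until the upstream fix lands. This is not what the PR does (the PR waits for the artifact to ship the configs). `GuardedKernel`, `aiter_attention_without_configs`, and `reference_attention` are hypothetical names made up for illustration, and the stand-in functions only simulate the kernel.

```python
from typing import Any, Callable


class GuardedKernel:
    """Call a primary kernel, switching to a fallback if config files are missing.

    Hypothetical helper for illustration only; not part of transformers or kernels.
    """

    def __init__(self, primary: Callable[..., Any], fallback: Callable[..., Any]):
        self.primary = primary
        self.fallback = fallback
        self.primary_disabled = False  # set once the primary has failed

    def __call__(self, *args: Any, **kwargs: Any) -> Any:
        if not self.primary_disabled:
            try:
                return self.primary(*args, **kwargs)
            except FileNotFoundError:
                # Missing autotune configs: stop retrying the primary kernel.
                self.primary_disabled = True
        return self.fallback(*args, **kwargs)


# Stand-in that simulates a kernel whose autotune configs are not shipped.
def aiter_attention_without_configs(q, k, v):
    raise FileNotFoundError("autotune config not found")


# Stand-in for any attention implementation that needs no config files.
def reference_attention(q, k, v):
    return ("reference", q, k, v)


attention = GuardedKernel(aiter_attention_without_configs, reference_attention)
result = attention(1, 2, 3)
```

After the first failed call the wrapper stops retrying the primary kernel, so the exception cost is paid once rather than on every forward pass.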

Longer-term goal: ship the full AITER in Kernels (same model as liger-kernels) rather than one repo per kernel. We're starting with aiter-rope and aiter-flash-attn because those are the two we need in transformers right now.

Abdennacer-Badaoui marked this pull request as draft May 28, 2026 17:15
@HuggingFaceDocBuilderDev
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
