finegrained-fp8: [bugfix] add epsilon to act_quant scale to avoid NaNs. by wz-ml · Pull Request #680 · huggingface/kernels-community

wz-ml · 2026-04-30T21:41:58Z

A quick single-line change to fix this issue I was running into with this FP8 kernel:
When an X block is all zeros, the scale factor's also zero, which causes the kernel to return 0/0 = NaN.

…s are 0 have a scale factor of 0 (yielding 0/0 = NaN)

github-actions · 2026-04-30T21:42:19Z

Hi @wz-ml, thanks for your interest in contributing!

This project requires that pull request authors are vouched, and you are not in the list of vouched users.

This PR will be closed automatically. See https://github.com/huggingface/kernels-community/blob/main/CONTRIBUTING.md for more details.

github-actions · 2026-05-08T06:26:07Z

Hi @wz-ml, thanks for your interest in contributing!

This project requires that pull request authors are vouched, and you are not in the list of vouched users.

This PR will be closed automatically. See https://github.com/huggingface/kernels-community/blob/main/CONTRIBUTING.md for more details.

danieldk · 2026-05-08T06:26:53Z

@IlyasMoutawwakil could you do a review?

IlyasMoutawwakil · 2026-05-08T08:03:12Z

    offs = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    x = tl.load(x_ptr + offs).to(tl.float32)
    s = tl.max(tl.abs(x)) / 448.0  # float8_e4m3fn max
+    s = tl.maximum(s, 1e-12)


yes this actually needed, thanks for the fix.

IlyasMoutawwakil · 2026-05-08T08:04:07Z

it seems i can't approve the pr 🥲

sayakpaul · 2026-05-29T12:05:33Z

@IlyasMoutawwakil this looks good to you?

IlyasMoutawwakil · 2026-05-29T12:06:56Z

@sayakpaul yes i couldn't approve it before

Add epsilon fix; otherwise, fp8 kernels with blocks where all element…

ca49909

…s are 0 have a scale factor of 0 (yielding 0/0 = NaN)

wz-ml requested review from danieldk and drbh as code owners April 30, 2026 21:41

github-actions Bot closed this Apr 30, 2026

wz-ml mentioned this pull request Apr 30, 2026

Bug: finegrained-fp8 act_quant produces NaNs on zero blocks #681

Open

danieldk reopened this May 8, 2026

github-actions Bot closed this May 8, 2026

danieldk self-assigned this May 8, 2026

IlyasMoutawwakil reviewed May 8, 2026

View reviewed changes

sayakpaul reopened this May 29, 2026

Merge branch 'main' into main

48451a8

sayakpaul linked an issue May 29, 2026 that may be closed by this pull request

Bug: finegrained-fp8 act_quant produces NaNs on zero blocks #681

Open

IlyasMoutawwakil approved these changes May 29, 2026

View reviewed changes

sayakpaul mentioned this pull request May 29, 2026

Security audit workflow: support external PRs without forking #901

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finegrained-fp8: [bugfix] add epsilon to act_quant scale to avoid NaNs.#680

finegrained-fp8: [bugfix] add epsilon to act_quant scale to avoid NaNs.#680
wz-ml wants to merge 2 commits into
huggingface:mainfrom
wz-ml:main

wz-ml commented Apr 30, 2026

Uh oh!

github-actions Bot commented Apr 30, 2026

Uh oh!

github-actions Bot commented May 8, 2026

Uh oh!

danieldk commented May 8, 2026

Uh oh!

IlyasMoutawwakil May 8, 2026

Uh oh!

IlyasMoutawwakil commented May 8, 2026

Uh oh!

sayakpaul commented May 29, 2026

Uh oh!

IlyasMoutawwakil commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

wz-ml commented Apr 30, 2026

Uh oh!

github-actions Bot commented Apr 30, 2026

Uh oh!

github-actions Bot commented May 8, 2026

Uh oh!

danieldk commented May 8, 2026

Uh oh!

IlyasMoutawwakil May 8, 2026

Choose a reason for hiding this comment

Uh oh!

IlyasMoutawwakil commented May 8, 2026

Uh oh!

sayakpaul commented May 29, 2026

Uh oh!

IlyasMoutawwakil commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants