Missing MXFP4 quantized models for Qwen3-235B-A22B, MiniMax-M2.5, and GLM-5 #24

@sunway513

Summary

Several high-priority frontier MoE models are either missing from the AMD Quark MXFP4 quantized models collection on HuggingFace (amd/ org) or, in the case of GLM-5, published but not loadable. NVIDIA has already published NVFP4 versions of all of them, which leaves a benchmark gap.

Missing Models

| Model | HuggingFace ID | Params | Active | NVIDIA NVFP4 Available? |
|---|---|---|---|---|
| Qwen3-235B-A22B | Qwen/Qwen3-235B-A22B | 235B | 22B | nvidia/Qwen3-235B-A22B-NVFP4 |
| MiniMax-M2.5 | MiniMaxAI/MiniMax-M2.5 | 229B | 10B | nvidia/MiniMax-M2.5-NVFP4 |
| GLM-5 | zai-org/GLM-5 | 745B | 44B | nvidia/GLM-5-NVFP4 |

Existing AMD MXFP4 Models (for reference)

The following models already have MXFP4 versions:

  • amd/DeepSeek-R1-0528-MXFP4
  • amd/Kimi-K2.5-MXFP4
  • amd/GLM-5-MXFP4 ✅ (exists but has weight shape issues in ATOM — see below)
  • amd/Qwen3.5-397B-A17B-MXFP4
  • amd/Qwen3-Coder-Next-MXFP4

Why This Matters

  1. Benchmark parity: Without MXFP4 versions, MI355X benchmarks must fall back to FP8 (roughly 2x the weight footprint and half the compute density of a 4-bit format), which makes MI355X vs B300 comparisons unfair
  2. InferenceMAX/InferenceX: SemiAnalysis benchmarks use NVFP4 on NVIDIA side — AMD needs equivalent MXFP4 for apples-to-apples comparison
  3. Customer readiness: These are top-5 deployed open MoE models as of March 2026
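
To put rough numbers on the footprint argument, here is a back-of-the-envelope sketch. It assumes every weight is quantized (real checkpoints usually keep some layers in higher precision) and uses the block-scale layouts of the formats: MXFP4 stores one 8-bit scale per 32 elements, NVFP4 one 8-bit scale per 16 elements. The function name `weight_gb` is only illustrative.

```python
# Approximate weight-only storage per format. This ignores layers kept in
# higher precision and any per-tensor scales, so treat it as an estimate.
BITS_PER_WEIGHT = {
    "FP8": 8.0,
    "MXFP4": 4.0 + 8.0 / 32,  # 4-bit element + one 8-bit scale per 32-element block
    "NVFP4": 4.0 + 8.0 / 16,  # 4-bit element + one 8-bit scale per 16-element block
}

# Total parameter counts (in billions) as listed in the table above.
MODELS = {"Qwen3-235B-A22B": 235, "MiniMax-M2.5": 229, "GLM-5": 745}


def weight_gb(params_billion: float, fmt: str) -> float:
    """Return approximate weight storage in GB (10^9 bytes) for a format."""
    return params_billion * BITS_PER_WEIGHT[fmt] / 8


for name, params in MODELS.items():
    sizes = ", ".join(
        f"{fmt}: {weight_gb(params, fmt):.0f} GB" for fmt in BITS_PER_WEIGHT
    )
    print(f"{name}: {sizes}")
```

Under these assumptions FP8 weights are about 1.9x the size of MXFP4 weights (8 / 4.25), slightly under the nominal 2x because of the block scales.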

Additional Issue: GLM-5 MXFP4 Weight Shape Bug

amd/GLM-5-MXFP4 exists on HuggingFace but fails to load in ATOM with:

RuntimeError: The size of tensor a (3072) must match the size of tensor b (12288) at non-singleton dimension 1

This occurs in the weight loader during MXFP4 unpacking with TP=8. GLM-5 has hidden_size=6144, so each rank should see 6144 / 8 = 768 along the sharded dimension. Instead, the loader compares 3072 against an MXFP4 packed tensor dimension of 12288, which suggests the packed tensor carries an undivided size. Please verify the quantization output format.
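
As a debugging aid, the sizes in the error can be compared against quantities derived from hidden_size. The sketch below is only arithmetic on the numbers quoted in this issue, not ATOM's loader logic; the candidate labels and the `matches` helper are hypothetical. The one format fact it relies on is that two 4-bit values pack into one byte.

```python
HIDDEN_SIZE = 6144   # GLM-5 hidden_size as stated in this issue
TP = 8               # tensor-parallel degree of the failing run
VALUES_PER_BYTE = 2  # two 4-bit (FP4) values fit in one uint8

# Candidate interpretations of a dimension size (labels are illustrative).
CANDIDATES = {
    "hidden_size": HIDDEN_SIZE,
    "hidden_size / TP": HIDDEN_SIZE // TP,
    "hidden_size / 2 (packed, unsharded)": HIDDEN_SIZE // VALUES_PER_BYTE,
    "hidden_size / TP / 2 (packed, per rank)": HIDDEN_SIZE // TP // VALUES_PER_BYTE,
    "hidden_size * 2": HIDDEN_SIZE * 2,
}


def matches(size: int) -> list[str]:
    """Return the candidate labels whose value equals the given size."""
    return [label for label, value in CANDIDATES.items() if value == size]


for observed in (3072, 12288, 768):
    print(observed, "->", matches(observed) or "no match")
```

Neither size in the error matches a per-rank quantity: 3072 equals hidden_size / 2 and 12288 equals hidden_size * 2. These are numeric coincidences to guide the investigation rather than a diagnosis, since the affected weight may not be hidden_size-shaped at all.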

Request

  1. Publish amd/Qwen3-235B-A22B-MXFP4 (or Instruct variant)
  2. Publish amd/MiniMax-M2.5-MXFP4
  3. Fix and re-publish amd/GLM-5-MXFP4 (weight shape issue)

cc @quark-team
