You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
refactor(hpc): bf16_tile_gemm fallback delegates to the polyfill (dedup)
PR #222 added ndarray::simd::bf16_tile_gemm_16x16 by copying the F32x16
kernel out of hpc::bf16_tile_gemm::fallback_path, leaving the same kernel in
two places. Collapse it: the polyfill fn is the single source of truth; the
hpc AMX wrapper's fallback now calls crate::simd::bf16_tile_gemm_16x16, with
the AMX TDPBF16PS tile path still layered on top. Drops the now-unused
F32x16 / bf16_to_f32_batch import.
Both suites pass (hpc fallback + simd_ops parity); clippy -D warnings + fmt clean.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01GJ4NVBSjq1w5h7RmTbVafb
0 commit comments