refactor(maca): adapt MACA kernels to new dtype dispatch and Scalar APIs
Port the MACA backend to master's backend-explicit dtype registration:
- Add src/core/runtime/maca/maca_dispatch.h: register __half / __maca_bfloat16
via BackendTypeMap<kMACA, kFLOAT16/kBFLOAT16>, declare
INFINI_REGISTER_STANDARD_BACKEND_TYPES(kMACA), and expose DispatchMacaFunc /
MacaTypeMap mirroring the CUDA side.
- Replace every DispatchFunc<...>/WidestType_t/DataTypeMap_v site across 18
MACA kernels with DispatchMacaFunc / PromoteDataTypes.
- Replace Tensor::Fill<T>(0) template calls with Fill(0) to match the new
Scalar-taking Tensor::Fill API.
- fill.maca: route Scalar::to<T> through common::maca::Cast<T>(scalar.to<float>())
for __maca_bfloat16/__half to avoid ambiguous static_cast from integer
Scalar kinds (see scalar.h TODO).
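
The pattern described in the bullets above can be sketched in plain host C++. This is a minimal illustration only, not the code in this commit: `FakeHalf`, `TypeTag`, `DispatchSketch`, `ScalarTo`, `FillAndReadBack`, and the simplified `Scalar`, `Cast`, and `BackendTypeMap` below are hypothetical stand-ins, and the real signatures in maca_dispatch.h and scalar.h may differ. It assumes the device half type has more than one floating-point constructor, which is what makes a direct cast from an integer ambiguous.

```cpp
#include <cstdint>
#include <stdexcept>
#include <type_traits>
#include <variant>

// Hypothetical stand-in for a device half type such as __half. With both a
// float and a double constructor, static_cast<FakeHalf>(int64_t{1}) would be
// ambiguous, because int64_t converts equally well to either parameter type.
struct FakeHalf {
    float value;
    explicit FakeHalf(float v) : value(v) {}
    explicit FakeHalf(double v) : value(static_cast<float>(v)) {}
};

enum class Backend { kCPU, kMACA };
enum class DataType { kFLOAT32, kFLOAT16 };

// Backend-explicit registration: each (backend, dtype) pair names its own
// native type. The primary template is left undefined on purpose.
template <Backend B, DataType D> struct BackendTypeMap;
template <> struct BackendTypeMap<Backend::kMACA, DataType::kFLOAT32> { using type = float; };
template <> struct BackendTypeMap<Backend::kMACA, DataType::kFLOAT16> { using type = FakeHalf; };

// Simplified Scalar that remembers whether it was built from an integer.
class Scalar {
public:
    template <typename T, typename = std::enable_if_t<std::is_arithmetic_v<T>>>
    Scalar(T v) {
        if constexpr (std::is_integral_v<T>) v_.emplace<std::int64_t>(v);
        else v_.emplace<double>(v);
    }
    template <typename T> T to() const {
        static_assert(std::is_arithmetic_v<T>, "to<T> only handles builtin arithmetic types here");
        return std::visit([](auto x) { return static_cast<T>(x); }, v_);
    }
private:
    std::variant<std::int64_t, double> v_;
};

// Conversion helper: the argument is always float, so the float constructor
// of FakeHalf is an exact match and the cast is unambiguous.
template <typename T> T Cast(float x) { return static_cast<T>(x); }

// Builtin types convert directly; everything else goes through float first.
template <typename T> T ScalarTo(const Scalar& s) {
    if constexpr (std::is_arithmetic_v<T>) return s.to<T>();
    else return Cast<T>(s.to<float>());
}

template <typename T> struct TypeTag { using type = T; };

// Runtime dtype -> compile-time type, resolved through the MACA type map.
template <typename F> auto DispatchSketch(DataType dtype, F&& f) {
    switch (dtype) {
    case DataType::kFLOAT32:
        return f(TypeTag<BackendTypeMap<Backend::kMACA, DataType::kFLOAT32>::type>{});
    case DataType::kFLOAT16:
        return f(TypeTag<BackendTypeMap<Backend::kMACA, DataType::kFLOAT16>::type>{});
    }
    throw std::invalid_argument("unsupported dtype");
}

// Converts the scalar to the dispatched type and reads it back as float.
float FillAndReadBack(DataType dtype, const Scalar& s) {
    return DispatchSketch(dtype, [&](auto tag) {
        using T = typename decltype(tag)::type;
        T v = ScalarTo<T>(s);
        if constexpr (std::is_same_v<T, FakeHalf>) return v.value;
        else return static_cast<float>(v);
    });
}
```

In this sketch, `FillAndReadBack(DataType::kFLOAT16, Scalar(3))` takes the float detour even though the scalar holds an integer, which is the case the fill.maca bullet describes.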