Commit 15949e0

Merge pull request #372 from jondea/fix-perf-drop-after-torchao-api-change
pytorch: fix perf issue from torchao API change
2 parents bf70ba2 + 418c998 commit 15949e0

3 files changed

Lines changed: 3 additions & 0 deletions

ML-Frameworks/pytorch-aarch64/CHANGELOG.md

Lines changed: 1 addition & 0 deletions
@@ -20,6 +20,7 @@ where `YY` is the year, and `MM` the month of the increment.
 - KLEIDIAI_HASH to bd2e6ae060014035e25bf4986be682762c446c2d, v1.14 from main.
 - Update torchvision from 0.23.0 to a nightly build, 0.25.0.dev20250923
 - Change of flag name in `./build.sh` from `--force` to `--fresh`
+- Add `intx_packing_format="opaque_aten_kleidiai"` to `Int8DynamicActivationIntxWeightConfig` due to torchao API change

 ### Removed
 - Removes WIP ComputeLibrary patch https://review.mlplatform.org/c/ml/ComputeLibrary/+/12818/1.

ML-Frameworks/pytorch-aarch64/examples/llama_vision_instruct.py

Lines changed: 1 addition & 0 deletions
@@ -52,6 +52,7 @@ def main(args):
             weight_mapping_type=MappingType.SYMMETRIC_NO_CLIPPING_ERR, # MappingType.SYMMETRIC can also be used but increases error
             layout=layout,
             weight_dtype=torch.int4,
+            intx_packing_format="opaque_aten_kleidiai",
         ),
     )

ML-Frameworks/pytorch-aarch64/examples/transformers_llm_text_gen.py

Lines changed: 1 addition & 0 deletions
@@ -113,6 +113,7 @@ def get_quantized_model(args):
         weight_mapping_type=mapping_type,
         layout=layout,
         weight_dtype=torch.int4,
+        intx_packing_format="opaque_aten_kleidiai",
     )

     print("Quantization config:")
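
Note: the commit attributes this change to a torchao API change, so a script that has to run against more than one torchao version might prefer to pass the new keyword only when the constructor accepts it. The following is a minimal sketch of that idea, not part of this commit. `supported_kwargs`, `OldConfig` and `NewConfig` are hypothetical names introduced here for illustration; the two classes are stand-ins that only mimic a constructor before and after the keyword was added, and are not the real `Int8DynamicActivationIntxWeightConfig`.

```python
import inspect


def supported_kwargs(config_cls, **kwargs):
    """Return only the keyword arguments that config_cls's constructor accepts."""
    params = inspect.signature(config_cls).parameters
    # A **kwargs catch-all in the constructor means every keyword is accepted.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return dict(kwargs)
    return {name: value for name, value in kwargs.items() if name in params}


# Stand-in classes (assumption): NOT the torchao config class.
class OldConfig:
    def __init__(self, weight_dtype=None, layout=None):
        self.weight_dtype = weight_dtype
        self.layout = layout


class NewConfig:
    def __init__(self, weight_dtype=None, layout=None, intx_packing_format=None):
        self.weight_dtype = weight_dtype
        self.layout = layout
        self.intx_packing_format = intx_packing_format


# Placeholder values; a real script would pass torch.int4 and a layout object.
requested = dict(
    weight_dtype="int4",
    layout="layout-placeholder",
    intx_packing_format="opaque_aten_kleidiai",
)

old_cfg = OldConfig(**supported_kwargs(OldConfig, **requested))
new_cfg = NewConfig(**supported_kwargs(NewConfig, **requested))
```

With the real class, the call would look like `Int8DynamicActivationIntxWeightConfig(**supported_kwargs(Int8DynamicActivationIntxWeightConfig, **requested))`. Silently dropping a keyword can hide a misconfiguration, so logging the dropped names may be the safer design.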
