Skip to content

Commit 95a3864

Browse files
authored
Merge pull request #359 from nSircombe/feature/r25.08_updates
Updates to CHANGELOG and docs.
2 parents 101d456 + da8b7ff commit 95a3864

4 files changed

Lines changed: 8 additions & 7 deletions

File tree

ML-Frameworks/pytorch-aarch64/CHANGELOG.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ where `YY` is the year, and `MM` the month of the increment.
1515

1616
### Fixed
1717

18-
## [r25.08] 2025-08-27
18+
## [r25.08] 2025-08-28
1919
https://github.com/ARM-software/Tool-Solutions/tree/r25.08
2020

2121
### Added
@@ -43,6 +43,7 @@ https://github.com/ARM-software/Tool-Solutions/tree/r25.08
4343

4444
### Removed
4545
- https://github.com/pytorch/pytorch/pull/151547, to update OpenBLAS commit as this has been merged upstream.
46+
- Temporarily removed --compile flag from transformers_llm_text_gen.py example as there is an unresolved issue with torch compile support in this case.
4647

4748
### Fixed
4849
- Updates various Python packages to address known vulnerabilies with a high CVSS score:

ML-Frameworks/pytorch-aarch64/examples/README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -224,24 +224,24 @@ LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 OMP_NUM_THREADS=16 pyt
224224

225225
### Text Generation
226226

227-
The script [transformers_llm_text_gen.py](transformers_llm_text_gen.py) demonstrates how to generate text using Llama2 7B model via Transformers. It leverages the 4 bit dynamic quantization speedups and can supports vast number of text models.
227+
The script [transformers_llm_text_gen.py](transformers_llm_text_gen.py) demonstrates how to generate text using TinyLlama-1.1B-Chat-v1.0 model via Transformers. It leverages the 4 bit dynamic quantization and can support a wide range of text models.
228228

229229
Run inference using default (groupwise, layout-aware INT4) using tranformer call:
230230

231231
```
232-
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py --compile
232+
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py
233233
```
234234

235235
Run with symmetric_channelwise quantization:
236236

237237
```
238-
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py --quant-scheme symmetric_channelwise --compile
238+
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py --quant-scheme symmetric_channelwise
239239
```
240240

241241
Run with custom group size (e.g. 64):
242242

243243
```
244-
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py --quant-scheme symmetric_groupwise --groupsize 64 --compile
244+
LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4 TORCHINDUCTOR_CPP_WRAPPER=1 TORCHINDUCTOR_FREEZING=1 OMP_NUM_THREADS=16 python transformers_llm_text_gen.py --quant-scheme symmetric_groupwise --groupsize 64
245245
```
246246

247247

ML-Frameworks/pytorch-aarch64/examples/llama_vision_instruct.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -126,7 +126,7 @@ def main(args):
126126
parser.add_argument(
127127
"--prompt",
128128
type=str,
129-
default="Describe this image",
129+
default="what is the animal in this image",
130130
help="Input prompt.",
131131
)
132132
parser.add_argument(

ML-Frameworks/tensorflow-aarch64/CHANGELOG.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ where `YY` is the year, and `MM` the month of the increment.
1515

1616
### Fixed
1717

18-
## [r25.08] 2025-08-26
18+
## [r25.08] 2025-08-28
1919
https://github.com/ARM-software/Tool-Solutions/tree/r25.08
2020

2121
### Changed

0 commit comments

Comments
 (0)