Skip to content

Commit 8e2511d

Browse files
committed
Adds PyTorch patch to implement bf16 GELU via LUT.
Gives ~8x speedups.
1 parent 222ca5c commit 8e2511d

2 files changed

Lines changed: 5 additions & 0 deletions

File tree

ML-Frameworks/pytorch-aarch64/CHANGELOG.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,8 @@ where `YY` is the year, and `MM` the month of the increment.
88
## [unreleased]
99

1010
### Added
11+
- Adds https://github.com/pytorch/pytorch/pull/159859, a WIP LUT implmentation of bf15 GELU
12+
~8x speedup over existing oneDNN implementation
1113

1214
### Changed
1315
- Updates hashes for:

ML-Frameworks/pytorch-aarch64/get-source.sh

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,9 @@ git-shallow-clone https://github.com/pytorch/pytorch.git $PYTORCH_HASH
3939
# https://github.com/pytorch/pytorch/pull/160184 - Draft: separate reqs for manywheel build and pin
4040
apply-github-patch pytorch/pytorch 9a8b0df99eac62e7ec6199dd0223a80d26e2dee0
4141

42+
# https://github.com/pytorch/pytorch/pull/159859 - PoC LUT optimisation for GELU bf16 operators
43+
apply-github-patch pytorch/pytorch 51626269d3730df1a6b465fa0191074fc31f7c29
44+
4245
git submodule sync
4346
git submodule update --init --checkout --force --recursive --jobs=$(nproc)
4447
(

0 commit comments

Comments
 (0)