
Commit a60dbe9

save work
1 parent c001e19 commit a60dbe9

1 file changed

Lines changed: 2 additions & 2 deletions


docs/softmax_lowering.md

@@ -337,7 +337,7 @@ Previsouly we tiled all ops in the parallel dimension only (i.e. non softmax dim
 
 ---
 
-## Decomposed → Tiled: Stage A - Tile div op
+## Stage A - Tile div op
 
 **Notes:**
 - Tile the division operation with step size 16 along dimension 1
@@ -357,7 +357,7 @@ scf.forall ... {
 
 // After: Division tiled into 64x16 chunks
 scf.forall ... {
-// Tiled Max, Center+Exp, Sum ops ...
+// Max, Center+Exp, Sum ops ...
 %11 = scf.for %arg4 = %c0_2 to %c64 step %c16 iter_args(%arg5 = %extracted_slice_0) -> (tensor<64x64xf32>) {
 %12 = linalg.generic {...} ins(%extracted_slice_3, %extracted_slice_4 : tensor<64x16xf32>, tensor<64xf32>) outs(%extracted_slice_5 : tensor<64x16xf32>) {
 ^bb0(%in: f32, %in_6: f32, %out: f32):
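
Reviewer note: for readers less used to the IR, the tiling in the second hunk can be sketched in plain Python. This is an illustration only, not the lowering itself. It assumes the elided `linalg.generic` body is an elementwise division in which the `tensor<64xf32>` operand is broadcast along dimension 1 (one divisor per row); the indexing maps are hidden behind `{...}` in the diff, so that broadcast is an assumption. The names `tiled_row_divide`, `numer`, and `denom` are hypothetical.

```python
ROWS, COLS, STEP = 64, 64, 16  # shapes and step size taken from the diff


def tiled_row_divide(numer, denom, step=STEP):
    """Divide each row of `numer` by the matching entry of `denom`,
    visiting dimension 1 in chunks of `step` columns."""
    rows, cols = len(numer), len(numer[0])
    # Plays the role of the tensor carried through iter_args.
    out = [[0.0] * cols for _ in range(rows)]
    # Mirrors: scf.for %arg4 = 0 to 64 step 16
    for col0 in range(0, cols, step):
        col1 = min(col0 + step, cols)
        # Assumed body of the tiled op on one rows x step slice.
        for r in range(rows):
            for c in range(col0, col1):
                out[r][c] = numer[r][c] / denom[r]
    return out


# Made-up input: positive values standing in for the exp results,
# with row sums as the divisors.
numer = [[float(r + c + 1) for c in range(COLS)] for r in range(ROWS)]
denom = [sum(row) for row in numer]

tiled = tiled_row_divide(numer, denom)
untiled = [[numer[r][c] / denom[r] for c in range(COLS)] for r in range(ROWS)]
```

Each of the four loop iterations writes one 64x16 slice of the result, so the tiled and untiled versions perform the same per-element divisions and should agree exactly.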
