❌ This version of handling ViT activation (through trucation) is problematic! The latest version will use the pytorch built-in method for saving and tracking activation and gradient.#2
Open
TyBruceChen wants to merge 10 commits intoold-versionfrom
Commits
Commits on Mar 22, 2026
- committed
- authored
- authored
- authored
- committed
- committed
- committed
- committed
- committed
- authored