Commit e4199ea
committed
Add CUDA wishlist document for persistent GPU actor features
Comprehensive analysis of CUDA features that would enable true
persistent GPU actors, based on RingKernel implementation experience.
Key proposals include:
- Native host-kernel signaling (replace polling)
- Kernel-to-kernel mailboxes (first-class messaging)
- Dynamic block scheduling (work stealing)
- Persistent kernel preemption (cooperative)
- Checkpointing and migration (fault tolerance)
- Extended cooperative groups (hierarchical sync)
- Memory model enhancements (SC atomics)
- Multi-GPU persistent kernels
https://claude.ai/code/session_01TD1CHULcRkSAJ1KUqyhpF91 parent d5e7844 commit e4199ea
1 file changed
Lines changed: 727 additions & 0 deletions
0 commit comments