Pull requests: EmbeddedLLM/vllm

Pull requests list

support fp8 quant vllm ir on xpu
#82 opened Apr 17, 2026 by xinyu-intel
4 tasks
[ROCm] [Release] Update ROCm variant from rocm700 to rocm721
#80 opened Mar 28, 2026 by tjtanaa Member
5 tasks
[ROCm] Enable aiter group quant FP8 for RDNA4 gpus
#78 opened Mar 12, 2026 by big-yellow-duck
3 of 5 tasks
[DO NOT MERGE] Refactor/aiter integration
#76 opened Nov 17, 2025 by vllmellm Member Draft
5 tasks
Deepseek00 cuda graph
#72 opened Jul 8, 2025 by tjtanaa Member Draft
4 tasks
Skinny gemm
#71 opened Jul 8, 2025 by tjtanaa Member Draft
4 tasks
MLA
#70 opened Jul 3, 2025 by tjtanaa Member Draft
4 tasks