forked from vllm-project/vllm
Forks: 48
Pull requests: ROCm/vllm
[kernels] tune NUM_SEGMENTS, attn_warps for TRITON_ATTN 3d kernel on gfx1151
#887
opened Apr 17, 2026 by
amd-callumm
5 tasks done
[ROCm][Refactor] Enable GPTQMarlinConfig on ROCm to use choose_mp_linear_kernel
#885
opened Apr 17, 2026 by
mgehre-amd
Tune hybrid_triton_w4a16 prefill kernel for gfx1151
#879
opened Apr 15, 2026 by
mgehre-amd
Draft
3 tasks done
Use fp16 fdot2 for bf16 int4 GEMV dequant on RDNA 3.5
#877
opened Apr 15, 2026 by
mgehre-amd
Draft
Add hybrid MoE kernel with wvSplitK int4 GEMM
#876
opened Apr 15, 2026 by
mgehre-amd
5 tasks done
Fix Triton WNA16 MoE fallback for CompressedTensorsWNA16MoEMethod
#875
opened Apr 15, 2026 by
mgehre-amd
Draft
Add gfx12 (RDNA4) Triton tile-size heuristic for W4A16 prefill kernel
#870
opened Apr 13, 2026 by
amd-xavierwang
3 of 5 tasks
Enable FLASH_ATTN backend with upstream flash-attn CK on ROCm
#866
opened Apr 10, 2026 by
mgehre-amd
Draft
1 task
[CI/Build] Fix Dockerfile.rocm_base image build for ROCm 7.2
Label: bug
#863
opened Feb 23, 2026 by
jbelloncastro
3 of 5 tasks
Add silu-and-mul and per-token dynamic FP8 quant fusion
Label: stale
#852
opened Jan 6, 2026 by
kliuae-amd
5 tasks
[Triton] fix rope kv_cache for RocmAiterUnifiedAttentionImpl + v.contiguous() remove
Label: stale
#851
opened Jan 2, 2026 by
k50112113
[rocm]use aiter triton kernel as triton mha fallback path
#809
opened Nov 14, 2025 by
zhuyuhua-v
Draft
add aiter fusion pattern for sequence parallel
#781
opened Oct 31, 2025 by
zhuyuhua-v
Draft
5 tasks