Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

cicd: Add sanity test script
#2212 opened Dec 12, 2025 by kahyunnam Loading…
5 tasks done
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2131 opened Nov 22, 2025 by katec846 Loading…
3 of 5 tasks
A unified API for the MNNVL and single-node AllReduce kernels.
#2130 opened Nov 21, 2025 by nvmbreughe Loading…
3 of 5 tasks
Refactor trtllm_mnnvl_allreduce
#2118 opened Nov 20, 2025 by timlee0212 Loading…
5 tasks done
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar Loading…
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99 Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.