-
Notifications
You must be signed in to change notification settings - Fork 593
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Move the run function definition out of BatchedGemmInterface
#2211
opened Dec 12, 2025 by
jhalabi-nv
•
Draft
5 tasks
Fix: Add mask_indptr conversion in BatchPrefillWithPagedKVCacheWrapper.plan()
#2201
opened Dec 11, 2025 by
Dutch-voyage
Loading…
5 tasks
Add CUDA graph buffers for persistent attention
#2185
opened Dec 7, 2025 by
Edenzzzz
Loading…
5 tasks
[Flashinfer-Bench integration] HF end-to-end inference
#2151
opened Nov 30, 2025 by
sfc-gh-goliaro
•
Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148
opened Nov 28, 2025 by
nvpohanh
Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2131
opened Nov 22, 2025 by
katec846
Loading…
3 of 5 tasks
A unified API for the MNNVL and single-node AllReduce kernels.
#2130
opened Nov 21, 2025 by
nvmbreughe
Loading…
3 of 5 tasks
feat: support variable sequence length in decode kernel of trtllm-gen attention
#2125
opened Nov 20, 2025 by
yaoyaoding
Loading…
4 of 5 tasks
feat: support more head dim in RoPE kernel
#2109
opened Nov 19, 2025 by
raayandhar
Loading…
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102
opened Nov 18, 2025 by
djns99
Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101
opened Nov 17, 2025 by
xuanzic
Loading…
5 tasks
feat: BF16 GEMM using CUTLASS backend for SM100
#2070
opened Nov 10, 2025 by
raayandhar
Loading…
5 tasks done
Rebase FP8 SM100 Cutlass FMHA Attention to main (original PR#1238)
#2047
opened Nov 5, 2025 by
pavanimajety
Loading…
5 tasks
Added an initial implementation of Q and KV Cache in fp8 and to use t…
#2035
opened Nov 4, 2025 by
Anerudhan
Loading…
5 tasks done
Refactor flashinfer/__init__.py so that applications could selectively pack submodules without modifying __init__.py
#2027
opened Nov 3, 2025 by
bangshengtang
Loading…
5 tasks done
refactor: backend_requirement + supported_compute_capability decorator for gemm
#2000
opened Oct 29, 2025 by
jimmyzho
Loading…
5 tasks
chore: agentic workflow for automatic version bump
#1947
opened Oct 19, 2025 by
yzh119
Loading…
5 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.