forked from Dao-AILab/flash-attention
Pull requests: PaddlePaddle/flash-attention
Pull requests list
support csrc/flash_attn_with_bias_and_mask/src/fmha/smem_tile.h cuda132 build
#153 opened May 25, 2026 by gouzil (Member)
[Sm100] FlashMask V4 fwd support head dim 256 via Split-D (q_stage=1, d==dv)
#152 opened May 22, 2026 by wangxudong10
Fix CUDA 13.2 flash attention build compatibility
#141 opened Apr 28, 2026 by gouzil (Member)
[Feat] CP-balance formal incorporation as flash_mask sub-module
#127 opened Apr 9, 2026 by Enigmatisms
Support Global Sliding Window (num_vec == 4) on FM4 BWD
#111 opened Mar 3, 2026 by umiswing (Member)
add flashmask v2 torch flash_api.cpp flashmask_interface.py setup.py
#98 opened Dec 23, 2025 by clouds1238
Removed redundant templates and related compile-time/runtime code
#91 opened Nov 14, 2025 by Enigmatisms
1 task
scan from right to left and skip masked block for each row at kernel begin
#55 opened Sep 23, 2024 by GuoxiaWang (Collaborator)
Fix unpadding input with padding mask compute error
#38 opened Apr 15, 2024 by wwbitejotunn