Contents:
Hugging Face Conceptual Guides: Flash Attention
Dao-AILab/flash-attention
FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling