.. _flashattention:

======================
FlashAttention
======================

References
==========

- `Hugging Face Conceptual Guides: Flash Attention `_
- `Dao-AILab/flash-attention `_
- `FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling `_