# Metadata Source URL:: https://arxiv.org/pdf/2205.14135.pdf Topics:: #ai, #llm --- # FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness ## Highlights