AI by Punit
February 24, 2025 at 06:18 PM
https://github.com/deepseek-ai/FlashMLA
Deep seek released flashMLA inspired by flash attention.
MLA - multi head latent attention