-
0faf3bf1 · CacheCraft: A Relevant Work on Chunk-Aware KV Cache Reuse for RAG (...
-
eb7e0d01 · Update DeepSeek/MLA Topics (#125)
-
99280223 · Add DeepSeek Open Sources modules (#124)
-
d32b3dde · update the title of SageAttention2 and add SpargeAttn (#123)
-
cc978444 ·
🔥 [MHA2MLA] Towards Economical Inference: Enabling DeepSeek’s Multi-...
-
4cb87630 · Add our ICLR2025 work Dynamic-LLaVA (#121)
-
0525c4d4 ·
🔥 [DeepSeek-NSA] Native Sparse Attention: Hardware-Aligned and Nativ...
-
1ddf093b · Add Multi-head Latent Attention(MLA) topic (#118)
- ... and 3 more commits. Compare b117b3c1...d7914c03
-
b117b3c1 · [feat] add deepseek-r1 (#113)
-
e49ea483 · Update README.md
- ... and 1 more commit. Compare 295f4088...e49ea483