Skip to content
Snippets Groups Projects
Unverified Commit bb1f1171 authored by DefTruth's avatar DefTruth Committed by GitHub
Browse files

:fire::fire:[DeServe] DESERVE: TOWARDS AFFORDABLE OFFLINE LLM INFERENCE VIA DECENTRALIZATION(#116)

parent a523f1df
Branches
Tags
No related merge requests found
......@@ -100,6 +100,7 @@ Awesome-LLM-Inference: A curated list of [📙Awesome LLM Inference Papers with
|:---:|:---:|:---:|:---:|:---:|
|2024.01|🔥🔥[**DistServe**] DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving(@PKU)|[[pdf]](https://arxiv.org/pdf/2401.09670)|[[DistServe]](https://github.com/LLMServe/DistServe) ![](https://img.shields.io/github/stars/LLMServe/DistServe.svg?style=social) |⭐️⭐️ |
|2024.12|🔥🔥[**KVDirect**] KVDirect: Distributed Disaggregated LLM Inference(@ByteDance)|[[pdf]](https://arxiv.org/pdf/2501.14743)|⚠️|⭐️ |
|2025.01|🔥🔥[**DeServe**] DESERVE: TOWARDS AFFORDABLE OFFLINE LLM INFERENCE VIA DECENTRALIZATION(@Berkeley)|[[pdf]](https://arxiv.org/pdf/2501.14784)|⚠️|⭐️ |
### 📖LLM Algorithmic/Eval Survey ([©️back👆🏻](#paperlist))
<div id="LLM-Algorithmic-Eval-Survey"></div>
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment