Skip to content
Snippets Groups Projects
Unverified Commit 32fdb843 authored by DefTruth's avatar DefTruth Committed by GitHub
Browse files

:fire:[BatchLLM] BatchLLM: Optimizing Large Batched LLM Inference with Global...

:fire:[BatchLLM] BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching (#104)

:fire:[BatchLLM] BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching
parent 9bb3f6a3
Branches
Tags
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment