Skip to content
GitLab
Explore
Sign in
Register
v2.6.5
06c76ad3
·
🔥
🔥
[TP: Comm Compression] Communication Compression for Tensor Parallel LLM Inference (#94)
·
Nov 18, 2024