Skip to content
Snippets Groups Projects
Unverified Commit 4cb87630 authored by Blank-z0's avatar Blank-z0 Committed by GitHub
Browse files

Add our ICLR2025 work Dynamic-LLaVA (#121)

Add paper "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification"
Dynamic-LLaVA is the first MLLM acceleration framework that simultaneously sparsifies both vision and language contexts while integrating inference efficiency optimization across different MLLM inference modes into a unified framework. In practice, Dynamic-LLaVA can achieve additional inference efficiency throughout the entire generation process, with negligible understanding and generation ability degradation or even performance gains compared to the full-context inference baselines.
GitHub: https://github.com/Osilly/dynamic_llava
parent fe502d8c
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment