Skip to content
Snippets Groups Projects
Unverified Commit d7dac51f authored by JYYHH's avatar JYYHH Committed by GitHub
Browse files

Add paper: DeFT: Flash Tree-attention with IO-Awareness for Efficient...

Add paper: DeFT: Flash Tree-attention with IO-Awareness for Efficient Tree-search-based LLM Inference
parent 944329f6
Branches
No related tags found
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment