- OLMoE
- FLUX that Plays Music
- Loopy
- LongLLaVA
- In Defense of RAG in the Era of Long-Context Language Models
- FuzzCoder
- Guide-and-Rescale
- LongCite
- MMMU-Pro
- Kvasir-VQA
- LongRecipe
- VQ4DiT
- xLAM
- AlphaProteo
- Late Chunking
overview for each + authors' explanations
read this in thread mode for the best experience
