- Qwen2.5-Coder Technical Report
- GRIN
- Moshi
- Training Language Models to Self-Correct via RL
- To CoT or not to CoT?
- OmniGen
- NVLM
- Qwen2-VL
- Kolmogorov-Arnold Transformer
- InfiMM-WebMath-40B
- MMSearch
- LVCD
- Scaling Smart
- Language Models Learn to Mislead Humans via RLHF
- A Controlled Study on Long Context Extension and Generalization in LLMs
- LLMs + Persona-Plug = Personalized LLMs
- Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
- Promptriever
- Phidias
- On the limits of agency in agent-based models