• Qwen2.5-Coder Technical Report
  • GRIN
  • Moshi
  • Training Language Models to Self-Correct via RL
  • To CoT or not to CoT?
  • OmniGen
  • NVLM
  • Qwen2-VL
  • Kolmogorov-Arnold Transformer
  • InfiMM-WebMath-40B
  • MMSearch
  • LVCD
  • Scaling Smart
  • Language Models Learn to Mislead Humans via RLHF
  • A Controlled Study on Long Context Extension and Generalization in LLMs
  • LLMs + Persona-Plug = Personalized LLMs
  • Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
  • Promptriever
  • Phidias
  • On the limits of agency in agent-based models