Repository for the paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability".
Qihan Ren
jasonrqh
AI & ML interests
explainable AI, LLM
Recent Activity
upvoted a paper 2 days ago
MMSkills: Towards Multimodal Skills for General Visual Agents
upvoted a paper 14 days ago
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration