arxiv:2509.25133
Frank Chen
quantumfr
AI & ML interests
Alignment and interpretability
Recent Activity
upvoted a paper about 1 month ago
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
upvoted a paper about 1 month ago
ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety
upvoted a paper about 1 month ago
DARE: Diffusion Large Language Models Alignment and Reinforcement Executor