M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang (JunxiongWang)
AI & ML interests
Attention Free Model / Subquadratic Language Models
Models 51
JunxiongWang/M1-3B
Text Generation • 3B • Updated • 6 • 2
JunxiongWang/M1-3B-SFT
Text Generation • 3B • Updated • 7 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B • Updated • 1
JunxiongWang/MambaInLlama3B_SFT_MATH
3B • Updated
JunxiongWang/MambaInLlama3B_DPO2
3B • Updated • 4
JunxiongWang/MambaInLlama3B_DPO1
3B • Updated • 2
JunxiongWang/MambaInLlama3B_Distill_MATH
3B • Updated • 2
JunxiongWang/MambaInLlama3B_v3
3B • Updated • 1
JunxiongWang/MambaInLlama1B_Distill_MATH
1B • Updated • 2
JunxiongWang/mamba_0_5_distill
Updated • 1
Datasets 20
JunxiongWang/QwenFineMATH
Viewer • Updated • 6.71M • 143
JunxiongWang/R1_GR_SFT
Viewer • Updated • 44k • 16
JunxiongWang/R1_SFT
Updated • 73
JunxiongWang/R1_Sythetic_SFT
Viewer • Updated • 1M • 152
JunxiongWang/MATH_SFT
Viewer • Updated • 19.1M • 61
JunxiongWang/R1_OpenThoughts_SFT
Viewer • Updated • 862k • 155
JunxiongWang/R1_am_SFT
Viewer • Updated • 1.4M • 309
JunxiongWang/qwen1b_it_math
Viewer • Updated • 19.1M • 90
JunxiongWang/test_math
Viewer • Updated • 89.1k • 91
JunxiongWang/FineMathV4
Viewer • Updated • 6.7M • 209