I am very sad to say that the budget for creating the SnowflakeCore-G1 1B and 7B MoE models has run out, and I can't pre-train them anymore.
Training for SnowflakeCore-G1-1B and 7B will be restarted, because I have now implemented DeepSpeed and managed to use two GPUs.
Liquid Claude
Liquid Claude is a small series of LiquidAI/LFM2.5-1.2B-Thinking models that have been fine-tuned on Claude chats/data.
FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6 • Text Generation • 1B • Updated 1 day ago • 1.38k • 2
FlameF0X/LFM2.5-1.2B-Distilled-Claude • Text Generation • 1B • Updated 25 days ago • 731 • 1
FlameF0X/Qwen3-4B-Distilled-Claude-4.6 • Text Generation • 4B • Updated 29 days ago • 284
FlameF0X/LFM2.5-1.2B-Distilled-Claude-4.6-GGUF • 1B • Updated 16 days ago • 3.48k • 4
Nano Stuff
A collection of all the Nano models I have made, ranging from SR to music generation and more.
FlameF0X/NanoSR-6x • Image-to-Image • Updated 1 day ago • 2
FlameF0X/NanoSR • Viewer • Updated 7 days ago • 1.6k • 1.45k • 1
FlameF0X/NanoStudio • Text-to-Audio • Updated 1 day ago