AI & ML interests
None yet
Organizations
None yet
models 11
Blancy/Qwen3-1.7B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated
• 1
Blancy/Qwen3-0.6B-Open-R1-Distill
Text Generation
• 2B • Updated
• 6
Blancy/Qwen3-0.6B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 5
Blancy/Qwen3-1.7B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 20
• • 2
Blancy/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
• 2B • Updated
• 1
Blancy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 1
• 1
Blancy/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 9
Blancy/Qwen-2.5-7B-Simple-RL
Text Generation
• 8B • Updated
• 1
• 1
Blancy/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated
• 3
Blancy/DeepSeek-R1-Distill-Qwen-0.5B-GRPO
Text Generation
• 0.6B • Updated
• 39
datasets 43
Blancy/1ktestfrom10kwithdifficultyclasses_selfguided
Viewer
• Updated
• 1k • 16
Blancy/verifiable-coding-problems-SFT
Viewer
• Updated
• 1.09k • 99
Blancy/verifiable-coding-problems-CoT
Viewer
• Updated
• 1.09k • 13
Blancy/verifiable-coding-problems-python-filtered
Viewer
• Updated
• 2k • 19
Blancy/OpenThoughts-114k-Code_fit_code_reward
Viewer
• Updated
• 1k • 18
Blancy/OpenThoughts-114k-Code_oj_format
Viewer
• Updated
• 1k • 8
Blancy/OpenThoughts-114k-Code_decontaminated_final_verinfo
Viewer
• Updated
• 1k • 6
Blancy/OpenThoughts-114k-Code_decontaminated_final
Viewer
• Updated
• 1k • 6
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000_problem_leq400
Viewer
• Updated
• 1.82k • 6
Blancy/OpenThoughts-114k-Code_decontaminated_3000to5000
Viewer
• Updated
• 2.92k • 7