·
AI & ML interests
None yet
Organizations
None yet
RyanYr/grpo_neg0.5-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor
Updated
RyanYr/grpo_neg0.1-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-bs512-mbs8192-beta0.002-n16
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-bs512-beta0.002-n16
Updated
RyanYr/grpo_neg0.001-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor
Updated
RyanYr/grpo_neg0.01-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1350-ab3ac3de_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1350-ab3ac3de
Text Generation
•
2B
•
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1230-975b46d
Text Generation
•
2B
•
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref895-82bb89a_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_895-82bb89a
Text Generation
•
2B
•
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref730-585b44e_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-550_actor_730-585b44e
Text Generation
•
2B
•
Updated
RyanYr/brm-aime24-qwen2.5math-1.5B-base-lr2.5e-6-mbs1024-beta0.002-n4-ref350-c9317a9
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref550-a182cb1_actor
Updated
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_550-a182cb1
Text Generation
•
2B
•
Updated
RyanYr/brm-aime24-qwen2.5math-1.5B-base-lr2.5e-6-mbs1024-beta0.002-n4_350-c9317a9
Text Generation
•
2B
•
Updated
RyanYr/brm-aime24-qwen2.5math-1.5B-base-lr2.5e-6-mbs1024-beta0.002-n4
RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor
Updated
RyanYr/brm-aime24-qwen2.5math-1.5B-base-lr2.5e-6-beta0.002
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base_lr2e-6-beta0.002
RyanYr/brm-dapo-qwen2.5math-1.5B-base-lr2.5e-6-beta0.002
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-mbs512-beta0.002-n4
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2e-6-mbs512-beta0.002-n4
Updated
RyanYr/brm-dapo-qwen2.5math-1.5B-base-lr2.5e-6-beta0.002-n4
Updated
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-beta0.002
Updated