AI & ML interests
None yet
Organizations
None yet
valerielucro/Qwen2.5-0.5B-Instruct-with-output-tokens
Text Generation
•
0.5B
•
Updated
valerielucro/qwen_0.5B-agent
Text Generation
•
0.5B
•
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-128-full
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-32-full
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-32-peft-merged
Text Generation
•
0.5B
•
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-16-peft-merged
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-8-peft-merged
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-32-peft
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-16-peft
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoch-8-peft
Updated
valerielucro/Qwen2-0.5B-agent-epochs-50-peft-merged-2
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-agent-epochs-50-peft-merged
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-agent-epochs-50-peft
Updated
valerielucro/Qwen2-0.5B-agent-epochs-20
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-agent-epochs-10
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-agent-epochs-32
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoc-1024-full
Text Generation
•
0.5B
•
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoc-256-full
Text Generation
•
0.5B
•
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-mni-epoc-64-full
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO-VLLM-30-epoch-merged
Text Generation
•
0.5B
•
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-8-epoch-merged
Text Generation
•
0.5B
•
Updated
•
3
valerielucro/Qwen2-0.5B-GRPO-VLLM-1-epoch-merged
Text Generation
•
0.5B
•
Updated
•
3
valerielucro/Qwen2-0.5B-GRPO-VLLM-30-epoch
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-8-epoch
Updated
valerielucro/Qwen2-0.5B-GRPO-VLLM-1-epoch
Updated
valerielucro/Qwen2-0.5B-GRPO_dummy
Text Generation
•
2.43M
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO_peft
Updated
valerielucro/Qwen2-0.5B-GRPO_20_epochs
Text Generation
•
0.5B
•
Updated
•
1
valerielucro/Qwen2-0.5B-GRPO_1_epochs
Text Generation
•
0.5B
•
Updated
•
1