MoeReward/rl_checkpoints
Project of MoE reward model
Safetensors
main
Commit History
qwen2.5 final checkpoints (bfe5658), shengyi-qian committed on Jun 27, 2025
upload qwen2.5 7 checkpoints (09ddbb5), shengyi-qian committed on Jun 27, 2025
drgrpo checkpoints (2581e08), shengyi-qian committed on Apr 21, 2025
upload diffdomain1 (eaa1e52), shengyi-qian committed on Apr 16, 2025
upload diffdomain1 (54cb21f), shengyi-qian committed on Apr 16, 2025
upload diff rewards (3a91c0a), shengyi-qian committed on Apr 12, 2025
nq checkpoint (d2c0d5d), shengyi-qian committed on Apr 10, 2025
three checkpoints (9c87696), shengyi-qian committed on Apr 9, 2025
qwen1.5 rule based (1a74a1a), shengyi-qian committed on Apr 7, 2025
initial commit (b90be05, verified), shengyi-qian committed on Apr 7, 2025