electroglyph
/

gemma-3-4b-it-unslop-GRPO-v3

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

electroglyph commited on Aug 26

Commit

784a049

·

verified ·

1 Parent(s): 557c24a

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -9,6 +9,8 @@ base_model: google/gemma-3-4b-it
 An unslop finetune of [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it)
 ### Changes from my previous test
 - Temperature during training was at 1.0 this time around, model is a lot less weird

 An unslop finetune of [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it)
+Next version is here: [gemma-3-4b-it-unslop-GSPO](https://huggingface.co/electroglyph/gemma-3-4b-it-unslop-GSPO)
 ### Changes from my previous test
 - Temperature during training was at 1.0 this time around, model is a lot less weird