Partly broken

#1
by redaihf - opened

This model has multiple personality disorder. It produces intelligent and uncensored instructions for tasks when requested but then exhibits various forms of noncompliance for sensitive prompts incorporating them. Its noncompliance is not subtle like the original and includes refusals as well as shortened responses and incoherent tag loops such as:

<SPECIAL_28> text:pineapple pizza<SPECIAL_28><SPECIAL_28><SPECIAL_27><SPECIAL_28><SPECIAL_28><SPECIAL_28><SPECIAL_28><SPECIAL_28> [snip...]

* where "pineapple pizza" relates to the subject of the prompt h/t @MuXodious

Owner

It's possible that these incoherent loops are a result of della getting distorted by mixing multiple 2509, 2506, 2503 and 2501 donors. Compared to the original slimaki, this one uses a second 2501 donor, which is known to have a massive LR norm discrepancy from later versions of Mistral. This could have caused too much distortion in certain conditions that leads to broken responses.

Most other merge methods are even more fragile and would likely break harder if trying to merge all 4 versions of MS 24B at once. Something like sce might produce more stable results but I'd have to test it.

Owner

It would be interesting to see if you notice any non compliance, gibberish or early terminations with this model (using chatml)

https://huggingface.co/DarkArtsForge/Morbid-Miasma-12B

It was made entirely with ablated donors, using unablated base_model and a custom method aether

Morbid Miasma is a creative and very uncensored model. Its ability to follow the prompt is imperfect regardless of the safety of the content. It sometimes exhibits early terminations and these seem to occur more often when generating unsafe text.

Sign up or log in to comment