Inference Providers
Active filters: dpo
sweepai/sweep-next-edit-v2-7B
Text Generation
• 8B • Updated • 1.01k
• 23
F16/z-image-turbo-flow-dpo
Feature Extraction
• Updated • 175
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation
• 8B • Updated • 15.7k
• • 269
wololoo/Llama-3.2-3B-TR-Instruct-DPO
Text Generation
• 3B • Updated • 28
• • 2
Text Generation
• 4B • Updated • 103
• 1
mradermacher/dotnet-coder-14b-GGUF
15B • Updated • 524
• 1
VladShash/deepseek-math-7B-lean-prover-grpo-olmo-weighed
Text Generation
• 7B • Updated • 3.62k
• 1
HCY123902/llama-3-8b-inst-dpo-on-p-twj-beta-1e-0
Text Generation
• 266k • Updated • 263
• 1
Olak17/Qwen2.5-Coder-1.5B-Unsensored-DPO-i1-GGUF
2B • Updated • 3.59k
• 2
BugTraceAI/BugTraceAI-Apex-G4-26B-Q4
25B • Updated • 17.8k
• 57
zipaltrivedi/dotnet-coder-14b
Text Generation
• 15B • Updated • 4.25k
• 5
apol/alia-40b-distill-vapol
Text Generation
• Updated • 1.99k
• 2
F16/z-image-turbo-masked-dpo
Text-to-Image
• Updated • • 19
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
• Updated • 1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
• Updated • 9
• 12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
• 13B • Updated • 12
• 19
lewtun/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 7
alignment-handbook/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 13
• 3
alignment-handbook/zephyr-7b-dpo-qlora
Updated • 21
• 9
Text Generation
• Updated • 12
• 7
argilla/notus-7b-v1-lora-adapter
Text Generation
• Updated • 3
Text Generation
• 7B • Updated • 97
• 123
ContextualAI/archangel_sft_pythia1-4b
Text Generation
• 1B • Updated • 5
ContextualAI/archangel_sft_pythia2-8b
Text Generation
• 3B • Updated • 12
• 1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
• 7B • Updated • 15
ContextualAI/archangel_sft_pythia12-0b
Text Generation
• 12B • Updated • 20
ContextualAI/archangel_sft_llama7b
Text Generation
• 7B • Updated • 14
• 1
ContextualAI/archangel_sft_llama13b
Text Generation
• 13B • Updated • 54
ContextualAI/archangel_sft_llama30b
Text Generation
• 33B • Updated • 9
ContextualAI/archangel_slic_llama30b
Text Generation
• 33B • Updated • 6