Active filters: fp8
Text Generation
• 12B • Updated • 2.94k
• 3
tngtech/DeepSeek-R1T-Chimera
Text Generation
• 685B • Updated • 75
• 272
tngtech/DeepSeek-TNG-R1T2-Chimera
Text Generation
• 685B • Updated • 215
• 280
Qwen/Qwen3-Coder-30B-A3B-Instruct-FP8
Text Generation
• 31B • Updated • 975k
• 183
mistralai/Ministral-3-3B-Instruct-2512
4B • Updated • 625k
• 245
nex-agi/DeepSeek-V3.1-Nex-N1
Text Generation
• 671B • Updated • 39
• 45
deepseek-ai/DeepSeek-V3.2
Text Generation
• 685B • Updated • 4.07M
• 1.45k
RamonGuthrie/z_image_base-nvfp8-mixed
Text-to-Image
• Updated • 274
• 15
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text
• 403B • Updated • 954k
• 175
mistralai/Mistral-Medium-3.5-128B
128B • Updated • 432k
• 349
RedHatAI/gemma-4-26B-A4B-it-FP8-Dynamic
27B • Updated • 498k
• 30
batsclamp/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-FP8
Image-Text-to-Text
• 36B • Updated • 5.79k
• 7
Text Generation
• 862B • Updated • 567
• 54
coolthor/Huihui-Qwen3.6-35B-A3B-abliterated-FP8-DYNAMIC
Image-Text-to-Text
• 36B • Updated • 1.11k
• 4
tcclaviger/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-FP8-MTP
Image-Text-to-Text
• 40B • Updated • 5.64k
• 12
kasimat/Qwen3.6-27B-AEON-Ultimate-Uncensored-FP8-MTP
Image-Text-to-Text
• 28B • Updated • 21.5k
• 14
drbaph/HiDream-O1-Image-FP8
Image-Text-to-Image
• 9B • Updated • 5.1k
• 8
INSAIT-Institute/MamayLM-Gemma-3-27B-IT-v2.0-FP8-dynamic
Image-Text-to-Text
• 29B • Updated • 1.15k
• 2
bahadirakdemir/gemma-4-12B-it-text-fp8
Text Generation
• 12B • Updated • 945
• 2
bahadirakdemir/gemma-4-12B-it-assistant-fp8
Text Generation
• 0.4B • Updated • 151
• 2
8B • Updated • 30
• 2
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
• 73B • Updated • 1.02k
• 16
Text Generation
• 671B • Updated • 78
• 27
GreenBitAI/DeepSeek-R1-671B-layer-mix-bpw-4.0-mlx
96B • Updated • 101
• 1
deepseek-ai/DeepSeek-R1-0528
Text Generation
• 685B • Updated • 6.54M
• 2.45k
unsloth/DeepSeek-TNG-R1T2-Chimera
Text Generation
• 685B • Updated • 48
• 7
unsloth/DeepSeek-TNG-R1T2-Chimera-BF16
Text Generation
• 684B • Updated • 42
• 4
moonshotai/Kimi-K2-Instruct
Text Generation
• 1T • Updated • 609k
• 2.36k
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
• 480B • Updated • 180k
• 154
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation
• 235B • Updated • 32.9k
• 85