-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
mlx-community/GLM-4.6V-Flash-4bit
Image-Text-to-Text
•
Updated
•
1.19k
•
7
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
2.96k
•
6
PLOI-Labs/lh-degen-001
Updated
•
86
•
4
Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32
Text-to-Image
•
Updated
•
26.5k
•
47
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
33B
•
Updated
•
67.6k
•
592
Tann-dev/sex-chat-dirty-girlfriend
Text Generation
•
7B
•
Updated
•
391
•
41
gaunernst/gemma-3-12b-it-int4-awq
Image-Text-to-Text
•
12B
•
Updated
•
4.27k
•
22
unsloth/Qwen3-14B-unsloth-bnb-4bit
Text Generation
•
15B
•
Updated
•
50.8k
•
11
Qwen/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
31B
•
Updated
•
487k
•
42
mlx-community/DeepSeek-V3.2-4bit
Text Generation
•
672B
•
Updated
•
2.73k
•
2
huihui-ai/Huihui-GLM-4.6-abliterated-mlx-4bit
Text Generation
•
353B
•
Updated
•
1.18k
•
17
MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF
Text Generation
•
8B
•
Updated
•
32.2k
•
2
mlx-community/GLM-4.6V-4bit
Image-Text-to-Text
•
Updated
•
934
•
4
ExaltedSlayer/mistralai-devstral-small-2-24b-instruct-2512-mlx-mxfp4
Text Generation
•
24B
•
Updated
•
679
•
2
nn-tech/MetalGPT-1-AWQ
Text Generation
•
33B
•
Updated
•
150
•
2
Kirim-ai/Kirim-V1-Base
Text Generation
•
12B
•
Updated
•
55
•
2
TheBloke/Falcon-7B-Instruct-GPTQ
Text Generation
•
7B
•
Updated
•
189
•
68
TheBloke/Llama-2-7B-Chat-GPTQ
Text Generation
•
7B
•
Updated
•
10.5k
•
267
TheBloke/13B-BlueMethod-GPTQ
Text Generation
•
13B
•
Updated
•
146
•
6
TheBloke/Octocoder-GPTQ
Text Generation
•
16B
•
Updated
•
29
•
8
TheBloke/mixtral-8x7b-v0.1-AWQ
Text Generation
•
47B
•
Updated
•
231
•
11
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
7B
•
Updated
•
19.7k
•
55
MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-GGUF
Text Generation
•
141B
•
Updated
•
2.48k
•
34
MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
151k
•
101
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
Text Generation
•
4B
•
Updated
•
39.2k
•
39
MaziyarPanahi/Llama-3-8B-Instruct-v0.1-GGUF
Text Generation
•
8B
•
Updated
•
221
•
2
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF
Text Generation
•
71B
•
Updated
•
134k
•
40
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
33B
•
Updated
•
843k
•
90
unsloth/Qwen2.5-3B-Instruct-bnb-4bit
Text Generation
•
3B
•
Updated
•
8.31k
•
12
Qwen/Qwen2.5-Coder-1.5B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
109k
•
4