Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

40,511

Full-text search

Active filters: 4-bit

mlx-community/GLM-4.6V-Flash-4bit

Image-Text-to-Text • Updated 8 days ago • 1.19k • 7

QuantTrio/DeepSeek-V3.2-AWQ

Text Generation • 685B • Updated 13 days ago • 2.96k • 6

PLOI-Labs/lh-degen-001

Updated 5 days ago • 86 • 4

Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32

Text-to-Image • Updated 13 days ago • 26.5k • 47

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 67.6k • 592

Tann-dev/sex-chat-dirty-girlfriend

Text Generation • 7B • Updated Feb 17, 2024 • 391 • 41

gaunernst/gemma-3-12b-it-int4-awq

Image-Text-to-Text • 12B • Updated Apr 6 • 4.27k • 22

unsloth/Qwen3-14B-unsloth-bnb-4bit

Text Generation • 15B • Updated May 13 • 50.8k • 11

Qwen/Qwen3-30B-A3B-GPTQ-Int4

Text Generation • 31B • Updated May 21 • 487k • 42

mlx-community/DeepSeek-V3.2-4bit

Text Generation • 672B • Updated 7 days ago • 2.73k • 2

huihui-ai/Huihui-GLM-4.6-abliterated-mlx-4bit

Text Generation • 353B • Updated 14 days ago • 1.18k • 17

MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF

Text Generation • 8B • Updated 10 days ago • 32.2k • 2

mlx-community/GLM-4.6V-4bit

Image-Text-to-Text • Updated 8 days ago • 934 • 4

ExaltedSlayer/mistralai-devstral-small-2-24b-instruct-2512-mlx-mxfp4

Text Generation • 24B • Updated about 20 hours ago • 679 • 2

nn-tech/MetalGPT-1-AWQ

Text Generation • 33B • Updated 2 days ago • 150 • 2

Kirim-ai/Kirim-V1-Base

Text Generation • 12B • Updated 3 days ago • 55 • 2

TheBloke/Falcon-7B-Instruct-GPTQ

Text Generation • 7B • Updated Aug 21, 2023 • 189 • 68

TheBloke/Llama-2-7B-Chat-GPTQ

Text Generation • 7B • Updated Sep 27, 2023 • 10.5k • 267

TheBloke/13B-BlueMethod-GPTQ

Text Generation • 13B • Updated Sep 27, 2023 • 146 • 6

TheBloke/Octocoder-GPTQ

Text Generation • 16B • Updated Sep 27, 2023 • 29 • 8

TheBloke/mixtral-8x7b-v0.1-AWQ

Text Generation • 47B • Updated Dec 22, 2023 • 231 • 11

TheBloke/Mistral-7B-Instruct-v0.2-GPTQ

Text Generation • 7B • Updated Dec 11, 2023 • 19.7k • 55

MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-GGUF

Text Generation • 141B • Updated Apr 18, 2024 • 2.48k • 34

MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF

Text Generation • 8B • Updated Apr 23, 2024 • 151k • 101

unsloth/Phi-3-mini-4k-instruct-bnb-4bit

Text Generation • 4B • Updated Sep 3, 2024 • 39.2k • 39

MaziyarPanahi/Llama-3-8B-Instruct-v0.1-GGUF

Text Generation • 8B • Updated May 4, 2024 • 221 • 2

MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF

Text Generation • 71B • Updated Jul 29, 2024 • 134k • 40

Qwen/Qwen2.5-32B-Instruct-AWQ

Text Generation • 33B • Updated Oct 9, 2024 • 843k • 90

unsloth/Qwen2.5-3B-Instruct-bnb-4bit

Text Generation • 3B • Updated Feb 6 • 8.31k • 12

Qwen/Qwen2.5-Coder-1.5B-Instruct-AWQ

Text Generation • 2B • Updated Nov 18, 2024 • 109k • 4