Inference Providers
Active filters: GPTQ
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 207
• 9
QuantTrio/GLM-4.5-GPTQ-Int4-Int8Mix
Text Generation
• 55B • Updated • 72
• 5
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 121
• 2
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated • 780
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 46
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 46
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated • 10
• 3
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated • 218
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated • 27
• 1
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 89.5k
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 180
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 27
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 85
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 7.83k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 7
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 21
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 4
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated • 3
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 2
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 6
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 4
QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation
• 69B • Updated • 11
• 4
QuantTrio/KAT-Dev-GPTQ-Int4
Text Generation
• 33B • Updated • 4
• 1
QuantTrio/KAT-Dev-GPTQ-Int8
Text Generation
• 33B • Updated • 3
• 1
QuantTrio/Kimi-Dev-72B-GPTQ-Int4
Text Generation
• 73B • Updated • 36
• 2
QuantTrio/Kimi-Dev-72B-GPTQ-Int8
Text Generation
• 73B • Updated • 12
• 2
AXERA-TECH/Qwen3-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 69
• 1
AXERA-TECH/Qwen3-VL-4B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 41
AXERA-TECH/Qwen3-VL-8B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 24
• 1
AXERA-TECH/Qwen3-VL-8B-Instruct
Image-Text-to-Text
• Updated • 5