plz
qenme
·
AI & ML interests
None yet
Recent Activity
new activity about 17 hours ago
cyankiwi/Qwen3.6-27B-AWQ-BF16-INT8:Compatible with vLLM on Ampere? new activity 6 days ago
Minachist/Qwen3.6-27B-INT8-AutoRound:Me again new activity 6 days ago
AesSedai/Qwen3.6-35B-A3B-GGUF:MTP support?Organizations
None yet
Compatible with vLLM on Ampere?
➕ 3
3
#3 opened 12 days ago
by
HenkTenk
Me again
3
#2 opened 6 days ago
by
qenme
MTP support?
👍 1
7
#5 opened 8 days ago
by
Nindaleth
Very bad results with model quant and KV cache quant, only BF16 works well
👍👀 5
4
#34 opened 29 days ago
by
qenme
F16 or BF16?
#6 opened 8 days ago
by
qenme
FYI : --spec-type mtp syntax has changed to --spec-type draft-mtp
👍 3
3
#14 opened 11 days ago
by
qenme
presence-penalty
4
#8 opened 12 days ago
by
owao
Good quant!
12
#1 opened 23 days ago
by
qenme
Working good on 96GB VRAM + DDR5 Setup
❤️ 1
5
#2 opened 24 days ago
by
phakio
GOOLE WHERE IS MTP ?
🔥 2
2
#82 opened 30 days ago
by
EvilinaMaller
10/10
🔥 6
1
#4 opened about 1 month ago
by
qenme
thanks!
➕❤️ 19
2
#1 opened about 1 month ago
by
qenme
Will there be a small model for speculative decoding?
3
#71 opened about 1 month ago
by
Regrin
Thanks
➕ 8
1
#2 opened 3 months ago
by
qenme
gguf when
🔥 3
2
#2 opened 3 months ago
by
kalota
New version with fixed
#6 opened 4 months ago
by
qenme