Commit History
vulkan: use uint array index to avoid glslang bug (llama/13193)
fd2d86d
vulkan: In coopmat2 mmq, load q4_k/q5_k scales through shared memory (llama/12833)
4b7a407
vulkan: optimize iq1 coopmat2 dequant functions (llama/12427)
53dd8ad
vulkan: use fp32 in coopmat2 q4_k dequant function (llama/12309)
9ca84c6
vulkan: matmul dequantization improvements (llama/12015)
ffdf466
Eve
committed on
vulkan: initial support for IQ1_S and IQ1_M quantizations (llama/11528)
0d2e888
Rémy O
committed on
vulkan: optimize coopmat2 iq2/iq3 callbacks (llama/11521)
3731f13
vulkan: initial support for IQ4_XS quantization (llama/11501)
ed46ad5
Rémy O
committed on