Commit History
ggml-cpu : split arch-specific implementations (llama/13892)
8c833e9
musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (llama/12611)
12bb60d
R0CKSTAR
committed on
CUDA: use arch list for compatibility check (llama/11775)
b88e163
CUDA: rename macros to avoid conflicts with WinAPI (llama/10736)
8544072
Andreas Kieslinger
committed on
ggml : refactor online repacking (llama/10446)
163128e
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)
bf73242
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)
d1c244a
feat: Support Moore Threads GPU (llama/8383)
a35db11
ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780)
9509586
Dibakar Gope
committed on