Spaces:
Running
Running
Commit History
metal : fuse add, mul + add tests (llama/14596)
66ae493
vulkan: Add fusion support for RMS_NORM+MUL (llama/14366)
737f12d
ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)
fea8f94
ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)
1459465
Daniel Tang
commited on
ggml : riscv: add xtheadvector support (llama/13720)
4790d5d
xctan
commited on
ggml-impl.h: fix build on POWER9 (llama/12855)
3a1d5ca
Piotr Kubaj
commited on
ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)
87f1ea3
cmdr2
commited on
ggml : add more generic custom op, remove deprecated custom ops (ggml/1183)
ba7a5f8
Diego Devesa
commited on
ggml : simplify Arm fp16 CPU logic (ggml/1177)
fb13b88
ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)
f695cbf
MUSA: support ARM64 and enable dp4a .etc (llama/11843)
ab96dac
Bodhi
Bodhi Hu
commited on
GGUF: C++ refactor, backend support, misc fixes (llama/11030)
21c5b64
tests: add tests for GGUF (llama/10830)
e7722cb
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)
b38cecf
Diego Devesa
commited on
Avoid using __fp16 on ARM with old nvcc (llama/10616)
19743b6
Frankie Robertson
commited on
ggml : move AMX to the CPU backend (llama/10570)
3732429
Diego Devesa
commited on
Do not include arm_neon.h when compiling CUDA code (ggml/1028)
80663f4
Frankie Robertson
commited on
ggml-opt: fix data corruption (ggml/1022)
a916e92
ggml: new optimization interface (ggml/988)
dd33ace
ggml : build backends as libraries (llama/10256)
3dc93f3
ggml : move CPU backend to a separate file (llama/10144)
0f447f2
Diego Devesa
commited on
fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)
cf75979
Gilad S
commited on
ggml-backend : add device and backend reg interfaces (llama/9707)
9d74d85
Diego Devesa
commited on
ggml : refactoring (llama/#0)
1b62c96
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)
d1c244a
ggml : reading the runtime sve config of the cpu (llama/8709)
c26339f
jdomke
domke
commited on
Fix conversion of unnormalized BF16->BF16 weights (llama/7843)
8b10f59
ggml : reduce hash table reset cost (llama/8698)
9808fbf
slaren
commited on
ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780)
9509586
Dibakar Gope
commited on