whisper.cpp / ggml /src /ggml-impl.h

Commit History

llama : add gpt-oss (llama/15091)
bf225d6

ggerganov ngxson HF Staff slaren commited on

metal : fuse add, mul + add tests (llama/14596)
66ae493

ggerganov commited on

vulkan: Add fusion support for RMS_NORM+MUL (llama/14366)
737f12d

jeffbolznv slaren commited on

ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)
fea8f94

taronaeo slaren commited on

ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)
1459465

Daniel Tang commited on

ggml : riscv: add xtheadvector support (llama/13720)
4790d5d

xctan commited on

ggml-impl.h: fix build on POWER9 (llama/12855)
3a1d5ca

Piotr Kubaj commited on

ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)
87f1ea3

cmdr2 commited on

ggml : add more generic custom op, remove deprecated custom ops (ggml/1183)
ba7a5f8

Diego Devesa commited on

ggml : simplify Arm fp16 CPU logic (ggml/1177)
fb13b88

ggerganov commited on

ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)
f695cbf

ggerganov commited on

MUSA: support ARM64 and enable dp4a .etc (llama/11843)
ab96dac

Bodhi Bodhi Hu commited on

GGUF: C++ refactor, backend support, misc fixes (llama/11030)
21c5b64

JohannesGaessler commited on

tests: add tests for GGUF (llama/10830)
e7722cb

JohannesGaessler commited on

remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)
b38cecf

Diego Devesa commited on

Avoid using __fp16 on ARM with old nvcc (llama/10616)
19743b6

Frankie Robertson commited on

ggml : move AMX to the CPU backend (llama/10570)
3732429

Diego Devesa commited on

Do not include arm_neon.h when compiling CUDA code (ggml/1028)
80663f4

Frankie Robertson commited on

ggml-opt: fix data corruption (ggml/1022)
a916e92

JohannesGaessler commited on

ggml: new optimization interface (ggml/988)
dd33ace

JohannesGaessler commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov R0CKSTAR commited on

ggml : move CPU backend to a separate file (llama/10144)
0f447f2

Diego Devesa commited on

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)
cf75979

Gilad S commited on

ggml-backend : add device and backend reg interfaces (llama/9707)
9d74d85

Diego Devesa commited on

ggml : refactoring (llama/#0)
1b62c96

ggerganov commited on

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)
d1c244a

compilade commited on

ggml : reading the runtime sve config of the cpu (llama/8709)
c26339f

jdomke domke commited on

Fix conversion of unnormalized BF16->BF16 weights (llama/7843)
8b10f59

Sigbjørn Skjæret compilade commited on

ggml : reduce hash table reset cost (llama/8698)
9808fbf

slaren commited on

ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780)
9509586

Dibakar Gope commited on

whisper : reorganize source code + improve CMake (#2256)
f75c2e3
unverified

ggerganov commited on