whisper.cpp

Running

App Files Files Community

whisper.cpp / ggml /src /ggml-common.h

Commit History

llama : add gpt-oss (llama/15091)

bf225d6

ggerganov

ngxson HF Staff slaren commited on Aug 5

ggml-cpu : split arch-specific implementations (llama/13892)

8c833e9

xctan

ggerganov commited on Jun 9

musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (llama/12611)

12bb60d

R0CKSTAR commited on Mar 30

CUDA: use arch list for compatibility check (llama/11775)

b88e163

JohannesGaessler Diego Devesa commited on Feb 10

CUDA: rename macros to avoid conflicts with WinAPI (llama/10736)

8544072

Andreas Kieslinger commited on Dec 10, 2024

ggml : refactor online repacking (llama/10446)

163128e

Djip007

ggerganov commited on Dec 7, 2024

ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)

bf73242

shupeif commited on Nov 28, 2024

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)

d1c244a

compilade commited on Sep 6, 2024

feat: Support Moore Threads GPU (llama/8383)

a35db11

yeahdongcn commited on Jul 27, 2024

ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780)

9509586

Dibakar Gope commited on Jul 10, 2024

CUDA: refactor and optimize IQ MMVQ (llama/8215)

afa1447

JohannesGaessler commited on Jul 1, 2024

whisper : reorganize source code + improve CMake (#2256)

f75c2e3
unverified

ggerganov commited on Jun 26, 2024

Commit History

llama : add gpt-oss (llama/15091) bf225d6

ggml-cpu : split arch-specific implementations (llama/13892) 8c833e9

musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (llama/12611) 12bb60d

CUDA: use arch list for compatibility check (llama/11775) b88e163

CUDA: rename macros to avoid conflicts with WinAPI (llama/10736) 8544072

ggml : refactor online repacking (llama/10446) 163128e

ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541) bf73242

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151) d1c244a

feat: Support Moore Threads GPU (llama/8383) a35db11

ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780) 9509586

CUDA: refactor and optimize IQ MMVQ (llama/8215) afa1447

whisper : reorganize source code + improve CMake (#2256) f75c2e3 unverified