whisper.cpp

Running

App Files Files Community

whisper.cpp

Commit History

Vulkan k-quant mmq and ggml-backend offload functionality (llama/6155)

1ff7b08
unverified

OccamRazor commited on Mar 29, 2024

fix set main gpu crash (llama/6339)

3bdb5e6
unverified

Neo Zhang Jianyu commited on Mar 28, 2024

ggml : fix bounds checking of zero size views (llama/6347)

80db462
unverified

slaren commited on Mar 27, 2024

backend : fix typo in scheduler documentation (ggml/781)

e7ddd12
unverified

danbev commited on Apr 3, 2024

extra : sync ggml-cuda folder

fa0af15
unverified

ggerganov HF Staff commited on Apr 7, 2024

ggml: bypass code incompatible with CUDA < 11.1 (#2020)

32f4e35
unverified

primenko commited on Apr 4, 2024

ci : add building in MSYS2 environments (Windows) (#1994)

08d5ab5
unverified

Przemysław Pawełczyk commited on Mar 30, 2024

build : use pkg-config for OpenBLAS (#1778)

d5d466c
unverified

Przemysław Pawełczyk commited on Mar 29, 2024

main : add command-style grammar (#1998)

7e6ea10
unverified

ulatekh

ggerganov HF Staff commited on Mar 28, 2024

make : add grammar parser to common objects

b1f3938
unverified

ggerganov HF Staff commited on Mar 28, 2024

sync : ggml (#2001)

cbbfa9e
unverified

ggerganov HF Staff commited on Mar 27, 2024

whisper : improve handling of prompts (#1981)

15949a9
unverified

ggerganov HF Staff commited on Mar 25, 2024

whisper : improve support for distil-large-v3 (#1982)

749004e
unverified

Sanchit Gandhi commited on Mar 21, 2024

ruby : fix build (#1980)

20374d7
unverified

ggerganov HF Staff commited on Mar 21, 2024

docker : libcuda.so.1 in PATH (#1966)

2cd0d06
unverified

Tiago Fassoni commited on Mar 20, 2024

readme : add Fedora dependencies (#1970)

afc6b1a
unverified

Mohammadreza Hendiani commited on Mar 20, 2024

whisper : token-level timestamps with DTW (#1485)

ce7ca09
unverified

denersc

ggerganov HF Staff commited on Mar 20, 2024

examples : rename --audio-context to --audio-ctx per help text (#1953)

8e9c985
unverified

joliss commited on Mar 18, 2024

whisper : set outputs from conv graph (#1959)

62505d4
unverified

ggerganov HF Staff commited on Mar 16, 2024

alloc : fix allocation data of pre-allocated leafs

0c378f2
unverified

slaren commited on Mar 16, 2024

cmake : copy ggml-common.h to bin

f592e46
unverified

ggerganov HF Staff commited on Mar 16, 2024

gitignore : .vimspector.json

b593d9a
unverified

ggerganov HF Staff commited on Mar 16, 2024

talk-llama : sync llama.cpp

14e824b
unverified

ggerganov HF Staff commited on Mar 15, 2024

sync : ggml

1701a5d
unverified

ggerganov HF Staff commited on Mar 15, 2024

update examples and tests

007ebd7
unverified

slaren commited on Mar 14, 2024

ggml : add ggml-common.h

8cdfa17
unverified

ggerganov HF Staff commited on Mar 14, 2024

ggml : designate enum vals for integer types (llama/6050)

0bd0c7a
unverified

ggerganov HF Staff commited on Mar 14, 2024

metal : build metallib + fix embed path (llama/6015)

27311ef
unverified

ggerganov HF Staff commited on Mar 14, 2024

llama : add pipeline parallelism support (llama/6017)

b5bb3f3
unverified

slaren

compilade

ggerganov HF Staff commited on Mar 13, 2024

Update get version (llama/6025)

9a4e508
unverified

AidanBeltonS commited on Mar 13, 2024

ggml : reuse quantum structs across backends (llama/5943)

bb0625f
unverified

ggerganov HF Staff commited on Mar 12, 2024

ggml : fix UB in IQ2_S and IQ3_S (llama/6012)

0c552df
unverified

ggerganov HF Staff commited on Mar 12, 2024

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)

16dc72c
unverified

ggerganov HF Staff commited on Mar 12, 2024

1.5 bit: we can do even better (llama/5999)

36cc71e
unverified

Kawrakow

ikawrakow commited on Mar 11, 2024

ggml, ci : Windows ARM runner and build fixes (llama/5979)

507b9dd
unverified

Michael Podvitskiy commited on Mar 11, 2024

Better 1.5 bit quantization (llama/5971)

f3a62cc
unverified

Kawrakow

ikawrakow commited on Mar 11, 2024

Add q3_s and q1_s (llama/5886)

2957823
unverified

Abhilash Majumder commited on Mar 11, 2024

metal : move mm_id indices to shared mem (llama/5982)

1350705
unverified

ggerganov HF Staff commited on Mar 10, 2024

ggml : fix unnecessary f32 -> f16 -> f32 casts (mmla) (llama/5951)

cb8bbaa
unverified

ggerganov HF Staff commited on Mar 9, 2024

ggml : remove old quantization functions (llama/5942)

11a2545
unverified

ggerganov HF Staff commited on Mar 9, 2024

ggml : add ggml-common.h to deduplicate shared code (llama/5940)

0a37735
unverified

ggerganov HF Staff commited on Mar 9, 2024

llama : support Mamba Selective State Space Models (llama/5328)

224fbc2
unverified

compilade commited on Mar 8, 2024

extra : update sync scripts after ggml-common.h

2e29431
unverified

ggerganov HF Staff commited on Mar 15, 2024

whisper : document whisper_batch.n_seq_id (#1942)

f08549e
unverified

josharian commited on Mar 10, 2024

whisper : improve beam search candidate diversity (#1947)

6e9276c
unverified

josharian commited on Mar 10, 2024

bindings/go : add linker flags to make metal work (#1944)

3dee0de
unverified

josharian commited on Mar 9, 2024

whisper : make beam candidate sort more stable (#1943)

1316242
unverified

josharian commited on Mar 9, 2024

ggml : try fix 32-bit arm compat (#1938)

6ea3354
unverified

ggerganov HF Staff commited on Mar 8, 2024

talk-llama : use llama_decode instead of llama_eval

301b000
unverified

ggerganov HF Staff commited on Mar 8, 2024

talk-llama : sync llama.cpp

fe602cb
unverified

ggerganov HF Staff commited on Mar 8, 2024

Commit History

Vulkan k-quant mmq and ggml-backend offload functionality (llama/6155) 1ff7b08 unverified

fix set main gpu crash (llama/6339) 3bdb5e6 unverified

ggml : fix bounds checking of zero size views (llama/6347) 80db462 unverified

backend : fix typo in scheduler documentation (ggml/781) e7ddd12 unverified

extra : sync ggml-cuda folder fa0af15 unverified

ggml: bypass code incompatible with CUDA < 11.1 (#2020) 32f4e35 unverified

ci : add building in MSYS2 environments (Windows) (#1994) 08d5ab5 unverified

build : use pkg-config for OpenBLAS (#1778) d5d466c unverified

main : add command-style grammar (#1998) 7e6ea10 unverified

make : add grammar parser to common objects b1f3938 unverified

sync : ggml (#2001) cbbfa9e unverified

whisper : improve handling of prompts (#1981) 15949a9 unverified

whisper : improve support for distil-large-v3 (#1982) 749004e unverified

ruby : fix build (#1980) 20374d7 unverified

docker : libcuda.so.1 in PATH (#1966) 2cd0d06 unverified

readme : add Fedora dependencies (#1970) afc6b1a unverified

whisper : token-level timestamps with DTW (#1485) ce7ca09 unverified

examples : rename --audio-context to --audio-ctx per help text (#1953) 8e9c985 unverified

whisper : set outputs from conv graph (#1959) 62505d4 unverified

alloc : fix allocation data of pre-allocated leafs 0c378f2 unverified

cmake : copy ggml-common.h to bin f592e46 unverified

gitignore : .vimspector.json b593d9a unverified

talk-llama : sync llama.cpp 14e824b unverified

sync : ggml 1701a5d unverified

update examples and tests 007ebd7 unverified

ggml : add ggml-common.h 8cdfa17 unverified

ggml : designate enum vals for integer types (llama/6050) 0bd0c7a unverified

metal : build metallib + fix embed path (llama/6015) 27311ef unverified

llama : add pipeline parallelism support (llama/6017) b5bb3f3 unverified

Update get version (llama/6025) 9a4e508 unverified

ggml : reuse quantum structs across backends (llama/5943) bb0625f unverified

ggml : fix UB in IQ2_S and IQ3_S (llama/6012) 0c552df unverified

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995) 16dc72c unverified

1.5 bit: we can do even better (llama/5999) 36cc71e unverified

ggml, ci : Windows ARM runner and build fixes (llama/5979) 507b9dd unverified

Better 1.5 bit quantization (llama/5971) f3a62cc unverified

Add q3_s and q1_s (llama/5886) 2957823 unverified

metal : move mm_id indices to shared mem (llama/5982) 1350705 unverified

ggml : fix unnecessary f32 -> f16 -> f32 casts (mmla) (llama/5951) cb8bbaa unverified

ggml : remove old quantization functions (llama/5942) 11a2545 unverified

ggml : add ggml-common.h to deduplicate shared code (llama/5940) 0a37735 unverified

llama : support Mamba Selective State Space Models (llama/5328) 224fbc2 unverified

extra : update sync scripts after ggml-common.h 2e29431 unverified

whisper : document whisper_batch.n_seq_id (#1942) f08549e unverified

whisper : improve beam search candidate diversity (#1947) 6e9276c unverified

bindings/go : add linker flags to make metal work (#1944) 3dee0de unverified

whisper : make beam candidate sort more stable (#1943) 1316242 unverified

ggml : try fix 32-bit arm compat (#1938) 6ea3354 unverified

talk-llama : use llama_decode instead of llama_eval 301b000 unverified

talk-llama : sync llama.cpp fe602cb unverified

Vulkan k-quant mmq and ggml-backend offload functionality (llama/6155)

1ff7b08
unverified

fix set main gpu crash (llama/6339)

3bdb5e6
unverified

ggml : fix bounds checking of zero size views (llama/6347)

80db462
unverified

backend : fix typo in scheduler documentation (ggml/781)

e7ddd12
unverified

extra : sync ggml-cuda folder

fa0af15
unverified

ggml: bypass code incompatible with CUDA < 11.1 (#2020)

32f4e35
unverified

ci : add building in MSYS2 environments (Windows) (#1994)

08d5ab5
unverified

build : use pkg-config for OpenBLAS (#1778)

d5d466c
unverified

main : add command-style grammar (#1998)

7e6ea10
unverified

make : add grammar parser to common objects

b1f3938
unverified

sync : ggml (#2001)

cbbfa9e
unverified

whisper : improve handling of prompts (#1981)

15949a9
unverified

whisper : improve support for distil-large-v3 (#1982)

749004e
unverified

ruby : fix build (#1980)

20374d7
unverified

docker : libcuda.so.1 in PATH (#1966)

2cd0d06
unverified

readme : add Fedora dependencies (#1970)

afc6b1a
unverified

whisper : token-level timestamps with DTW (#1485)

ce7ca09
unverified

examples : rename --audio-context to --audio-ctx per help text (#1953)

8e9c985
unverified

whisper : set outputs from conv graph (#1959)

62505d4
unverified

alloc : fix allocation data of pre-allocated leafs

0c378f2
unverified

cmake : copy ggml-common.h to bin

f592e46
unverified

gitignore : .vimspector.json

b593d9a
unverified

talk-llama : sync llama.cpp

14e824b
unverified

sync : ggml

1701a5d
unverified

update examples and tests

007ebd7
unverified

ggml : add ggml-common.h

8cdfa17
unverified

ggml : designate enum vals for integer types (llama/6050)

0bd0c7a
unverified

metal : build metallib + fix embed path (llama/6015)

27311ef
unverified

llama : add pipeline parallelism support (llama/6017)

b5bb3f3
unverified

Update get version (llama/6025)

9a4e508
unverified

ggml : reuse quantum structs across backends (llama/5943)

bb0625f
unverified

ggml : fix UB in IQ2_S and IQ3_S (llama/6012)

0c552df
unverified

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)

16dc72c
unverified

1.5 bit: we can do even better (llama/5999)

36cc71e
unverified

ggml, ci : Windows ARM runner and build fixes (llama/5979)

507b9dd
unverified

Better 1.5 bit quantization (llama/5971)

f3a62cc
unverified

Add q3_s and q1_s (llama/5886)

2957823
unverified

metal : move mm_id indices to shared mem (llama/5982)

1350705
unverified

ggml : fix unnecessary f32 -> f16 -> f32 casts (mmla) (llama/5951)

cb8bbaa
unverified

ggml : remove old quantization functions (llama/5942)

11a2545
unverified

ggml : add ggml-common.h to deduplicate shared code (llama/5940)

0a37735
unverified

llama : support Mamba Selective State Space Models (llama/5328)

224fbc2
unverified

extra : update sync scripts after ggml-common.h

2e29431
unverified

whisper : document whisper_batch.n_seq_id (#1942)

f08549e
unverified

whisper : improve beam search candidate diversity (#1947)

6e9276c
unverified

bindings/go : add linker flags to make metal work (#1944)

3dee0de
unverified

whisper : make beam candidate sort more stable (#1943)

1316242
unverified

ggml : try fix 32-bit arm compat (#1938)

6ea3354
unverified

talk-llama : use llama_decode instead of llama_eval

301b000
unverified

talk-llama : sync llama.cpp

fe602cb
unverified