ggml : add epsilon as a parameter for group_norm (llama/8818) d003891 mollysama commited on Aug 6, 2024
ggml : reading the runtime sve config of the cpu (llama/8709) c26339f jdomke domke commited on Aug 3, 2024
Fix conversion of unnormalized BF16->BF16 weights (llama/7843) 8b10f59 Sigbjørn Skjæret compilade commited on Aug 2, 2024
ggml-cuda: Adding support for unified memory (llama/8035) 686bb18 matteogeniaccio matteo serva JohannesGaessler commited on Aug 1, 2024
Build: Only include execinfo.h on linux systems that support it (llama/8783) 0019ddb Alex O'Connell commited on Aug 1, 2024
cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (llama/8800) 73e80d1 slaren commited on Aug 1, 2024
added android implementation of ggml_print_backtrace_symbols (llama/8751) 314d58a l3utterfly slaren commited on Jul 30, 2024
ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (llama/8748) 6989631 carterli carter.li commited on Jul 29, 2024
cuda : organize vendor-specific headers into vendors directory (llama/8746) ec2f307 R0CKSTAR commited on Jul 29, 2024
ggml: handle ggml_init failure to fix NULL pointer deref (llama/8692) dc51517 DavidKorczynski commited on Jul 25, 2024
ggml : add and use ggml_cpu_has_llamafile() (llama/8664) efcca56 ggerganov HF Staff commited on Jul 25, 2024
Re-add erroneously removed -fsycl from GGML_EXTRA_LIBS (llama/8667) 6e12dfd Joe Todd commited on Jul 24, 2024
sycl : Add support for non-release DPC++ & oneMKL (llama/8644) 2a5814c Joe Todd commited on Jul 23, 2024
Allow all RDNA2 archs to use sdot4 intrinsic (llama/8629) 1d65fea Jeroen Mostert commited on Jul 23, 2024
CUDA: MMQ code deduplication + iquant support (llama/8495) 6d14124 JohannesGaessler commited on Jul 20, 2024
ggml : fix quant dot product with odd number of blocks (llama/8549) 0083f96 slaren ggerganov HF Staff commited on Jul 19, 2024
ggml : add friendlier error message to fopen errors (llama/8575) ab5b4e0 HanClinto commited on Jul 19, 2024
CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572) afc137c JohannesGaessler commited on Jul 18, 2024
make/cmake: add missing force MMQ/cuBLAS for HIP (llama/8515) 5096c91 JohannesGaessler commited on Jul 16, 2024
Refactor lora adapter support (llama/8332) 76bcfc6 Xuan Son Nguyen slaren compilade commited on Jul 15, 2024
metal : template-ify some of the kernels (llama/8447) 3c3094f ggerganov HF Staff commited on Jul 13, 2024