Commits
Commit History
Commits on Jun 12, 2023
Commits on Jun 11, 2023
Fix issue where interactive mode crashes when input exceeds ctx size (#1789) (fa84c4b)
Fixed WSL cuda's OOM error (#1594) (12b063f)
Commits on Jun 10, 2023
metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782) (303f580)
doc : fix wrong address of BLIS.md (#1772) (059e990)
ggml : force no_alloc == false when creating opt tensors (close #1699) (17c10ac)
metal : add Q4_1 implementation (#1785) (e9b66ee)
llama : support requantizing models instead of only allowing quantization from 16/32bit (#1691) (4f0154b)
Commits on Jun 9, 2023
OpenCL: Add release memory (#1741) (98ed165)
Windows nvcc workaround (#1753) (ae9663f)
metal : add GELU implementation (#1770) (92f44ff)
metal : faster q4_0 (#1775) (245fc3c)
Commits on Jun 8, 2023
metal : add Q2_K implementation (#1762) (72ff528)
Revert "ggml : load data into int8x16x4_t using vld4q_s8 on arm64 (#1738)" (0bf7cf1)
metal : Q6_K implementation (#1752) (0f291e1)
Add llama.cpp docker support for non-latin languages (#1673) (8fc8179)
metal : add Q4_K implementation (#1733) (4161bdc)
Commits on Jun 7, 2023
k-quants : allow to optionally disable at compile time (#1734) (5c64a09)