Commits
Branch selector
User selector
Datepicker
Commit History
Commits on Jun 25, 2023
- committed
- authored
- committed
fix server sampling: top k sampler first (#1977)
Show description for c2a08f8andanon998authoredanon- authored
zig : upgrade build system support (#1981)
Show description for e65ca7e
Commits on Jun 24, 2023
#1869 Fix null reference errors when training from scratch with CUDA (#1907)
Show description for 5ec8dd5- committed
- authored
- authored
ggml : improve ggml_graph_dump_dot, add ggml_format_name (#1978)
Show description for f2c754eauthored- committed
- authored
llama : fix top-p sampling to match the canonical definition (#1953)
Show description for b061ba9authoredllama : make model stateless and context stateful (llama_state) (#1797)
Show description for 527b6fb
Commits on Jun 23, 2023
Add OpenLLaMA instructions to the README (#1954)
Show description for d7b7484authoredeiery
Commits on Jun 22, 2023
rework convert.py to read hyper-parameters from config.json (#1958)
Show description for 7487137authored
Commits on Jun 21, 2023
Commits on Jun 20, 2023
- authored
- authored
llama : fix params struct slignment (#1936)
Show description for aacdbd4authored
Commits on Jun 19, 2023
[Fix] Reenable server embedding endpoint (#1937)
Show description for 20568feauthored- committed
llama : use aligned memory during ggml_init call from loading saved sessions (#1934)
Show description for ba4e85aauthored- committed
llama : only use Q6_K for output weights if tensor size is multiple of 256 (#1932)
Show description for cb40dfccuda : faster k-quants on older GPUs (#1930)
Show description for ca7c3f4ggml : sync latest ggml repo (#1924)
Show description for b97ca43authoredcmake : fix build shared ggml when CUDA is enabled (#1929)
Show description for 1e3abfcConvert vector to f16 for dequantize mul mat vec (#1913)
Show description for 16b9cd1authored
Commits on Jun 18, 2023
- authored
- authored
- authored
readme : update Android build instructions (#1922)
Show description for e1886cfauthored