Commits
Branch selector
User selector
Datepicker
Commit History
Commits on Nov 5, 2023
ggml-cuda : fix f16 mul mat (#3961)
Show description for 2833a6fauthoredAllow common process_escapes to handle \x sequences (#3928)
Show description for d9ccce2authored- authored
cuda : fix disabling device with --tensor-split 1,0 (#3951)
Show description for 132d25bllama : mark LLM_ARCH_STARCODER as full offload supported (#3945)
Show description for 3d48f42authoredcmake : MSVC instruction detection (fixed up #809) (#3923)
Show description for c41ea36- authored
cuda : revert CUDA pool stuff (#3944)
Show description for 48ade94authored
Commits on Nov 4, 2023
Commits on Nov 3, 2023
- authored
- authored
- authored
speculative : change default p_accept to 0.5 + CLI args (#3919)
Show description for 8f961abcommittedcommon : YAYF (yet another YARN fix) (#3925)
Show description for 0581602authored- authored
Commits on Nov 2, 2023
- authored
- authored
- authored
- authored
cuda : use CUDA memory pool with async memory allocation/deallocation when available (#3903)
Show description for d606905- authored
- authored
gguf : remove special-case code for GGUFv1 (#3901)
Show description for 2756c4fauthored- committed
build : link against build info instead of compiling against it (#3879)
Show description for b12fa0d- authored
- authored
- authored
Commits on Nov 1, 2023
- authored
ggml-cuda : compute ptrs for cublasGemmBatchedEx in a kernel (#3891)
Show description for d02e98cauthoredllama : implement YaRN RoPE scaling (#2268)
Show description for 898aeca- committed
- committed
metal : multi-simd softmax (#3710)
Show description for e16b9faauthored- committed