File tree Expand file tree Collapse file tree 1 file changed +1
-1
lines changed
Filter options
Expand file tree Collapse file tree 1 file changed +1
-1
lines changed
Original file line number Diff line number Diff line change @@ -199,7 +199,7 @@ https://user-images.githubusercontent.com/271616/225014776-1d567049-ad71-4ef2-b0
199
199
- We don't know yet how much the quantization affects the quality of the generated text
200
200
- Probably the token sampling can be improved
201
201
- The Accelerate framework is actually currently unused since I found that for tensor shapes typical for the Decoder,
202
- there is no benefit compared to the ARM_NEON intrinsics implementation. Of course, it's possible that I simlpy don't
202
+ there is no benefit compared to the ARM_NEON intrinsics implementation. Of course, it's possible that I simply don't
203
203
know how to utilize it properly. But in any case, you can even disable it with ` LLAMA_NO_ACCELERATE=1 make ` and the
204
204
performance will be the same, since no BLAS calls are invoked by the current implementation
205
205
You can’t perform that action at this time.
0 commit comments