Fix logits_to_logprobs for 2-D and 3-D logits #1002

kddubey · Dec 12, 2023

The implementation in main (from this PR) only works for 1-D logits. For 2-D or 3-D logits it fails silently: no error is raised, but the result is wrong. The implementation in this PR works out-of-the-box for 1-D, 2-D, and 3-D logits. (3-D is possible in the future w/ batch inference and logits_all=True.) This might be useful b/c there are some places in the code where we can save time by vectorizing / not converting data to lists. I'll do that in a future PR.
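To illustrate the failure mode, here is a minimal sketch. `naive_logprobs` is a hypothetical stand-in, not the code in main: it assumes a log-softmax written for 1-D input that takes the max and the sum over the whole array. On 2-D input it returns an array of the right shape without raising, but the rows are not normalized.

```python
import numpy as np


def naive_logprobs(logits):
    # Hypothetical 1-D-only log-softmax (an assumption for illustration,
    # not the code in main): max and sum reduce over ALL elements.
    logits = np.asarray(logits, dtype=np.single)
    shifted = logits - np.max(logits)
    return shifted - np.log(np.sum(np.exp(shifted)))


logits_1d = np.array([1.0, 2.0, 3.0])
logits_2d = np.array([[1.0, 2.0, 3.0], [0.5, 0.5, 0.5]])

# 1-D: the probabilities sum to 1, as expected.
row_sum_1d = np.exp(naive_logprobs(logits_1d)).sum()

# 2-D: no error is raised, but each row's probabilities sum to less than 1
# because the normalizer was computed over both rows together.
row_sums_2d = np.exp(naive_logprobs(logits_2d)).sum(axis=-1)
```

A check like `np.allclose(row_sums_2d, 1.0)` is a cheap way to catch this kind of bug without a reference implementation.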

The minimal sufficient fix is to set axis=-1 in the np.max call and keepdims=True in the np.sum call. I decided instead to go with a more robust implementation, which is almost copy-pasted from scipy.special.log_softmax. I decided against adding scipy as a required dependency b/c it's not lightweight: the latest version is ~37 MB.
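For readers who want the general shape of such a function, here is a rough sketch of a numpy-only log-softmax along one axis. `log_softmax_sketch` is a hypothetical name, and the code is modeled on the approach described above rather than copied from this PR's diff, so treat the details as assumptions.

```python
import numpy as np


def log_softmax_sketch(logits, axis=-1):
    # Hypothetical helper for illustration; not the exact code in this PR.
    logits = np.asarray(logits, dtype=np.single)
    # Subtract the max along `axis` for numerical stability. keepdims=True
    # keeps the reduced axis so the result broadcasts against `logits`.
    logits_max = np.amax(logits, axis=axis, keepdims=True)
    # If a slice has a non-finite max (e.g. all -inf), shift by 0 instead.
    logits_max[~np.isfinite(logits_max)] = 0
    shifted = logits - logits_max
    log_normalizer = np.log(np.sum(np.exp(shifted), axis=axis, keepdims=True))
    return shifted - log_normalizer


# The same function handles 1-D, 2-D, and 3-D input.
for shape in [(4,), (2, 3), (2, 3, 4)]:
    out = log_softmax_sketch(-np.random.uniform(low=0, high=60, size=shape))
    assert out.shape == shape
    assert np.allclose(np.exp(out).sum(axis=-1), 1.0, atol=1e-5)
```

Reducing along a single axis with keepdims=True is what lets one code path serve all three shapes.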

How has this been tested?

Script

Install the new test dependency, scipy, which contains a correct implementation:
```
python -m pip install scipy
```
Check out main:
```
git checkout main
```

Run this script in main to verify that the current implementation is silently wrong for 2-D logits:

```python
from __future__ import annotations

import numpy as np
from scipy.special import log_softmax

from llama_cpp import Llama

atol = 1e-3  # intentionally set to be loose when testing the impl in main
size = (2, 3)
logits: list = (
    (-np.random.uniform(low=0, high=60, size=size)).astype(np.single).tolist()
)

logprobs = Llama.logits_to_logprobs(logits)
logprobs_correct = log_softmax(logits, axis=-1)
assert np.allclose(logprobs, logprobs_correct, atol=atol)
```

Check out this branch:

```
git checkout kddubey/fix-logits-to-logprobs
```

Run the same script with atol=1e-6. No error should be raised.

New unit tests

```
pytest tests/test_llama.py -k test_logits_to_logprobs
```

kddubey · Dec 12, 2023

pyproject.toml

```diff
 ]
 test = [
     "pytest>=7.4.0",
     "httpx>=0.24.1",
+    "scipy>=1.10",
```

This is the oldest version compatible with numpy>=1.20.0.

source: https://docs.scipy.org/doc/scipy/dev/toolchain.html#numpy

abetlen · Dec 16, 2023

@kddubey thank you, yes that's a good idea wrt vectorizing the logits -> logprobs calculation

kddubey changed the title ~~Fix logits_to_logprobs~~ Fix logits_to_logprobs for 2-D and 3-D logits Dec 12, 2023

kddubey added 3 commits December 12, 2023 04:04

Fix logits_to_logprobs for 2-D and 3-D logits

68c56cf

Set dtype to single

5e08113

Test size

b11a456

abetlen merged commit 5a89446 into abetlen:main Dec 16, 2023

kddubey deleted the kddubey/fix-logits-to-logprobs branch December 17, 2023 00:13

kddubey mentioned this pull request Dec 17, 2023

Don't convert logprobs arrays to lists #1021

Merged
