This release fixes an LLM.int8() regression introduced in 0.48.0 that caused poor inference results when loading pre-quantized checkpoints in HF transformers.
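To check whether an environment has the affected release installed, something like the following minimal sketch may help. It assumes the package is installed under the distribution name `bitsandbytes` and that only 0.48.0 is affected, as stated above. The helper names `is_affected` and `installed_bitsandbytes_version` are hypothetical and not part of the library.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional

# Per the note above: the regression was introduced in 0.48.0 and fixed in 0.48.1.
AFFECTED_VERSION = "0.48.0"


def is_affected(installed: str) -> bool:
    """Return True if the given version string is the release with the regression.

    Any local build suffix (the part after "+") is ignored; this is an
    assumption about how local builds are labeled.
    """
    return installed.split("+", 1)[0].strip() == AFFECTED_VERSION


def installed_bitsandbytes_version() -> Optional[str]:
    """Return the installed bitsandbytes version, or None if it is not installed."""
    try:
        return version("bitsandbytes")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_bitsandbytes_version()
    if found is None:
        print("bitsandbytes is not installed")
    elif is_affected(found):
        print(f"bitsandbytes {found} has the LLM.int8() regression; upgrade to 0.48.1 or later")
    else:
        print(f"bitsandbytes {found} is not the affected release")
```

If the check reports 0.48.0, upgrading with `pip install -U bitsandbytes` should pick up the fix.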
What's Changed
- Add trove-classifiers requirement to pyproject.toml by @ccoulombe in #1774
- Fix regression in 8bit parameter device movement by @matthewdouglas in #1776
Full Changelog: 0.48.0...0.48.1