Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

GPTQModel v0.9.5

Choose a tag to compare

@Qubitium Qubitium released this 05 Jul 13:48
· 1544 commits to main since this release
f0a1ee8

What's Changed

Another large update with added support for Intel/Qbits quantization/inference on CPU. Cuda kernels have been fully deprecated in favor of better performing Exllama (v1/v2), Marlin, and Triton kernels.

Full Changelog: v0.9.4...v0.9.5

Morty Proxy This is a proxified and sanitized view of the page, visit original site.