Closed
Description
🚀 tl;dr
Attached is a proposal for graph mode quantization in PyTorch (model_quantizer) that provides end-to-end post-training quantization support for both mobile and server backends. The model quantizer supports fp32 and int8 precisions as a starting point and will expand to other precision types based on customer needs. Details can be found in the attached PDF doc:
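As background for the int8 path mentioned above, post-training quantization typically maps observed fp32 value ranges to int8 via affine (scale/zero-point) quantization. The sketch below shows that arithmetic in plain Python; the function names and the symmetric int8 range are illustrative assumptions, not part of the attached proposal's API.

```python
def choose_qparams(min_val, max_val, qmin=-128, qmax=127):
    """Pick an affine scale and zero point so the observed fp32 range
    [min_val, max_val] maps onto the int8 range [qmin, qmax].
    (Illustrative helper; not the proposal's actual API.)"""
    # Make sure 0.0 is exactly representable, as quantized kernels expect.
    min_val = min(min_val, 0.0)
    max_val = max(max_val, 0.0)
    scale = (max_val - min_val) / (qmax - qmin)
    if scale == 0.0:  # degenerate range: all observed values equal
        scale = 1.0
    zero_point = round(qmin - min_val / scale)
    zero_point = max(qmin, min(qmax, zero_point))
    return scale, zero_point

def quantize(values, scale, zero_point, qmin=-128, qmax=127):
    """fp32 -> int8: round to nearest, then clamp into [qmin, qmax]."""
    return [max(qmin, min(qmax, round(v / scale + zero_point))) for v in values]

def dequantize(qvalues, scale, zero_point):
    """int8 -> fp32: invert the affine mapping (lossy, error <= scale/2)."""
    return [(q - zero_point) * scale for q in qvalues]

# Example: calibrate on some activations, then round-trip them.
data = [-1.0, 0.0, 0.5, 2.0]
scale, zp = choose_qparams(min(data), max(data))
q = quantize(data, scale, zp)
deq = dequantize(q, scale, zp)
```

In the actual flow, the min/max statistics would come from observers inserted into the graph during a calibration pass, rather than being computed directly from a list.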
Metadata
Labels
A request for a proper, new feature.
Quantization support in PyTorch
This issue has been looked at by a team member, and triaged and prioritized into an appropriate module.