Transformers-Patch 🛠️

Memory optimization patches for HuggingFace Transformers.

Features ✨

  • Memory Reduction - Cuts activation memory roughly in half in the benchmark below

  • Zero Configuration - Works automatically after import; a sketch of the general pattern follows this list
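
The README does not spell out the mechanism, but "works automatically after import" usually means the package monkey-patches library internals when it is imported. A minimal sketch of that general pattern (purely illustrative, wrapping a real PyTorch function; this is not the repository's actual code):

# Hypothetical illustration of import-time monkey patching, not transformers_patch itself.
import torch.nn.functional as F

_original_gelu = F.gelu  # keep a handle on the original implementation

def _patched_gelu(x, approximate="none"):
    # a real memory patch would substitute a leaner implementation here;
    # this wrapper just forwards to the original
    return _original_gelu(x, approximate=approximate)

F.gelu = _patched_gelu  # swapped in as a side effect of the import, hence "zero configuration"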

Installation ⚡

pip install git+https://github.com/GeeeekExplorer/transformers-patch.git

Quick Start 🚀

Just import the patch before loading any Transformers model:

import transformers_patch  # applies the memory patches as a side effect of the import
from transformers import AutoModel
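
After the import, models are created exactly as usual; for example (the checkpoint name below is only an illustration, using the benchmark model):

model = AutoModel.from_pretrained("Qwen/Qwen3-8B")  # no further configuration needed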

Benchmark 📊

Test Configuration:

  • 8x GPU machine
  • Micro batch size: 1
  • Sequence length: 4096
  • Gradient checkpointing: Disabled
  • Model: Qwen3-8B

Memory Component      | Fixed Allocation | Before Patch | After Patch
--------------------- | ---------------- | ------------ | ------------
Model + Gradients     | 30.5 GB          | -            | -
ZeRO Optimizer States | 11.4 GB          | -            | -
Activations           | -                | 35.4 GB      | 17.8 GB

50% reduction in activation memory!

Example Usage 📋

See the complete example in train.py.
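
Until you open train.py, here is a hedged sketch of a single training step with the patch applied; the checkpoint name, optimizer, and dummy batch are illustrative assumptions, not taken from the repository's script:

import torch
import transformers_patch  # must be imported before the model is constructed
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen3-8B"  # assumption: the benchmark model; any causal LM should work
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

batch = tokenizer("A dummy training sample.", return_tensors="pt").to("cuda")
loss = model(**batch, labels=batch["input_ids"]).loss  # standard causal-LM loss
loss.backward()
optimizer.step()
optimizer.zero_grad()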

Acknowledgements 🙏
