The project includes bilingual support.

项目简介

这是一个数字分身项目，核心思想是利用C2C 聊天记录作为数据集，对大模型进行微调，让模型尽可能还原你独有的表达风格和聊天方式。

This project is a personal digital twin built by fine-tuning a large language model on your own chat history. The goal is to recreate your unique style of expression and conversational behavior with high fidelity.

The project includes bilingual support.

项目包含双语支持

中文文档

English Documents

项目包含了完整的教程，包括：

QQ 数据库的解密与处理
聊天数据清洗与转换
QLora 微调流程
微调模型的测试与使用
使用unsloth加速训练!

我知道类似的项目其实已经有不少了，但也许我的教程、流程、代码实现能给你一些不一样的帮助或启发。如果对你有用，欢迎点个 star，我会很开心的！

目前这个项目还有很多不足：

暂时不知道有什么不足
(如果有问题欢迎开Issues)
但已经可以在 4090 24G 显卡上用 fp8 精度微调 Qwen3-8B（亲测可用） "部分代码参考自 Weclone" 如果你也想打造属于自己的数字分身，那也来试试吧!

—— X: @qqqqqf5 Email: qingf622@outlook.com Github:@qqqqqf-q

项目版本

V 0.1.5 Develop

项目状态

由于0.1.4版本对于代码进行了许多重构
所以可能有更多的Bug
欢迎各位开发者来提Issues,PR
贡献这个小项目

开发问题

cli的train,data convert都存在问题,暂时还是只能用老版本调用
微调脚本需要重构(正在思考是继续Qlora+Unsloth还是转向Llama Factory)
文档部分由于重构了项目还有一些没有修改的
已经被重构的部分没有增加双语支持
todo1.增加serverapi为webui做准备
代码未优化

Name	Name	Last commit message	Last commit date
Latest commit History 91 Commits
.github/workflows	.github/workflows
cli	cli
dataset/examples	dataset/examples
docs	docs
environment	environment
finetune	finetune
merge_data	merge_data
process_data	process_data
utils	utils
.gitattributes	.gitattributes
.gitignore	.gitignore
CLAUDE.md	CLAUDE.md
LICENSE	LICENSE
cli.py	cli.py
readme.md	readme.md
requirements.txt	requirements.txt
run_finetune.py	run_finetune.py
setting_template.jsonc	setting_template.jsonc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

项目简介

这是一个数字分身项目，核心思想是利用C2C 聊天记录作为数据集，对大模型进行微调，让模型尽可能还原你独有的表达风格和聊天方式。

This project is a personal digital twin built by fine-tuning a large language model on your own chat history. The goal is to recreate your unique style of expression and conversational behavior with high fidelity.

The project includes bilingual support.

项目包含双语支持

中文文档

English Documents

项目包含了完整的教程，包括：

—— X: @qqqqqf5 Email: qingf622@outlook.com Github:@qqqqqf-q

项目版本

V 0.1.5 Develop

项目状态

开发问题

About

Uh oh!

Contributors 2

Uh oh!

Languages

Search code, repositories, users, issues, pull requests...

License

qqqqqf-q/Qing-Digital-Self

Folders and files

Latest commit

History

Repository files navigation

项目简介

这是一个数字分身项目，核心思想是利用C2C 聊天记录作为数据集，对大模型进行微调，让模型尽可能还原你独有的表达风格和聊天方式。

This project is a personal digital twin built by fine-tuning a large language model on your own chat history. The goal is to recreate your unique style of expression and conversational behavior with high fidelity.

The project includes bilingual support.

项目包含双语支持

中文文档

English Documents

项目包含了完整的教程，包括：

—— X: @qqqqqf5 Email: qingf622@outlook.com Github:@qqqqqf-q

项目版本

V 0.1.5 Develop

项目状态

开发问题

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages