# minigpt

**Repository Path**: yi_peng2000/minigpt

## Basic Information

- **Project Name**: minigpt
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: MulanPSL-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2024-04-04
- **Last Updated**: 2024-08-04

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# minigpt

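#### Introduction
This project requires DeepSpeed Chat and an installed nvidia-cuda-toolkit. After extensive adjustments to the Ubuntu environment, I resolved the incompatibility between the NVIDIA RTX 4080 driver and nvidia-cuda-toolkit. Fine-tuning consumes a lot of GPU memory, and because my GPU is limited I ran into overflow. Lowering the precision resolved the fp16 overflow, after which I completed the full DeepSpeed Chat fine-tuning pipeline, generated a model, and ran question-answering tests on it.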
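To illustrate why fp16 overflows, here is a minimal sketch in plain Python. It is not the configuration used in this repository, and the README does not record which setting was changed. The function name `find_safe_loss_scale` and the sample gradient values are hypothetical. The sketch mimics the back-off idea behind dynamic loss scaling: halve the scale until every scaled gradient fits within the fp16 range.

```python
# Toy illustration of fp16 overflow and loss-scale back-off.
# Assumption: this is a simplified model, not DeepSpeed's actual implementation.

FP16_MAX = 65504.0  # largest finite value in IEEE 754 half precision


def overflows_fp16(x: float) -> bool:
    """Return True if x is too large in magnitude for a finite fp16 value."""
    return abs(x) > FP16_MAX


def find_safe_loss_scale(grads, initial_scale=2.0 ** 16, min_scale=1.0):
    """Halve the loss scale until no scaled gradient overflows fp16.

    `grads` is a list of hypothetical unscaled gradient values.
    """
    scale = initial_scale
    while scale > min_scale and any(overflows_fp16(g * scale) for g in grads):
        scale /= 2.0
    return scale


if __name__ == "__main__":
    sample_grads = [0.5, 3.0]  # hypothetical values for illustration
    print(find_safe_loss_scale(sample_grads))
```

With the sample values, the initial scale of 65536 pushes the gradient 3.0 past the fp16 limit, so the scale is halved until the product fits.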



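In my experience, the real difficulty of fine-tuning lies in alignment. If the model that DeepSpeed Chat uses does not match the model being fine-tuned, errors follow, so the two must be aligned. For example, if training used gpt2, then DeepSpeed Chat should also use gpt2 when fine-tuning the trained model; that is the quickest path to success.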



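If your boss insists that you use OPT-6.7B to fine-tune a model trained with gpt2, the two will not align, and you will spend a lot of time working out how to align them. This is not easy: it requires data preprocessing and conversion, adjusting the fine-tuning strategy, possibly fine-tuning layer by layer, and continuous monitoring and evaluation before the fine-tuning has a chance of succeeding. This is my personal experience with fine-tuning.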
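The kind of mismatch described above can be caught early by comparing the two models' basic dimensions before launching a run. The following is a minimal sketch: the helper `config_mismatches` and the dictionary keys are hypothetical, and the numbers are illustrative values for gpt2 (small) and OPT-6.7B that should be checked against the actual model config files.

```python
# Hypothetical pre-flight check: compare key dimensions of two model configs.
# Assumption: configs are plain dicts; real configs would be loaded from the
# model files, and the values below are illustrative only.

def config_mismatches(base, other, keys=("vocab_size", "hidden_size", "num_layers")):
    """Return {key: (base_value, other_value)} for every key that differs."""
    return {
        k: (base.get(k), other.get(k))
        for k in keys
        if base.get(k) != other.get(k)
    }


gpt2_cfg = {"vocab_size": 50257, "hidden_size": 768, "num_layers": 12}
opt_6_7b_cfg = {"vocab_size": 50272, "hidden_size": 4096, "num_layers": 32}

if __name__ == "__main__":
    diffs = config_mismatches(gpt2_cfg, opt_6_7b_cfg)
    for key, (a, b) in diffs.items():
        print(f"{key}: gpt2={a} vs opt-6.7b={b}")
```

An empty result means the checked fields agree. Any reported difference signals that weights or tokenizer outputs from one model cannot be loaded directly into the other.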


#### Contributing

1.  Fork this repository
2.  Create a Feat_xxx branch
3.  Commit your code
4.  Open a Pull Request

