试用中英双语对话模型chatglm2-6b-int4

wffger 收录于类别 AI

2023-07-03 2023-07-03 约 233 字预计阅读 1 分钟

本地试用chatglm2-6b-int4

本地机器16GB内存，不足以跑通chatglm2-6b，只能使用chatglm2-6b-int4。

步骤

安装git lfs
创建环境
下载模型
命令行调用

准备环境：

1
2
3
4
5
6


mkdir u_chatglm2
cd u_chatglm2
pipenv --python 3.11
pipenv shell
pip install protobuf transformers==4.30.2 cpm_kernels torch>=2.0 gradio mdtex2html sentencepiece accelerate
git clone https://huggingface.co/THUDM/chatglm2-6b-int4

使用：

1
2
3
4
5
6
7
8


from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("chatglm2-6b-int4", trust_remote_code=True)
model = AutoModel.from_pretrained("chatglm2-6b-int4",trust_remote_code=True).float()
model = model.eval()
response, history = model.chat(tokenizer, "你好", history=[])
print(response)
response, history = model.chat(tokenizer, "内脏脂肪多怎么办", history=history)
print(response)

效果

/images/chatglm2-6b-int4.png — chatglm2-6b-int4 result

评价

回答第二个问题需要二十多分钟。

目录

试用中英双语对话模型chatglm2-6b-int4

本地试用chatglm2-6b-int4

步骤

效果

评价

相关内容

目录

试用中英双语对话模型chatglm2-6b-int4

本地试用chatglm2-6b-int4

步骤

效果

评价

相关内容

AWS Lambda Image Resizer by Python