模型下载
安装git lfs
curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
apt-get install git-lfs
git lfs install
git clone https://huggingface.co/THUDM/chatglm-6b
这样就下载完成了。
如果是llama.app
https://github.com/ggerganov/llama.cpp
下载程序
git clone https://github.com/ggerganov/llama.cpp
cd llama.cppOn Windows:
- Download the latest fortran version of w64devkit.
- Extract
w64devkiton your pc. - Run
w64devkit.exe. - Use the
cdcommand to reach thellama.cppfolder. - From here you can run:
make
python3 convert.py models/7B/上面命令转换成ggml-model-f16.bin文件然后./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin q4_0转换成gml-model-q4_0.bin文件运行下面命令就可以运行了:./main -m ./models/7B/ggml-model-q4_0.bin -n 128
没有评论:
发表评论