Closed
Description
Reminder
- I have read the README and searched the existing issues.
System Info
llamafactory
version: 0.8.2.dev0- Platform: Linux-5.4.119-19-0009.11-x86_64-with-glibc2.35
- Python version: 3.11.7
- PyTorch version: 2.3.0+cu121 (GPU)
- Transformers version: 4.41.2
- Datasets version: 2.19.2
- Accelerate version: 0.30.1
- PEFT version: 0.11.1
- TRL version: 0.8.6
- GPU type: NVIDIA Graphics Device
- DeepSpeed version: 0.14.0
- vLLM version: 0.4.3
Reproduction
部署API的yam文件
model_name_or_path: /mnt/sft_full_qwen2_7B_Instruct_v4/checkpoint-100
template: qwen
cutoff_len: 4096
do_sample: false
部署结果
Expected behavior
No response
Others
No response
Activity