AI Models & Infra

LLaMA Factory: a Versatile Efficient Training Framework for LLMs

Speakers

Yaowei Zheng

Date / Time

2024-10-18

11:10

Presentation Slides

Presentation Video

YouTube

LLaMA Factory is an easy-to-use, scalable open-source framework for efficient training of large models. It has received more than 30,000 stars on GitHub and has drawn the attention of, or been adopted by, many companies in China and abroad. This talk examines the motivation behind LLaMA Factory and its component modules in detail from the perspective of efficient large-model training, including the principles behind its full-stack fine-tuning support for hundreds of large models, operator-level optimizations that accelerate LoRA, and the ideas behind integrating a variety of fine-tuning tricks.