AI Models & Infra

LLaMA Factory: a Versatile Efficient Training Framework for LLMs

Speakers
Date / Time
2024-10-18
11:10

LLaMA Factory is an efficient, easy-to-use, and scalable open source framework for efficient training of large models. It has received more than 30,000 Stars in the GitHub open source community and has attracted the attention or application of many domestic and foreign companies. This sharing will analyze the construction motivation and component modules of LLaMA Factory in detail from the perspective of efficient training of large models, including the full-stack fine-tuning adaptation principles of hundreds of large models, LoRA operator optimization acceleration methods, and various fine-tuning trick integration ideas.