TRL setup_chat_format()

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Mar 4 9:40
Editor
Edited
Edited
2025 Jul 13 22:40
Preparing models and tokenizers for chat-based fine-tuning
  • Adds essential tokens like mark conversation boundaries
  • Automatically resizes the model's embedding layer to accommodate the newly added special tokens
  • Sets up the tokenizer's chat_template to format input data into a chat-like structure, defaulting to OpenAI's
    ChatML
    format
 
 
 
 
How to Fine-Tune LLMs in 2024 with Hugging Face
In this blog post you will learn how to fine-tune LLMs using Hugging Face TRL, Transformers and Datasets in 2024. We will fine-tune a LLM on a text to SQL dataset.
How to Fine-Tune LLMs in 2024 with Hugging Face
 
 

Recommendations