FSDP-QLoRA doc updates for TRL integration #1471

blbadger · 2025-01-08T23:39:47Z

Small documentation changes to keep the FSDP-QLoRA doc compatible with the SFTTrainer class, which received updates as of TRL v0.13.0:

max_seq_length and dataset_text_field are now arguments of the SFTConfig class rather than of SFTTrainer, so calling the trainer with these arguments retained causes errors.
The tokenizer argument is being deprecated in favor of the more general processing_class, which can be a tokenizer.

The doc currently does not define the training_arguments arg, but we can assume that it is an instance of the SFTConfig class, in which case max_seq_length and dataset_text_field would be defined there. I have removed these args from the SFTTrainer instantiation for this reason, although they could be defined explicitly in a config if that is preferable.

Swapping tokenizer for processing_class deals with the second change. This modification is optional for now, as the deprecated tokenizer kwarg is swapped for processing_class automatically, but it will probably be useful going forward.
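
For anyone updating their own scripts, the two changes can be sketched as a plain-Python argument split. This is only an illustration: migrate_sft_kwargs, MOVED_TO_CONFIG, and RENAMED are hypothetical names that are not part of TRL, and the argument values are placeholders. It shows which keys move to the config and which key is renamed, without importing TRL itself.

```python
# Hypothetical helper (not part of TRL): splits keyword arguments written for a
# pre-v0.13.0 SFTTrainer call into SFTConfig kwargs and SFTTrainer kwargs.

# Arguments that moved from SFTTrainer to SFTConfig.
MOVED_TO_CONFIG = ("max_seq_length", "dataset_text_field")

# Arguments that stay on SFTTrainer under a new name.
RENAMED = {"tokenizer": "processing_class"}


def migrate_sft_kwargs(old_kwargs):
    """Return (config_kwargs, trainer_kwargs) for the newer call style."""
    config_kwargs = {}
    trainer_kwargs = {}
    for name, value in old_kwargs.items():
        if name in MOVED_TO_CONFIG:
            config_kwargs[name] = value
        else:
            # Rename deprecated keys; pass everything else through unchanged.
            trainer_kwargs[RENAMED.get(name, name)] = value
    return config_kwargs, trainer_kwargs


# Placeholder values stand in for a real model, dataset, and tokenizer.
old_style = {
    "model": "model",
    "train_dataset": "dataset",
    "tokenizer": "tokenizer",
    "max_seq_length": 1024,
    "dataset_text_field": "text",
}
config_kwargs, trainer_kwargs = migrate_sft_kwargs(old_style)

# With TRL installed, the results would be used roughly as follows
# (assumed usage, shown as comments so this sketch runs without TRL):
#   training_arguments = SFTConfig(output_dir="out", **config_kwargs)
#   trainer = SFTTrainer(args=training_arguments, **trainer_kwargs)
```

Here config_kwargs ends up holding max_seq_length and dataset_text_field, while trainer_kwargs carries processing_class in place of tokenizer.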

doc updates

c6a494c
