
Can it do LoRA SFT? #24

Open
ReverseSystem001 opened this issue Nov 6, 2023 · 2 comments

Comments

@ReverseSystem001

Most people are limited by their GPU hardware, so for them LoRA is the only practical way to fine-tune. Can this repo do LoRA SFT?

@CoinCheung
Owner

Hi, thanks for paying attention to this!

This repo is currently designed for full-parameter finetuning, whereas LoRA freezes most of the parameters. Since the two approaches contradict each other, this repo does not support LoRA at the moment.
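For reference, the contradiction is easy to see in code: LoRA freezes the pretrained weight and trains only a small low-rank update on the side. A minimal PyTorch sketch (the `LoRALinear` class and its hyperparameters are illustrative, not something from this repo):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen linear layer with trainable low-rank adapters."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)   # freeze the pretrained weight
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        # only these two small matrices receive gradients
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling
```

Only `lora_A` and `lora_B` are updated, which is the opposite of the full-parameter training that this repo's pipeline is built around.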

This repo is based on the pipeline method, which lets you train your model with DP + PP (Megatron-LM is DP + PP + TP, the so-called 3D layout). This is faster and requires less memory than ZeRO-based methods when you do not have very many GPUs (100+). You can train a 7B or 13B model on a server with 8 GPUs (24 GB each), which I believe many companies can afford.
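For context, this is roughly how a DP + PP run is wired up with DeepSpeed's pipeline engine. It is a hedged sketch, not this repo's actual training script; the toy layer list, `ds_config.json`, and `train_iter` are placeholders:

```python
import deepspeed
import torch.nn as nn
from deepspeed.pipe import PipelineModule

# Toy layer list standing in for embedding + transformer blocks + lm head.
# In a real run these would be the model's actual layers.
layers = [nn.Linear(512, 512) for _ in range(8)]

model = PipelineModule(
    layers=layers,
    num_stages=4,              # PP degree; DP degree = world_size // num_stages
    loss_fn=nn.MSELoss(),      # placeholder loss for the toy layers
)

# ds_config.json would be a normal DeepSpeed config (batch size, fp16, optimizer, ...).
engine, _, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config="ds_config.json",
)

# One pipeline step pulls micro-batches from the iterator and runs
# forward/backward/optimizer across all stages.
loss = engine.train_batch(data_iter=train_iter)  # train_iter: placeholder data iterator
```

Each of the `num_stages` pipeline stages holds only its slice of the layers, and the remaining GPUs replicate the pipeline for data parallelism, which is why 7B/13B models fit on 8x24 GB cards without ZeRO.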

@ReverseSystem001
Author

ReverseSystem001 commented Nov 7, 2023 via email
