Support constant lr with cooldown #35453
Thanks for the PR @LoserCheems. We already have a warmup_stable_decay scheduler with |
Oh, thanks for your suggestion. I intend to integrate different cooldown methods and a minimum learning rate into |
Thanks! We can't really rename it wsc; wsd is a real term that appears in the MiniCPM paper. |
Thank you, the renaming of the function is done. |
What does this PR do?
Fixes #35449
Added the 'warmup_stable_cooldown' learning rate scheduler method.
This method has three phases: linear warmup, stable, and cooldown. The cooldown can use the linear, cosine, or 1-sqrt method.
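For readers skimming the thread, the three-phase schedule described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function name `wsc_lr_multiplier`, its parameter names, and the `min_lr_ratio` floor (loosely based on the "minimum learning rate" mentioned in the conversation) are all assumptions made for the example.

```python
import math


def wsc_lr_multiplier(
    step,
    num_warmup_steps,
    num_stable_steps,
    num_cooldown_steps,
    cooldown_type="linear",
    min_lr_ratio=0.0,
):
    """Hypothetical helper: factor by which the base learning rate is scaled at `step`."""
    # Phase 1: linear warmup from 0 up to 1.
    if step < num_warmup_steps:
        return step / max(1, num_warmup_steps)

    # Phase 2: hold the learning rate at its peak value.
    if step < num_warmup_steps + num_stable_steps:
        return 1.0

    # Phase 3: cooldown. `progress` runs from 0 to 1 and is clamped at 1.
    elapsed = step - num_warmup_steps - num_stable_steps
    progress = min(1.0, elapsed / max(1, num_cooldown_steps))

    if cooldown_type == "linear":
        factor = 1.0 - progress
    elif cooldown_type == "cosine":
        factor = 0.5 * (1.0 + math.cos(math.pi * progress))
    elif cooldown_type == "1-sqrt":
        factor = 1.0 - math.sqrt(progress)
    else:
        raise ValueError(f"Unknown cooldown_type: {cooldown_type!r}")

    # Assumed behaviour: decay towards min_lr_ratio instead of towards 0.
    return min_lr_ratio + (1.0 - min_lr_ratio) * factor
```

A function of this shape could be bound with `functools.partial` and passed as `lr_lambda` to `torch.optim.lr_scheduler.LambdaLR`, which multiplies the optimizer's base learning rate by the returned factor at each step.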
Who can review?
@muellerzr and @SunMarc