0.43.2: significant QLoRA mem savings due to bug fix, CUDA 12.5 support #1291
Titus-von-Koeller
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
0.43.2
This release is quite significant as the QLoRA bug fix big implications for higher
seqlen
and batch sizes.For each sequence (i.e. batch size increase of one) we expect memory savings of:
This was due to activations being unnecessary for frozen parameters, yet the memory for them was still erroneously allocated due to the fixed bug.
Improvements:
Bug Fixes
str2optimizer32bit
(Add"lamb"
tostr2optimizer32bit
#1222, thanks @EtienneDosSantos)This discussion was created from the release 0.43.2: significant QLoRA mem savings due to bug fix, CUDA 12.5 support.
Beta Was this translation helpful? Give feedback.
All reactions