-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LUMI scripts - Mosaic/llm-foundry #190
Conversation
And yes I know that there's a Huggingface token in there. It's invalid, need to find a better way to manage that. |
This PR is stale because it has been open 1+ days with no activity. Feel free to either 1) remove the stale label or 2) comment. If nothing happens, this will be closed in 7 days. |
Please don't close my draft PR because it's not active over xmas 😅 |
Okay, so this whole thing works now. There are a few unused scripts containing some of my other attempts at setting things up on LUMI - I should perhaps move these somewhere else to keep things clean. The important files are:
Additionally, I've added two submodules (so you have to clone with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good but have a few questions. Will try to make it run on LUMI as well (after Christmas) and that might lead to more questions, but that does not need to hold the PR back.
src/dfm/projects/production/model_training/scripts/continue_mistral.py
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/continue_mistral.py
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/continue_mistral.py
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/continue_mistral.py
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/continue_mistral.sh
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/pretrain_llama2.sh
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/yamls/continue-mistral-7b.yaml
Outdated
Show resolved
Hide resolved
src/dfm/projects/production/model_training/scripts/yamls/continue-mistral-7b.yaml
Outdated
Show resolved
Hide resolved
Stuff that is still being worked on feel free to keep that, but stuff that could be deleted and recovered from the history (if needed) might as well be deleted
def. pin them |
This PR is stale because it has been open 1+ days with no activity. Feel free to either 1) remove the stale label or 2) comment. If nothing happens, this will be closed in 7 days. |
@rlrs will remove the stale label (this will give it another 7 days) as I assume you might be on vacation |
This PR is stale because it has been open 1+ days with no activity. Feel free to either 1) remove the stale label or 2) comment. If nothing happens, this will be closed in 7 days. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
This PR is stale because it has been open 1+ days with no activity. Feel free to either 1) remove the stale label or 2) comment. If nothing happens, this will be closed in 7 days. |
(Continued) Pretraining setup for LUMI. These scripts are now for a
mosaicai/llm-foundry
stack. Everything should work.