v0.3.1-cpt-dynamic_batch_loading: Llama2 CPT with Dynamic Batch Loading
Spico197
released this
17 Nov 09:09
·
51 commits
to main
since this release
- Llama2 CPT with 4096 context length training.
- Dynamic batch loading from ShearedLlama Implementation.