Skip to content

A reproduced implementation of "Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding".

License

Notifications You must be signed in to change notification settings

tosiyuki/LLM-to-SLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM-to-SLM

A reproduced implementation of "Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding".

Setup

git clone https://github.com/tosiyuki/LLM-to-SLM.git

Download Training Data

wget -P dataset/ https://github.com/tatsu-lab/stanford_alpaca/blob/main/alpaca_data.json

Train

bash scripts/t5-3b-to-gpt2.sh

Demo

python demo.py

About releasing weights

Acknowledgement

Bergner et al, Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding, 2024

About

A reproduced implementation of "Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published