Skip to content
Carlos Lizarraga-Celaya edited this page Dec 9, 2024 · 15 revisions

Build a LLM from Scratch

Goal: Replicate Sebastian Raschka's book and code

![Book](https://camo.githubusercontent.com/54a738f9f8e7a0d8660d69a63af04c1f74b7c3059c349c78c29e545422ea73ad/68747470733a2f2f73656261737469616e72617363686b612e636f6d2f696d616765732f4c4c4d732d66726f6d2d736372617463682d696d616765732f636f7665722e6a70673f313233 =300x)


Instructors

Enrique Noriega

Enrique is a computational research scientist in the Department of Computer Science and the Data Science Institute at the University of Arizona. He specializes in developing AI applications for medical sciences and is passionate about working with deep learning models.

Carlos Lizárraga

Carlos is a Computational and Data Science Educator at the Data Science Institute at the University of Arizona. With a strong background in applied mathematics and physics, he focuses on applying machine learning and deep learning models to scientific research.


Schedule Spring 2025 (Jan 28th - Mar 25th)

Time: Thursdays 1 PM **Where: ** **Zoom link: **

Topic Date Description
PyTorch refresh and setup Jan 28
Understanding large language models Feb 4
Working with text data Feb 11
Coding attention mechanisms Feb 18
Implementing a GPT from scratch to generate text Feb 25
Pretraining on unlabeled data Mar 4
Spring break - NO session Mar 8 - 16
Fine-tuning for classification Mar 18
Fine-tuning to follow instructions Mar 25

Created: 12/08/2024 (C. Lizárraga); Last update: 12/08/2024 (C. Lizárraga)

CC BY-NC-SA

UArizona DataLab, Data Science Institute, University of Arizona, 2024.

Clone this wiki locally