ptdeco

ptdeco is a library for model optimization by matrix decomposition built on top of PyTorch.

An introductory presentation about ptdeco was given at the Warsaw AI meetup on 2024.05.23: Practical low-rank decomposition (not only) for large language models.

Contents of this README:

Introduction
Installation
Saving and loading a decomposed model
- Saving a decomposed model
- Loading a decomposed model
Links to other methods for model compression by decomposition

Introduction

Currently, ptdeco implements the following methods:

- dwain: an iterative method based on low-rank decomposition of features (dwain = Decomposing Weights Algorithm - an Iterative techNique). Tested on LLMs (large language models) and vision models.
- lockd: a method based on local knowledge distillation (lockd = LOCal Knowledge Distillation). Tested on vision models.
- falor: a method based on low-rank decomposition of features, inspired by "Compressing Transformers: Features Are Low-Rank, but Weights Are Not!" by Yu Hao, Wu Jianxin (2023) (falor = Features Are LOw Rank). Tested on vision models.

The dwain method does not require pretraining. It can decompose linear layers and 1x1 convolutions.

The lockd method requires a short (about 10 ImageNet epochs) knowledge distillation pretraining before decomposition. It can decompose linear layers and convolutions.

The falor method does not require pretraining. Model decomposition takes less than 1 GPU hour (depending on model size and parameters). It can decompose linear layers and 1x1 convolutions.
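
To give a feel for what decomposing a linear layer means, here is a minimal sketch that replaces one `nn.Linear` with two smaller ones using a truncated SVD of its weight matrix. This is not how dwain, lockd or falor work (dwain and falor decompose based on features, lockd uses distillation), and the helper name `decompose_linear_svd` is hypothetical and not part of ptdeco. It only illustrates the general idea of swapping a layer for a low-rank pair.

```python
import torch
from torch import nn


def decompose_linear_svd(layer: nn.Linear, rank: int) -> nn.Sequential:
    """Approximate `layer` with two smaller linear layers (hypothetical helper)."""
    weight = layer.weight.detach()  # shape: (out_features, in_features)
    u, s, vh = torch.linalg.svd(weight, full_matrices=False)

    first = nn.Linear(layer.in_features, rank, bias=False)
    second = nn.Linear(rank, layer.out_features, bias=layer.bias is not None)
    with torch.no_grad():
        # Keep the `rank` largest singular values and fold them into the first factor.
        first.weight.copy_(s[:rank, None] * vh[:rank, :])  # (rank, in_features)
        second.weight.copy_(u[:, :rank])                    # (out_features, rank)
        if layer.bias is not None:
            second.bias.copy_(layer.bias)
    return nn.Sequential(first, second)


def count_parameters(module: nn.Module) -> int:
    return sum(p.numel() for p in module.parameters())


torch.manual_seed(0)
layer = nn.Linear(64, 48)
with torch.no_grad():
    # Give the layer a weight of rank 8, so a rank-8 decomposition loses nothing.
    layer.weight.copy_(torch.randn(48, 8) @ torch.randn(8, 64))

decomposed = decompose_linear_svd(layer, rank=8)

x = torch.randn(4, 64)
max_abs_error = (layer(x) - decomposed(x)).abs().max().item()
params_before = count_parameters(layer)
params_after = count_parameters(decomposed)
print(f"parameters: {params_before} -> {params_after}, max abs error: {max_abs_error:.2e}")
```

In this toy case the weight is low-rank by construction, so the two-layer replacement reproduces the original outputs up to floating-point error while using fewer parameters. Real model weights are usually not low-rank, which is why the methods above rely on features or distillation rather than a plain SVD of the weights.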

Installation

pip install ptdeco

Saving and loading a decomposed model

Saving a decomposed model

Decomposition produces a decompose_config dictionary. Serialize it, e.g. to JSON, so that you can later recreate the structure of the decomposed model. In addition, save the state_dict to recover the weights of the decomposed model. The code below illustrates the procedure:

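import json
import pathlib

import torch

# Your decomposition code, which produces `decompose_config` and the decomposed `model`

output_path = pathlib.Path("YOUR/CHECKPOINT/DIRECTORY")
output_path.mkdir(parents=True, exist_ok=True)

# Save the structure of the decomposed model
out_decompose_config_path = output_path / "decompose_config.json"
with open(out_decompose_config_path, "wt") as f:
    json.dump(decompose_config, f)

# Save the weights of the decomposed model
out_decompose_state_dict_path = output_path / "decompose_state_dict.pt"
torch.save(model.state_dict(), out_decompose_state_dict_path)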

Loading a decomposed model

To load the model, first recreate the original model. Next, load and apply the decompose_config. Finally, load the state_dict (note that the state dict "fits" the decomposed model, not the original one, so this has to be the last step). The code below illustrates the procedure:

import json
import pathlib

import torch

import ptdeco

model = ...   # Build the original (not yet decomposed) model
device = ...  # Specify the device the original model uses

output_path = pathlib.Path("YOUR/CHECKPOINT/DIRECTORY")

# Recreate the structure of the decomposed model
with open(output_path / "decompose_config.json", "rt") as f:
    decompose_config = json.load(f)

ptdeco.utils.apply_decompose_config_in_place(model, decompose_config)

# Load the weights onto `device`; they fit only the decomposed structure
sd = torch.load(output_path / "decompose_state_dict.pt", map_location=device)

model.load_state_dict(sd)

# Now `model` is decomposed and contains appropriate weights
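
The ordering requirement (apply the config first, load the weights last) can be illustrated with plain PyTorch. The sketch below uses small stand-in modules rather than a real ptdeco model, and the helper `state_dict_mismatches` is hypothetical. It assumes that a decomposed layer appears as a nested pair of smaller layers, which may differ from the module layout ptdeco actually produces.

```python
import torch
from torch import nn


def state_dict_mismatches(model: nn.Module, state_dict: dict) -> dict:
    """Report keys and shapes that differ between `model` and `state_dict`."""
    model_sd = model.state_dict()
    shared = set(model_sd) & set(state_dict)
    return {
        "missing": sorted(set(model_sd) - set(state_dict)),
        "unexpected": sorted(set(state_dict) - set(model_sd)),
        "wrong_shape": sorted(
            k for k in shared if model_sd[k].shape != state_dict[k].shape
        ),
    }


# Stand-ins (assumptions): an original model and a decomposed counterpart
# in which the single linear layer was replaced by two smaller ones.
original = nn.Sequential(nn.Linear(16, 16))
decomposed = nn.Sequential(
    nn.Sequential(nn.Linear(16, 4, bias=False), nn.Linear(4, 16))
)
saved_sd = decomposed.state_dict()

# The saved weights do not fit the original structure ...
report_original = state_dict_mismatches(original, saved_sd)
# ... but they fit the decomposed one exactly.
report_decomposed = state_dict_mismatches(decomposed, saved_sd)

print("original:  ", report_original)
print("decomposed:", report_decomposed)
```

A check like this can be run before `load_state_dict` to get a readable report instead of a long error message when the config was not applied first.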

Links to other methods for model compression by decomposition

Other methods using decomposition for model compression, not implemented in this package:
