tf-protoNN

This repository contains the code for ProtoNN (a KNN based algorithm) implemented in Tensorflow for large-scale multi-label learning. This repository also has a script to run the training on multiple GPUs.

Note: some modifications have been made to improve run-time and performance on large-scale datasets. For more details about ProtoNN, please refer to ProtoNN: Compressed and Accurate kNN for Resource-scarce Devices. If you are seeking to reproduce the results in the original paper, please use the official code provided by the authors.

Extreme multi-label (XML) algorithms

Unlike multi-class or binary classification, extreme multi-label (XML) algorithms tag data points with a subset of labels (rather than just a single label) from an extremely large label-set. XML problems usually deal with a large number of labels (10³ - 10⁶ labels) and a large number of dimensions and training points.

For datasets, check: XML-repository

Required packages

Tensorflow
FAISS
Numpy
Scipy
Easydict

Usage

Check the ipython notebook to run the code on Eurlex-4k dataset. To change the parameters, modify the config file.

To run on a new dataset:

Create a new folder with the directory name. Place two separate files train_data.mat and test_data.mat in that directory. Note that each of these files must have two variables: X with shape: (num instances, num features) and Y with shape (num instances, num labels)
Create a config file in cfgs folder with the required parameters.
For single GPU: Modify eurlex_train.py -> train.py (import the correct config file). For training on multiple GPUs modify eurlex_multigpu_train.py -> train.py and run python train.py

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
cfgs		cfgs
datasets/eurlex		datasets/eurlex
experiments/eurlex		experiments/eurlex
model		model
preprocess		preprocess
trainer		trainer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eurlex_multigpu_train.py		eurlex_multigpu_train.py
eurlex_train.py		eurlex_train.py
run_eurlex_with_preprocessing.ipynb		run_eurlex_with_preprocessing.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tf-protoNN

Extreme multi-label (XML) algorithms

Required packages

Usage

About

Releases

Packages

Languages

License

saisrivatsan/tf-protoNN

Folders and files

Latest commit

History

Repository files navigation

tf-protoNN

Extreme multi-label (XML) algorithms

Required packages

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages