Pretraining and finetuning different vision transformer models on the ImageNet and Ham10000 dataset.
This repository constians implementation of 11 image classifier models.
List of implemented models:
- VGGNet19bn
- ResNet152
- DenseNet
- InceptionV3
- ViT_base
- DeepViT_base
- CaiT_base
- T2TViT_base
- ViT_pretrained
- DeiT_pretrained
- BeiT_pretrained
Carry out the following steps to download and preprocess the dataset:
- Download the HAM10000 dataset from the following link: https://www.kaggle.com/datasets/kmader/skin-cancer-mnist-ham10000
- Extract the zip file and put all put the HAM10000 folder in the parent directory.
- Copy all the images from the HAM10000_images_part_2 folder and paste them into the HAM10000_images_part_1 folder
- Run the data_preprocessing.py script
To implement each model run the python script with the model name