InstanSeg is a pytorch-based cell and nucleus segmentation pipeline for fluorescence and brightfield microscopy images. This README provides instructions for setting up the environment, installing dependencies, and using the provided tools and models.
- InstanSeg is freely available and open source.
- It's faster than other cell segmentation methods… sometimes much faster.
- It's capable of accurately segmenting both nuclei and whole cells.
- InstanSeg can be entirely compiled in TorchScript - including postprocessing! This means it's not only easy to use in Python but also works with LibTorch alone. This allows you to run InstanSeg directly in QuPath!
- You can use InstanSeg on multiplexed images (images that have more than three channels) on novel biomarker panels, without retraining or manual intervention.
- We plan to release more InstanSeg models trained on public datasets. If there's a nucleus and/or cell segmentation dataset under a permissive open license (e.g. CC0 or CC-BY) that we missed, let us know, and we may be able to increase our InstanSeg model zoo.
InstanSeg is introduced in the QuPath pre-release v0.6.0-rc2, so you can start using InstanSeg immediately. You can find the QuPath extension source code in its GitHub repository.
If you use InstanSeg for nucleus segmentation of brightfield histology images, please cite:
Goldsborough, T. et al. (2024) ‘InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation’. arXiv. Available at: https://doi.org/10.48550/arXiv.2408.15954.
If you use InstanSeg for nucleus and/or cell segmentation in fluorescence images, please cite:
Goldsborough, T. et al. (2024) ‘A novel channel invariant architecture for the segmentation of cells and nuclei in multiplexed images using InstanSeg’. bioRxiv, p. 2024.09.04.611150. Available at: https://doi.org/10.1101/2024.09.04.611150.
- Overview
- Why should I use InstanSeg?
- InstanSeg has its own QuPath extension!
- How to cite InstanSeg:
- Table of Contents
- Installing using pip
- Usage
For a minimal installation:
pip install instanseg-torch
if you want all the requirements used for training:
pip install instanseg-torch[full]
You can get started immediately by calling the InstanSeg class:
from instanseg import InstanSeg
instanseg_brightfield = InstanSeg("brightfield_nuclei", image_reader= "tiffslide", verbosity=1)
labeled_output = instanseg_brightfield.eval(image = "../instanseg/examples/HE_example.tif",
save_output = True,
save_overlay = True)
Alternatively, if you want more control over the intermediate steps:
image_array, pixel_size = instanseg_brightfield.read_image("../instanseg/examples/HE_example.tif")
labeled_output, image_tensor = instanseg_brightfield.eval_small_image(image_array, pixel_size)
display = instanseg_brightfield.display(image_tensor, labeled_output)
from instanseg.utils.utils import show_images
show_images(image_tensor,display, colorbar=False, titles = ["Normalized Image", "Image with segmentation"])
To install InstanSeg locally, follow these steps:
-
Install either Anaconda, Mamba, or micromamba. We use micromamba for speed and simplicity, but you can replace "micromamba" with the distribution you are using.
-
In your terminal or Anaconda prompt on Windows, create a new environment and install dependencies using the provided
env.yml
file:micromamba create -n instanseg --file env.yml
-
Activate your environment:
micromamba activate instanseg
If you intend to use GPU acceleration and CUDA, follow these additional steps:
-
Uninstall existing PyTorch and reinstall with CUDA support:
micromamba remove pytorch torchvision monai micromamba install pytorch==2.1.1 torchvision==0.16.1 monai=1.3.0 pytorch-cuda=12.1 -c conda-forge -c pytorch -c nvidia pip install cupy-cuda12x
-
Check if CUDA is available:
python -c "import torch; print('CUDA is available') if torch.cuda.is_available() else print('CUDA is not available')"
The repository may work with older versions of CUDA. For this replace "12.1" and "12" with the required version.
- Build repository:
pip install -e .
To download public datasets and example images, follow the instructions under instanseg/notebooks/load_datasets.ipynb
To train InstanSeg on your own dataset, extend the instanseg/notebooks/load_datasets.ipynb with one of the templates provided.
To train models using InstanSeg, use the train.py script under the scripts folder.
For example, to train InstanSeg on the TNBC_2018 dataset over 250 epochs at a pixel resolution of 0.25 microns/pixel, run the following command:
cd instanseg/scripts
python train.py -data segmentation_dataset.pth -source "[TNBC_2018]" --num_epochs 250 --experiment_str my_first_instanseg --requested_pixel_size 0.25
To train a channel invariant InstanSeg on the CPDMI_2023 dataset, predicting both nuclei and cells, run the following command:
cd instanseg/scripts
python train.py -data segmentation_dataset.pth -source "[CPDMI_2023]" --num_epochs 250 --experiment_str my_first_instanseg -target NC --channel_invariant True --requested_pixel_size 0.5
Each epoch should take approximately 1 to 3 minutes to complete (with mps or cuda support).
For more options and configurations, refer to the parser arguments in the train.py file.
To test trained models and obtain F1 metrics, use the following command:
python test.py --model_folder my_first_instanseg -test_set Validation --optimize_hyperparameters True
python test.py --model_folder my_first_instanseg -test_set Test --params best_params
python inference.py --model_folder my_first_instanseg --image_path ../examples
Replace "../examples" with the path to your images. If InstanSeg cannot read the image pixel size from the image metadata, the user is required to provide a --pixel_size parameter. InstanSeg provides (limited) support for whole slide images (WSIs). For more options and configurations, refer to the parser arguments in the inference.py file.
Links to different model versions are stored in instanseg/models/model-index.json
. When releasing new models,
you should add entries to this JSON file, optionally removing any previous versions that shouldn't be available in future versions.
An example entry looks like this:
{
"name": "[MODEL_NAME]",
"url": "https://github.com/instanseg/instanseg/releases/download/[RELEASE_NAME]/[MODEL_NAME].zip",
"version": "0.1.0",
"license": "Apache-2.0"
}