Feature/improve metric calculations #45
Conversation
… on each individual slice
I didn't get to testing this yet, but I did look through the code. See my minor comments below.
"train_metrics": ["dice_score"], | ||
"train_metric_confidence_levels": [0.25, 0.5, 0.75], | ||
"test_metrics": ["dice_score", "sensitivity", "specificity", "hausdorff95"], | ||
"test_metric_confidence_levels": [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9] |
All these should be documented in the README
metrics: Iterable[str],
confidence_levels: Iterable[float],
image_ids: Iterable[str],
slices_per_image: int,
This will be problematic for the Medical Segmentation Decathlon data. There, images for the same segmentation task can have different numbers of slices.
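One option would be to pass a per-image mapping instead of a single int. A sketch, with placeholder names since the enclosing function is not shown in the diff:

```python
from typing import Dict, Iterable

def track_metrics(  # placeholder name; the actual enclosing function is not shown here
    metrics: Iterable[str],
    confidence_levels: Iterable[float],
    image_ids: Iterable[str],
    slices_per_image: Dict[str, int],  # per-image slice counts instead of a single int
) -> None:
    for image_id in image_ids:
        num_slices = slices_per_image[image_id]  # may differ between images
        ...
```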
src/datasets/dataset_hooks.py
""" | ||
|
||
@abstractmethod | ||
def slices_per_image(self, **kwargs): |
Missing return type annotation.
See my other comment about this being a single int.
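With the per-image mapping suggested in my other comment, the annotated signature could look like this (a sketch; the class name is assumed from src/datasets/dataset_hooks.py, and the Dict return type is my suggestion rather than the current code):

```python
from abc import ABC, abstractmethod
from typing import Dict

class DatasetHooks(ABC):  # class name assumed from the file name
    @abstractmethod
    def slices_per_image(self, **kwargs) -> Dict[str, int]:
        """Returns the number of slices for each image, keyed by image ID."""
```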
@@ -109,15 +112,20 @@ def run_active_learning_pipeline(
    else:
        raise ValueError("Invalid data_module name.")

    if checkpoint_dir is not None:
        checkpoint_dir = os.path.join(checkpoint_dir, f"{wandb_logger.experiment.id}")
We should use a similar naming scheme for the prediction directory.
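Something like this would mirror the checkpoint directory handling (`prediction_dir` is an assumed variable name, not taken from the PR):

```python
# Mirrors the checkpoint_dir handling above; "prediction_dir" is an assumed name.
if prediction_dir is not None:
    prediction_dir = os.path.join(prediction_dir, f"{wandb_logger.experiment.id}")
```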
Everything looks good to me now. Great job!
Closes #32, #37, #38
May also contribute to #25, #43, #44
This PR implements the following features:
- `train_metrics`, `train_metric_confidence_levels`, `test_metrics`, and `test_metric_confidence_levels` were added to specify which metrics should be tracked for each training step and which should only be tracked for the best model
- `model_selection_criterion` was introduced to specify which metric should be used to select the best model from each training run

With the changes from this PR, a training epoch on the BraTS 2018 dataset takes approximately three minutes on an A100 GPU, and the model evaluation at the end of training takes approximately five minutes. During each training epoch, the Dice score is computed for three confidence levels (0.25, 0.5, 0.75) on both the training and the validation set. During model evaluation, the Dice score, sensitivity, specificity, and the Hausdorff distance are computed for nine confidence levels (0.1 - 0.9) on the validation set using the best model.
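For reference, here is how the new options could look combined in one configuration (a sketch in Python dict form; the four metric settings are taken from the config diff above, while the `model_selection_criterion` value is only an assumed example):

```python
# Sketch of the new configuration options combined. The four metric settings
# are taken from the config diff in this PR; the model_selection_criterion
# value is only an assumed example.
config = {
    "train_metrics": ["dice_score"],
    "train_metric_confidence_levels": [0.25, 0.5, 0.75],
    "test_metrics": ["dice_score", "sensitivity", "specificity", "hausdorff95"],
    "test_metric_confidence_levels": [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9],
    "model_selection_criterion": "dice_score",  # assumed example value
}
```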
To ensure that the metric tracking is working in all cases, I would like to extend the test cases in a follow-up PR.
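For example, a minimal sketch of the kind of test I have in mind (the `dice_score` function here is a self-contained stand-in, not the implementation from this repository):

```python
import torch

def dice_score(prediction: torch.Tensor, target: torch.Tensor, confidence_level: float) -> float:
    # Binarize the soft prediction at the given confidence level.
    binarized = (prediction >= confidence_level).float()
    intersection = (binarized * target).sum()
    return (2 * intersection / (binarized.sum() + target.sum())).item()

def test_dice_score_is_one_for_perfect_prediction():
    target = torch.tensor([[0.0, 1.0], [1.0, 0.0]])
    # The score should be perfect at every configured confidence level.
    for confidence_level in [0.25, 0.5, 0.75]:
        assert dice_score(target, target, confidence_level) == 1.0
```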