Fix halo implementation and tiling artefact #113

Merged · 6 commits · Apr 15, 2024
Conversation

@qin-yu (Collaborator) commented on Apr 10, 2024

Fix halo implementation and tiling artefact

I've revisited the tiling-artefact issue and found that the current halo implementation is incorrect. The halo of a patch should consist of the margins surrounding that patch in the raw volume; padding each patch with a mirror reflection of itself is what causes the tiling artefact.

Before the fix, with a patch size of 96x96x96, a stride of 96x96x96, and a halo of 32x64x64, the prediction exhibited a clear tiling artefact. With the correct halo implementation and the same configuration, the prediction improves significantly. Note that with these settings neighbouring patches do not overlap at all, yet the new implementation shows almost no artefact.
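
As a minimal sketch of the difference (illustrative NumPy only, not the actual pytorch-3dunet code): with mirror padding the context around a patch is fabricated from the patch itself, whereas with a halo the context is read from the neighbouring voxels of the raw volume.

import numpy as np

volume = np.random.rand(192, 192, 192)      # raw volume
start, stop = (0, 0, 0), (96, 96, 96)       # one patch; stride == patch, so no overlap
halo = (32, 64, 64)

# Old behaviour: the margin is a mirror reflection of the patch itself.
patch = volume[tuple(slice(b, e) for b, e in zip(start, stop))]
mirror_padded = np.pad(patch, [(h, h) for h in halo], mode='reflect')

# New behaviour: pad the whole volume once (only the outer border is reflected),
# then take the patch together with its surrounding margin from the raw data.
padded_volume = np.pad(volume, [(h, h) for h in halo], mode='reflect')
halo_padded = padded_volume[tuple(slice(b, e + 2 * h) for b, e, h in zip(start, stop, halo))]

assert mirror_padded.shape == halo_padded.shape == (160, 224, 224)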

Comparison

Left: Mirror pad; Right: Halo pad

[image: side-by-side prediction comparison]

Design Choices

  • Padding with the halo happens in the Dataset's __getitem__ method, while removal of the padding happens in the Predictor. Since the Predictor receives Datasets wrapped in loaders, I moved the halo config into SliceBuilder under the loader config, so the Predictor can read the halo directly from its input Datasets (see the sketch after this list).
  • The padding and unpadding functions have been refactored and moved into the dataset utils module (pytorch3dunet/datasets/utils.py).
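
A rough sketch of that flow (simplified, with hypothetical helper names rather than the actual pytorch-3dunet classes): the Dataset slices a halo-padded patch out of the padded volume, and the Predictor crops the halo off the network output before writing it back at the original patch location.

def get_halo_padded_patch(padded_volume, raw_idx, halo_shape):
    # raw_idx addresses the original volume; in the padded volume the same
    # window extended by the halo on both sides is [start, stop + 2 * halo)
    idx = tuple(slice(i.start, i.stop + 2 * h) for i, h in zip(raw_idx, halo_shape))
    return padded_volume[idx]

def remove_halo_from_prediction(prediction, halo_shape):
    # crop the halo so the output matches the original patch shape again
    # (assumes a 3D prediction without a channel dimension)
    return prediction[tuple(slice(h, s - h) for h, s in zip(halo_shape, prediction.shape))]

# usage inside a predictor loop (sketch):
#   prediction = model(get_halo_padded_patch(padded_raw, raw_idx, halo))
#   output[raw_idx] = remove_halo_from_prediction(prediction, halo)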

Old and New Config Files

Old config file for inference (Predictor taking halo, patch mirror-padded)
# path to the checkpoint file containing the model
model_path: /.../plantseg_original_1135_rest_rotate2d_fmaps16_max/best_checkpoint.pytorch
# model configuration
model:
  # model class
  name: UNet3D
  # number of input channels to the model
  in_channels: 1
  # number of output channels
  out_channels: 2
  # determines the order of operators in a single layer (gcr - GroupNorm+Conv3d+ReLU)
  layer_order: gcr
  # feature maps scale factor
  f_maps: 16
  # number of groups in the groupnorm
  num_groups: 8
  # apply element-wise nn.Sigmoid after the final 1x1 convolution, otherwise apply nn.Softmax
  final_sigmoid: true
  # if True applies the final normalization layer (sigmoid or softmax), otherwise the network returns the output of the final convolution layer; use False for regression problems, e.g. de-noising
  is_segmentation: true
# predictor configuration
predictor:
  # standard in memory predictor
  name: 'StandardPredictor'
  # halo around each input patch, created with mirror reflection
  patch_halo: [32, 64, 64]
# specify the test datasets
loaders:
  # batch dimension; if number of GPUs is N > 1, then a batch_size of N * batch_size will automatically be taken for DataParallel
  batch_size: 1
  # mirror pad the raw data in each axis for sharper prediction near the boundaries of the volume
  mirror_padding: [32, 32, 32]
  # path to the raw data within the H5
  raw_internal_path: raw/noisy
  # how many subprocesses to use for data loading
  num_workers: 8
  # test loaders configuration
  test:
    # paths to the test datasets; if a given path is a directory all H5 files ('*.h5', '*.hdf', '*.hdf5', '*.hd5')
    # inside this directory will be included as well (non-recursively)
    file_paths:
      - /.../1135.h5

    # SliceBuilder configuration, i.e. how to iterate over the input volume patch-by-patch
    slice_builder:
      # SliceBuilder class
      name: SliceBuilder
      # train patch size given to the network (adapt to fit in your GPU mem, generally the bigger patch the better)
      patch_shape: [96, 96, 96]
      # train stride between patches
      stride_shape: [96, 96, 96]

    transformer:
      raw:
        - name: Standardize
        - name: ToTensor
          expand_dims: true
New config file for inference (SliceBuilder taking halo, patch halo-padded)
# path to the checkpoint file containing the model
model_path: /.../plantseg_original_1135_rest_rotate2d_fmaps16_max/best_checkpoint.pytorch
# model configuration
model:
  # model class
  name: UNet3D
  # number of input channels to the model
  in_channels: 1
  # number of output channels
  out_channels: 2
  # determines the order of operators in a single layer (gcr - GroupNorm+Conv3d+ReLU)
  layer_order: gcr
  # feature maps scale factor
  f_maps: 16
  # number of groups in the groupnorm
  num_groups: 8
  # apply element-wise nn.Sigmoid after the final 1x1 convolution, otherwise apply nn.Softmax
  final_sigmoid: true
  # if True applies the final normalization layer (sigmoid or softmax), otherwise the network returns the output of the final convolution layer; use False for regression problems, e.g. de-noising
  is_segmentation: true
# predictor configuration
predictor:
  # standard in memory predictor
  name: 'StandardPredictor'
# specify the test datasets
loaders:
  # batch dimension; if number of GPUs is N > 1, then a batch_size of N * batch_size will automatically be taken for DataParallel
  batch_size: 1
  # mirror pad the raw data in each axis for sharper prediction near the boundaries of the volume
  mirror_padding: [0, 0, 0]
  # path to the raw data within the H5
  raw_internal_path: raw/noisy
  # how many subprocesses to use for data loading
  num_workers: 8
  # test loaders configuration
  test:
    # paths to the test datasets; if a given path is a directory all H5 files ('*.h5', '*.hdf', '*.hdf5', '*.hd5')
    # inside this directory will be included as well (non-recursively)
    file_paths:
      - /.../1135.h5

    # SliceBuilder configuration, i.e. how to iterate over the input volume patch-by-patch
    slice_builder:
      # SliceBuilder class
      name: SliceBuilder
      # train patch size given to the network (adapt to fit in your GPU mem, generally the bigger patch the better)
      patch_shape: [96, 96, 96]
      # train stride between patches
      stride_shape: [96, 96, 96]
      # raw image halo around each patch
      halo_shape: [32, 64, 64]

    transformer:
      raw:
        - name: Standardize
        - name: ToTensor
          expand_dims: true
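
Note that with patch_shape: [96, 96, 96] and halo_shape: [32, 64, 64], the tensor actually fed to the network has shape [96 + 2·32, 96 + 2·64, 96 + 2·64] = [160, 224, 224], since the halo is added on both sides of every axis, so the patch/halo combination must still fit into GPU memory.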

Required changes in config files:

[image: config diff]

@qin-yu requested a review from wolny on April 10, 2024 at 21:19
@wolny (Owner) commented on Apr 11, 2024

This is great @qin-yu! Thanks a lot for the PR. It looks good to me at first glance. I'll do a proper review tomorrow, merge it, and release the new version.

@wolny (Owner) left a comment

Overall looks good; I've left a couple of comments/questions.

Review threads (resolved):
  • pytorch3dunet/datasets/hdf5.py
  • pytorch3dunet/datasets/utils.py
@qin-yu (Collaborator, Author) commented on Apr 12, 2024

Here I illustrate why the raw volume is padded on all sides, while slicing each patch only extends its end index by 2 × halo:

# pad the whole raw volume by halo_shape on every side (mirroring only at the volume border)
self.raw_padded = mirror_pad(self.raw, self.halo_shape)
# in padded coordinates, an original window [start, stop) plus its halo becomes [start, stop + 2 * halo)
raw_idx_padded = tuple(
    slice(this_index.start, this_index.stop + 2 * this_halo, None)
    for this_index, this_halo in zip(raw_idx, self.halo_shape)
)

On the left I show how the padded inputs are predicted and mapped back to the original shape; on the right I show how the patches should be sliced correctly:

[drawing: patch slicing in padded vs. original coordinates]

Note that the image in #113 (comment) was produced with patch shape = stride shape, i.e. the tiles can only line up if the slicing is done correctly.
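
To make the index arithmetic concrete (illustrative numbers along a single axis, not taken from the PR): padding the raw volume by halo shifts every original coordinate x to x + halo in the padded array, so the window [start − halo, stop + halo) of the original volume is exactly [start, stop + 2·halo) in the padded one, which is what raw_idx_padded computes.

halo = 64
start, stop = 96, 192                  # one patch along one axis, original coordinates
wanted = (start - halo, stop + halo)   # (32, 256): the patch plus its halo in the original volume
padded = (start, stop + 2 * halo)      # (96, 320): the same window in the padded volume
assert padded[0] == wanted[0] + halo and padded[1] == wanted[1] + halo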

@qin-yu (Collaborator, Author) commented on Apr 12, 2024

  • Make sure only the test phase uses halo-padded patches in AbstractHDF5Dataset
  • Make sure the new methods have Google-style docstrings
  • Add a test for the slicer (a rough sketch of what such a test could check follows this list)
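
A minimal sketch of what such a test could assert (hypothetical test name, not the actual test in the repository): every slice produced for the padded volume must be exactly patch + 2·halo long along each axis.

def test_halo_padded_slices_have_expected_length():
    patch, halo = (96, 96, 96), (32, 64, 64)
    raw_idx = tuple(slice(0, p) for p in patch)
    padded_idx = tuple(slice(i.start, i.stop + 2 * h) for i, h in zip(raw_idx, halo))
    # each padded slice must be exactly patch + 2 * halo voxels long
    for s, p, h in zip(padded_idx, patch, halo):
        assert s.stop - s.start == p + 2 * h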

@qin-yu requested a review from wolny on April 12, 2024 at 23:46

@wolny (Owner) left a comment

LGTM! Thanks for the PR

Labels: bug, enhancement