
# WIP Allowing for grouped (random) transformations of tensors/tensor-like objects [proof-of-concept] #4267

Closed

## Conversation

@vmoens (Contributor) commented on Aug 10, 2021:

Implements a proof-of-concept solution to #1406 (+ #9). Some aspects are similar to #1315.
I propose a GroupTransform class that takes a tuple of inputs and applies the same transformation to each.
Between two queries of GroupTransform, the random transformations are 'frozen': their random parameters are stored as attributes so that they can be re-used as often as needed, and the same transform is applied to each input. This ensures that all the transformations match exactly. When GroupTransform is done, it resets the state of the transformations that were provided to it.

For example:

```python
import torch
from torchvision import transforms

t = transforms.GroupTransform(
    transforms.RandomOrder(
        [transforms.GaussianBlur(kernel_size=3, reset_auto=False),
         transforms.RandomCrop(size=10, reset_auto=False)],
        reset_auto=False,
    )
)
img = torch.arange(1024, dtype=torch.float).view(1, 32, 32).expand(3, 32, 32).contiguous()
mask = img[:1]
imgs = (img, mask)
imgs_out = t(imgs)
torch.testing.assert_close(imgs_out[0][0], imgs_out[1][0], rtol=1e-6, atol=1e-6, check_stride=False)
```

The `reset_auto=False` keyword of the transforms means that those objects won't be responsible for resetting their state after applying a transform; that responsibility is taken over by `transforms.GroupTransform` (when its own `reset_auto` is set to `True`).
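For illustration, here is a minimal self-contained sketch of that contract. The class and member names (`RandomShift`, `_params`, `reset`) are hypothetical and only meant to convey the mechanism, not the PR's exact code:

```python
import torch
import torch.nn as nn


class RandomShift(nn.Module):
    """Toy random transform following the reset_auto contract described above."""

    def __init__(self, reset_auto=True):
        super().__init__()
        self.reset_auto = reset_auto
        self._params = None  # frozen random parameters live here between calls

    def get_params(self):
        # Re-use the stored parameters if present; otherwise draw new ones.
        if self._params is None:
            self._params = torch.randint(0, 5, (1,)).item()
        return self._params

    def reset(self):
        self._params = None

    def forward(self, x):
        out = x + self.get_params()
        if self.reset_auto:
            # A standalone transform cleans up after itself; under
            # GroupTransform (reset_auto=False) the group resets it instead.
            self.reset()
        return out


t = RandomShift(reset_auto=False)
a, b = t(torch.zeros(3)), t(torch.zeros(3))
assert torch.equal(a, b)  # the same frozen parameter was applied to both calls
t.reset()  # what GroupTransform would do once the whole tuple is processed
```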

## Advantages

  • Low risk of BC issues, as far as I can tell.
  • Few, if any, code changes required from users in most cases; only those who need GroupTransform will have to care about `reset_auto`.

## Points of probable contention

  • I implemented a parent Transform class that inherits from nn.Module, and all transforms are now subclasses of it. This may not be ideal and is totally open to discussion.
  • Transforms are now 'stateful' and store the parameters of the transform between executions.
  • Many get_params methods that used to be static have lost that attribute (but the default behaviour, which is to use the stored parameters, can be overridden by passing other kwargs); a sketch of what this could look like follows below.
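To make the last point concrete, a non-static `get_params` could look roughly like this. The class is a hypothetical sketch (`RandomCropLike` is not the PR's actual implementation):

```python
import torch
import torch.nn as nn


class RandomCropLike(nn.Module):
    """Sketch: get_params is an instance method backed by stored state."""

    def __init__(self, size):
        super().__init__()
        self.size = size
        self._params = None

    def get_params(self, top=None, left=None):
        # Passing kwargs overrides the default behaviour, which is to
        # return the stored (frozen) random parameters.
        if top is not None and left is not None:
            return top, left
        if self._params is None:
            self._params = tuple(torch.randint(0, self.size, (2,)).tolist())
        return self._params


crop = RandomCropLike(size=10)
crop.get_params()      # frozen random parameters, stable across calls
crop.get_params(2, 3)  # explicit override: (2, 3)
```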

## TODOs

  1. Custom functions, such as those proposed in [RFC] Abstractions for segmentation / detection transforms #1406 for tensor-like objects, should still be implemented. They will be the predominant way of ensuring that the transforms are applied in a meaningful manner to each input depending on its type (e.g. interpolation mode for images vs. boxes); see the sketch after this list.
  2. This code relies heavily on the reset_auto attribute, which ensures that random transformations are applied identically to each input. One could set this attribute after creating the object via a .reset_auto_(mode=True) method, but for now I don't feel this is a priority, though it might make things easier to code in the future.
  3. For now, compatibility with torchscript has been overlooked.
  4. I have not changed the helpers yet, as I would prioritise the discussion about the feasibility of this solution.
  5. More tests may be necessary.
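Regarding the first TODO, the type-based dispatch could look roughly like the sketch below. The `Mask` marker type and the `resize` helper are hypothetical illustrations, not the API proposed in #1406:

```python
import torch
import torch.nn.functional as F


class Mask(torch.Tensor):
    """Hypothetical tensor subclass marking segmentation masks."""


def resize(inpt, size):
    # The same logical transform picks type-appropriate settings:
    # masks must keep discrete label values, images can be interpolated smoothly.
    mode = "nearest" if isinstance(inpt, Mask) else "bilinear"
    return F.interpolate(inpt.unsqueeze(0), size=size, mode=mode).squeeze(0)


img = torch.rand(3, 32, 32)
mask = torch.randint(0, 2, (1, 32, 32)).float().as_subclass(Mask)
out_img = resize(img, (16, 16))    # bilinear interpolation
out_mask = resize(mask, (16, 16))  # nearest-neighbour, labels preserved
```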

@fmassa (Member) left a comment:

Thanks for the interesting proposal!

This RFC seems like a good way of approaching this problem, and I had similar thoughts to what you proposed here.

Before we dive into the specific implementation details (which includes BC considerations, torchscript, etc), I think it would be good if you could have a look into how different libraries solved this problem in the past, so that we can benefit from past experiences.

Some libraries that come to mind (non-exhaustive):

Inline review comment from a Member on the following diff context:

```python
    return output


class Compose(Transform):
```

We've had a lot of discussions with @vfdev-5 about making Compose inherit from nn.Module, and in the end we decided against it, because enforcing torchscriptability would break a lot of people's code. IIRC the problem was that patterns where the user passes an arbitrary callable (which is not an nn.Module) wouldn't work anymore, and this is something that is done often.
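For context, this is the kind of (currently legal) pattern that mandatory nn.Module children would break; `pipeline` is just an illustrative name:

```python
from torchvision import transforms

# An arbitrary callable mixed into Compose: accepted today, but it is not
# an nn.Module, so a scriptability requirement would reject such pipelines.
pipeline = transforms.Compose([
    transforms.ToTensor(),
    lambda x: x.clamp(0.0, 1.0),  # plain function, not an nn.Module
])
```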

Comment on lines +104 to +105:

```python
"All transforms should be of type torchvision.transforms.Transform. "
"Custom typed transforms will be forbidden in future releases."
```
Member:

I'm not sure this is something that we can actually commit to forbidding. cc @vfdev-5, who has done some research on how people were using transforms during his torchscript-compatibility work.

Collaborator:

Here is a PR where I tried to make Compose scriptable: #2645. The open discussions there show the blockers we had.

@datumbox (Contributor) commented:

Is this still WIP, or has it been superseded by the work at pmeier/torchvision-datasets-rework#1?

@vmoens (Contributor, Author) commented on Sep 27, 2021:

> Is this still WIP, or has it been superseded by the work at pmeier/torchvision-datasets-rework#1?

Let's suspend this for now!

@vmoens closed this on Sep 27, 2021.