
Always pass transform and target_transform to abstract dataset #1126

Merged: 7 commits into pytorch:master on Jul 19, 2019

Conversation

@pmeier (Collaborator) commented Jul 16, 2019

Kicked off by @zhiqwang (comment). Now transforms, transform, and target_transform are passed to VisionDataset.__init__() for all datasets, as intended.
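
For illustration, a minimal sketch of the pattern this PR applies (FooDataset and its loading logic are made up for the example; only the super().__init__() call reflects the actual change):

from torchvision.datasets.vision import VisionDataset


class FooDataset(VisionDataset):  # hypothetical dataset, for illustration only
    def __init__(self, root, transform=None, target_transform=None):
        # forward the transforms to the abstract base class instead of
        # assigning self.transform / self.target_transform by hand
        super(FooDataset, self).__init__(root, transform=transform,
                                         target_transform=target_transform)
        self.samples = []  # placeholder for whatever loading logic the dataset needs

    def __getitem__(self, index):
        img, target = self.samples[index]
        if self.transform is not None:
            img = self.transform(img)
        if self.target_transform is not None:
            target = self.target_transform(target)
        return img, target

    def __len__(self):
        return len(self.samples)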

@fmassa Should the usage of separate transforms be deprecated? I'm asking since SBDataset only has a joint transforms parameter:

def __init__(self,
             root,
             image_set='train',
             mode='boundaries',
             download=False,
             transforms=None):

I'll fix the docstrings afterwards.

@fmassa (Member) commented Jul 16, 2019

Thanks for the PR! I'll have it reviewed tomorrow.

Just to answer your question:

@fmassa Should the usage of separate transforms be deprecated? I'm asking since SBDataset only has a joint transforms parameter:

I think we should still keep the separate transforms for now, as in most cases it's simpler for the user.

@pmeier (Collaborator, Author) commented Jul 16, 2019

I think we should still keep the separate transforms for now, as in most cases it's simpler for the user.

Should I add it to SBDataset?

@codecov-io commented Jul 16, 2019

Codecov Report

Merging #1126 into master will increase coverage by 0.34%.
The diff coverage is 56.66%.


@@            Coverage Diff             @@
##           master    #1126      +/-   ##
==========================================
+ Coverage   64.89%   65.24%   +0.34%     
==========================================
  Files          68       68              
  Lines        5413     5384      -29     
  Branches      835      835              
==========================================
  Hits         3513     3513              
+ Misses       1643     1615      -28     
+ Partials      257      256       -1
Impacted Files Coverage Δ
torchvision/datasets/flickr.py 24.67% <0%> (+1.21%) ⬆️
torchvision/datasets/phototour.py 24.24% <0%> (+0.24%) ⬆️
torchvision/datasets/cifar.py 78.16% <100%> (-0.5%) ⬇️
torchvision/datasets/folder.py 82.05% <100%> (-0.45%) ⬇️
torchvision/datasets/mnist.py 50.71% <100%> (-0.47%) ⬇️
torchvision/datasets/svhn.py 66% <100%> (-1.31%) ⬇️
torchvision/datasets/lsun.py 21.59% <33.33%> (+0.93%) ⬆️
torchvision/datasets/caltech.py 21.59% <50%> (+0.93%) ⬆️
torchvision/datasets/sbu.py 23.72% <50%> (+0.77%) ⬆️
torchvision/datasets/omniglot.py 33.33% <50%> (+1.33%) ⬆️
... and 9 more

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8837e0e...e1d776c.

@zhiqwang (Contributor) commented

In the references directory, the CocoDetection dataset also has this problem:

def __init__(self, img_folder, ann_file, transforms):

Is it necessary to pass transforms (here _transforms) to the constructor?

@fmassa (Member) commented Jul 17, 2019

@zhiqwang that is a custom dataset wrapper, and I deliberately did not pass it to the constructor, because I want the transforms to be applied after modifying the annotations to include the img_id, see

target = dict(image_id=image_id, annotations=target)

Ideally I'd remove that and let the image id be returned by the dataloader, but that is currently not possible.
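
Roughly, the wrapper works like this (a simplified sketch of the idea, not necessarily the exact code in references/detection):

import torchvision


class CocoDetection(torchvision.datasets.CocoDetection):
    def __init__(self, img_folder, ann_file, transforms):
        super(CocoDetection, self).__init__(img_folder, ann_file)
        # kept out of the base constructor on purpose: the joint transforms
        # are applied below, after the target has been rewritten
        self._transforms = transforms

    def __getitem__(self, idx):
        img, target = super(CocoDetection, self).__getitem__(idx)
        image_id = self.ids[idx]
        target = dict(image_id=image_id, annotations=target)
        if self._transforms is not None:
            img, target = self._transforms(img, target)
        return img, target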

@pmeier

Should I add it to SBDataset?

It generally makes more sense to have only joint transforms because it is a segmentation dataset, and if you perform a transformation on the image you generally need to apply the same one to the target.
I would not be against adding it (for consistency), but I think it won't generally be that useful for this dataset.
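
To make that concrete, a hypothetical joint transform that keeps image and segmentation target in sync (assuming the target is a PIL image, i.e. mode='segmentation'):

import random

from PIL import Image


class JointRandomHorizontalFlip(object):
    # hypothetical joint transform: one random decision applied to both
    # the image and the target, so the mask stays aligned with the image
    def __init__(self, p=0.5):
        self.p = p

    def __call__(self, image, target):
        if random.random() < self.p:
            image = image.transpose(Image.FLIP_LEFT_RIGHT)
            target = target.transpose(Image.FLIP_LEFT_RIGHT)
        return image, target

Something like this would be passed as transforms=JointRandomHorizontalFlip(); separate transform and target_transform callables could not share the random decision.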

@zhiqwang (Contributor) commented

@fmassa Thanks for your explanation; it helped me understand the detection datasets a lot.

@fmassa (Member) left a comment

Thanks for the PR, @pmeier!

I think that we should keep the changes that pass transform and target_transform to the constructor of VisionDataset, but I'm less sure of the benefits of adding transforms to all datasets, because that is generally not needed, and I'm not clear on which use case this would come in handy for.

Could you remove the added transforms in the constructor, and just pass the transform and target_transform?

Review thread on torchvision/datasets/caltech.py (outdated, resolved)
@pmeier changed the title from "Always pass transform(s) to abstract dataset" to "Always pass transform and target_transform to abstract dataset" on Jul 18, 2019
@fmassa (Member) left a comment

One more comment and then this is good to go, thanks a lot!

Review thread on torchvision/datasets/caltech.py (outdated, resolved)
@fmassa (Member) left a comment

LGTM, thanks a lot for your contribution!

Just waiting for CI to finish so that I can merge this

@fmassa fmassa merged commit 2b81ad8 into pytorch:master Jul 19, 2019
@pmeier pmeier deleted the dataset_transforms branch July 22, 2019 09:26