
Prithvi ViT behaviour when input size not divisible by patch size #172

Open
CarlosGomes98 opened this issue Sep 25, 2024 · 2 comments

@CarlosGomes98 (Contributor)

Describe the issue
With prithvi_vit, when the input's spatial dimensions are not divisible by the patch size, part of the input is silently ignored.
To Reproduce (optional, but appreciated)
Steps to reproduce the behavior:

  1. Create a prithvi vit model
  2. Pass to it an input of size not divisible by the patch size
  3. No error is thrown

Expected behavior (optional)
The input should either be padded to a size divisible by the patch size, or an error should be thrown.
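For the padding option, a minimal sketch of what that could look like (not tied to terratorch's actual internals; the helper name and the call to `torch.nn.functional.pad` are illustrative assumptions):

```python
import torch
import torch.nn.functional as F

def pad_to_patch_multiple(x: torch.Tensor, patch_size: int = 16) -> torch.Tensor:
    """Zero-pad the last two (spatial) dims up to the next multiple of patch_size.

    Hypothetical helper, not part of terratorch; shown only to illustrate
    the proposed 'pad' behaviour.
    """
    h, w = x.shape[-2:]
    pad_h = (patch_size - h % patch_size) % patch_size
    pad_w = (patch_size - w % patch_size) % patch_size
    # F.pad pads trailing dims first: (w_left, w_right, h_top, h_bottom)
    return F.pad(x, (0, pad_w, 0, pad_h))

# Shape taken from the error trace later in this thread: (b, c, t, h, w)
x = torch.randn(1, 6, 4, 220, 230)
padded = pad_to_patch_multiple(x, patch_size=16)
print(padded.shape)  # torch.Size([1, 6, 4, 224, 240])
```

Padding at the bottom/right keeps the original pixels in place, so `padded[..., :220, :230]` is exactly the original input.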

@Joao-L-S-Almeida Joao-L-S-Almeida self-assigned this Oct 14, 2024
@Joao-L-S-Almeida (Member)

That's strange. When running the test tests/test_backbones.py::test_vit_models_non_divisible_input (from the branch associated with this issue), I got:

>           raise EinopsError(message + "\n {}".format(e))
E           einops.EinopsError:  Error while processing rearrange-reduction pattern "b c (t tub) (h p) (w q) -> b (t h w) (tub p q c)".
E            Input tensor shape: torch.Size([1, 6, 4, 220, 230]). Additional info: {'tub': 1, 'p': 16, 'q': 16}.
E            Shape mismatch, can't divide axis of length 220 in chunks of 16

Isn't that the expected behaviour?

@singam96 (Contributor) commented Nov 2, 2024

Please check this PR #218

@Joao-L-S-Almeida Joao-L-S-Almeida linked a pull request Nov 5, 2024 that will close this issue