Multi-size videos in data pipelines #440

talmo · 2020-12-22T00:30:34Z

Add support in the data pipeline for datasets whose videos have different image shapes.

Addresses the longstanding #209.

talmo · 2021-01-04T14:16:46Z

The approach here should be:

1. Determine the target height/width (max over all videos, or user-specified).
2. Resize the longest side to the target height/width.
3. Pad (bottom/right) the short side to the target width/height.
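The three steps above can be sketched as plain shape arithmetic. This is a minimal illustration, not the code merged in this PR: the function names `target_size` and `resize_and_pad_plan` are hypothetical, and it assumes "resize longest side" means scaling with a preserved aspect ratio until the image just fits inside the target.

```python
from typing import List, Optional, Tuple

Shape = Tuple[int, int]  # (height, width)


def target_size(shapes: List[Shape], user_size: Optional[Shape] = None) -> Shape:
    """Step 1: use the user-specified size, else the max height/width over all videos."""
    if user_size is not None:
        return user_size
    return max(h for h, _ in shapes), max(w for _, w in shapes)


def resize_and_pad_plan(shape: Shape, target: Shape) -> Tuple[float, Shape, Shape]:
    """Steps 2 and 3: return (scale, resized shape, (pad_bottom, pad_right)).

    Assumption: the aspect ratio is preserved, so the scale is limited by
    whichever side reaches the target first.
    """
    h, w = shape
    th, tw = target
    scale = min(th / h, tw / w)
    # Clamp to guard against rounding one pixel past the target.
    new_h = min(th, round(h * scale))
    new_w = min(tw, round(w * scale))
    # Padding only on the bottom/right keeps the image origin at (0, 0).
    return scale, (new_h, new_w), (th - new_h, tw - new_w)


# Hypothetical dataset with two video shapes.
shapes = [(384, 384), (1024, 1280)]
target = target_size(shapes)
for shape in shapes:
    print(shape, "->", resize_and_pad_plan(shape, target))
```

Because the padding is added only on the bottom and right, ground-truth point coordinates need to be multiplied by the same `scale` but do not need an offset.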

Other notes:

The target size and the resize/pad settings will need to be saved in the config so these steps can be reproduced at inference time. This may not be strictly necessary since we can retrace for different sizes as long as each batch is from the same video, but we should test.
This padding should be aware of/compatible with the padding done to account for model stride (sleap.nn.data.resizing.pad_to_stride/Resizer). Ideally they'd be done in the same step, but this gets complicated for multi-model approaches (topdown) that may require different padding.
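One way to see how the size-matching padding and the stride padding could be folded into a single step is sketched below. It assumes both paddings are applied on the bottom/right, and the helpers `round_up_to_stride` and `combined_padding` are hypothetical names for illustration, not part of `sleap.nn.data.resizing`.

```python
from typing import Tuple

Shape = Tuple[int, int]  # (height, width)


def round_up_to_stride(size: int, stride: int) -> int:
    """Smallest multiple of `stride` that is >= `size`."""
    return -(-size // stride) * stride


def combined_padding(resized: Shape, target: Shape, max_stride: int) -> Tuple[Shape, Shape]:
    """Return (final shape, (pad_bottom, pad_right)) for a single padding step.

    `resized` is the image shape after resizing to fit `target`.
    The final shape is the target rounded up to a multiple of `max_stride`,
    so one bottom/right pad covers both the size matching and the stride.
    """
    new_h, new_w = resized
    th, tw = target
    final_h = round_up_to_stride(th, max_stride)
    final_w = round_up_to_stride(tw, max_stride)
    return (final_h, final_w), (final_h - new_h, final_w - new_w)


# Hypothetical example: a 750x1250 resized frame, 1000x1250 target, stride 32.
print(combined_padding((750, 1250), (1000, 1250), max_stride=32))
```

In a multi-model setup such as topdown, each model could call this with its own `max_stride`, which is where the single-step approach gets complicated: the final shapes would then differ per model.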

* Add track indices to instance cropper
* Add class vector generator
* Split class vectors correctly in instance cropper
* Move head output layer construction to heads module
  - Heads now subclass a base Head class
  - Naming doesn't include _0 anymore since we don't have any multi-output models for now.
  - Better input validation in Model.from_config constructor
  - Add loss weight to all heads in config
  - Test coverage for heads and (minimally for) model
* Add topdown config, head and model
  - Rename multiclass to multiclass_bottomup
* Add trainer
* Data pipeline
* Apply black to 'sleap' and 'tests' (#465)
  Co-authored-by: Arie Matsliah <[email protected]>
* Fix model creation and add pooling param to head
* Symmetry-aware flip augmentation (#455)
* Implement symmetry-aware instance reflection
* Fix symmetries sometimes not being returned uniquely
* Add fancier indexing to instances
* Add random flipping transformer
* Fix failing linux test
  - Make sure indices are all cast to int32
* Add vertical flip
* Add flip augmentation to config, GUI and pipeline builders
* Update profiles with default fields
  Co-authored-by: ariematsliah-princeton <[email protected]>
* Multi-size videos in data pipelines (#440)
  Add support for variable size videos within the same dataset by matching their size with padding or resizing
  Co-authored-by: Arie Matsliah <[email protected]>
* Type check + Lint in CI (#470)
* Try lint and typecheck in CI workflow
* update
* nit
* continue on MyPy errors
* test
* correct
* correct
* correct
  Co-authored-by: Arie Matsliah <[email protected]>
* Rename predictors for consistency with inference layers
  - TopdownPredictor -> TopDownPredictor
  - BottomupPredictor -> BottomUpPredictor
* Create PULL_REQUEST_TEMPLATE.md
* Update authors list (#471)
  Co-authored-by: Arie Matsliah <[email protected]>
* Add CLA (#473)
* Add CLA
* update links
  Co-authored-by: Arie Matsliah <[email protected]>
* Update PULL_REQUEST_TEMPLATE.md
* Miscellaneous QOL (#467)
  Pre-1.1.0 update features (changelist in #467)
* Bump pre-release version
* Add back load_model that got lost in the merge
  - Add detection of bottomup and topdown multi-class model loading
* Fix more missing things post-merge
* Fix lint
* Fix training from config
* Add inference
* Tweak describe tensor to accept nested tuples/dicts
* Lint
* Fix test
* Lint
* Fix load video dataset arg
* Fix inference
* Fix evals
* Add BU MC to evals
* Remove batch norm from TD MC head
* Add option to disable batch norm in pretrained models
* Add track matching when merging labels
* Don't error when training finishes with no inference target

Co-authored-by: ariematsliah-princeton <[email protected]>
Co-authored-by: Arie Matsliah <[email protected]>

talmo changed the title ~~Add breaking test.~~ Multi-size videos in data pipelines Dec 22, 2020

arie-matsliah force-pushed the multi_size_videos branch from f9462b6 to 0702d25 Compare January 26, 2021 16:28

talmo and others added 15 commits February 2, 2021 11:03

Add breaking test.

aa17db0

wip

0a2f43d

wip

5ca2efb

wip

be85e69

tests

6b4ded2

fix normalization test

c82cb18

support smaller target size

0f24ac9

support smaller target size

e39c6fe

config

0f8dd42

util

58d6863

from config

5e9e6e6

pipelines

e9355bb

remove type

7d35c91

order

fecc751

black

60d269e

arie-matsliah force-pushed the multi_size_videos branch from ad55f9f to 60d269e Compare February 2, 2021 19:09

arie-matsliah added 4 commits February 3, 2021 08:59

is multi size predicate

ec008f5

is multi size predicate

8f65d87

black

efd4872

property

bba0401

talmo marked this pull request as ready for review February 3, 2021 20:15

talmo merged commit 5f95150 into develop Feb 3, 2021

talmo mentioned this pull request Feb 4, 2021

Datagen fails on videos with different dimensions #209

Closed

talmo deleted the multi_size_videos branch February 4, 2021 07:35
