
Merge load / resize / cache to optimize data loading efficiency for classification #2438

Merged 15 commits into openvinotoolkit:develop on Aug 22, 2023

Conversation

@goodsong81 (Contributor) commented Aug 18, 2023

Summary

[Changes]

  • Implement base LoadResizeDataFromOTXDataset to merge & wrap
    • LoadImageFromOTXDataset
    • (Optional) LoadAnnotationFromOTXDataset
    • (Optional) Resize
  • Enable the in-memory cache to store images after resize, for memory efficiency
  • Implement ResizeTo, which runs Resize only if the expected size differs from the current size
  • Apply to classification (other tasks will be handled in upcoming PRs)
  • Make it work with configurable input size

[Results]

| Branch | Size | Cache | Data time | Iter time | E2E time |
|---|---|---|---|---|---|
| develop | 224x224 | off | 0.0997 | 0.1670 | 15.94 |
| develop | 224x224 | 35M/2G | 0.0304 | 0.0970 | 13.29 |
| develop | 64x64 | off | 0.0897 | 0.1349 | 13.21 |
| develop | 64x64 | 35M/2G | 0.0164 | 0.0624 | 10.61 |
| feat/load-resize-cache | 224x224 | off | 0.1044 | 0.1745 | 13.07 |
| feat/load-resize-cache | 224x224 | 18M/2G | 0.0304 | 0.0983 | 11.06 |
| feat/load-resize-cache | 64x64 | off | 0.0881 | 0.1348 | 12.28 |
| feat/load-resize-cache | 64x64 | 9M/2G | 0.0176 | 0.0636 | 10.03 |
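To read the cached 64x64 rows: because the feature branch caches already-resized images, the cache footprint drops from 35M to 9M while end-to-end time still improves. A quick sanity calculation (plain Python; the numbers are copied from the table above, the dict names are just for this snippet):

```python
# Cached 64x64 rows from the results table.
develop_cached = {"data_time": 0.0164, "e2e": 10.61, "cache_mb": 35}
feature_cached = {"data_time": 0.0176, "e2e": 10.03, "cache_mb": 9}

# Caching post-resize images shrinks the cache footprint by roughly 74%.
cache_reduction = 1 - feature_cached["cache_mb"] / develop_cached["cache_mb"]

# End-to-end time improves by about 6% on top of that.
e2e_speedup = develop_cached["e2e"] / feature_cached["e2e"]

print(f"cache footprint reduced by {cache_reduction:.0%}, "
      f"E2E speedup {e2e_speedup:.2f}x")
```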

How to test

[Example CMD]

```shell
otx train src/otx/algorithms/classification/configs/efficientnet_b0_cls_incr/template.yaml \
  --train-data-roots ./data/CUB_200_2011_64/train \
  --val-data-roots ./data/CUB_200_2011_64/val \
  --workspace outputs/load-resize-cls//effb0_CUB_200_2011_64_64x64_cache2g \
  params \
  --learning_parameters.input_size 64x64 \
  --algo_backend.mem_cache_size 2000000000
```
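The raw `--algo_backend.mem_cache_size 2000000000` value appears to be a byte count matching the 2G cache budget in the results table. A small conversion check (the `bytes_to_gib` helper is hypothetical, not part of OTX):

```python
def bytes_to_gib(n_bytes: int) -> float:
    """Convert a byte count to binary gigabytes (GiB)."""
    return n_bytes / 2**30

# The value passed to --algo_backend.mem_cache_size in the example command.
budget = 2_000_000_000
print(f"{budget} bytes = {budget / 1e9:.1f} GB = {bytes_to_gib(budget):.2f} GiB")
```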

Checklist

  • I have added unit tests to cover my changes.
  • I have added integration tests to cover my changes.
  • I have added e2e tests for validation.
  • I have added the description of my changes into CHANGELOG in my target branch (e.g., CHANGELOG in develop).
  • I have updated the documentation in my target branch accordingly (e.g., documentation in develop).
  • I have linked related issues.

License

  • I submit my code changes under the same Apache License that covers the project.
    Feel free to contact the maintainers if that's a concern.
  • I have updated the license header for each file (see an example below).
# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

@goodsong81 goodsong81 added the ENHANCE Enhancement of existing features label Aug 18, 2023
@github-actions github-actions bot added ALGO Any changes in OTX Algo Tasks implementation TEST Any changes in tests labels Aug 18, 2023
Signed-off-by: Songki Choi <[email protected]>
@goodsong81 goodsong81 marked this pull request as ready for review August 18, 2023 06:38
@goodsong81 goodsong81 requested a review from a team as a code owner August 18, 2023 06:38
@goodsong81 goodsong81 requested a review from eunwoosh August 18, 2023 06:39
eunwoosh previously approved these changes Aug 18, 2023
@eunwoosh (Contributor) left a comment
Thanks for your work! LGTM, but I have some questions; please take a look.

jaegukhyun previously approved these changes Aug 21, 2023
Signed-off-by: Songki Choi <[email protected]>
eunwoosh previously approved these changes Aug 22, 2023
Signed-off-by: Songki Choi <[email protected]>
@goodsong81 goodsong81 requested a review from eunwoosh August 22, 2023 07:26
@goodsong81 goodsong81 merged commit 59acdcd into openvinotoolkit:develop Aug 22, 2023
@goodsong81 goodsong81 deleted the feat/load-resize-cache branch August 22, 2023 07:39