Bug and Fix: Memory Leakage #41

jhliu17 · 2020-12-08T12:12:10Z

Bug Details:
I tried to extract RoI features from my own dataset (a large one, with more than 50k images). extract_features.py keeps allocating memory without releasing it and eventually exceeds the memory limit. The main cause is Ray's memory management, which holds on to memory until the task nodes are deleted. As a result, the current way of generating the npz files is infeasible, because it leaks memory.

Solution
I reorganized the save mechanism without hurting speed: it reaches 6.48 it/s on average, which is also faster than before, while memory usage stays under control.
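For readers who want the general idea, here is a minimal sketch of one way to keep memory bounded: cap the number of in-flight tasks and save each result as soon as it finishes, so that no long list of pending results builds up. This is not the code from this PR. It uses the standard library's `concurrent.futures` as a stand-in for Ray (with Ray, `ray.wait` plays a similar role for picking up finished tasks), and `process_bounded`, `work`, `save`, and `max_in_flight` are hypothetical names chosen for illustration.

```python
from concurrent.futures import ThreadPoolExecutor, wait, FIRST_COMPLETED


def process_bounded(items, work, save, max_in_flight=8, max_workers=4):
    """Run work(item) concurrently with at most max_in_flight pending results.

    Returns the peak number of pending tasks that was observed.
    """
    pending = set()
    peak = 0
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for item in items:
            # Block until there is room before submitting more work.
            while len(pending) >= max_in_flight:
                done, pending = wait(pending, return_when=FIRST_COMPLETED)
                for fut in done:
                    # Write the result out right away; the future is then
                    # no longer referenced and can be garbage collected.
                    save(fut.result())
            pending.add(pool.submit(work, item))
            peak = max(peak, len(pending))
        # Drain the tasks that are still running.
        for fut in pending:
            save(fut.result())
    return peak
```

In a real extraction script, `work` would be the per-image feature extraction and `save` would write one npz file; both are left abstract here.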

Zoroaster97

Thank you, but there are syntax errors (unmatched parentheses) on lines 119, 128, and 152.

Zoroaster97 · 2020-12-09T11:59:08Z

Thanks for raising this. With our current method, an insufficient number of CPUs does indeed cause the memory leakage problem, because the processing bottleneck is on the CPU (in NMS). When we run it on our machine, which has 32 CPU cores and 4 TITAN V GPUs, memory usage is stable because the CPUs' computing capability matches the GPUs'. When we limit the number of CPUs to 8, the mismatch in computing capability leads to the memory leakage problem; it is probably something like waiting queues piling up in memory.
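The queue-stacking explanation can be illustrated with a toy model, under the assumption that the GPU stage emits a fixed number of items per step and the CPU stage (NMS) drains a fixed number per step. The function name and the rates below are made up for illustration and are not measurements from this repository.

```python
def backlog_over_time(produce_rate, consume_rate, steps):
    """Queue length after each step for a producer feeding a consumer."""
    backlog = 0
    history = []
    for _ in range(steps):
        backlog += produce_rate                # GPU stage emits items
        backlog -= min(backlog, consume_rate)  # CPU stage drains what it can
        history.append(backlog)
    return history


# Matched rates: the queue stays empty.
matched = backlog_over_time(produce_rate=10, consume_rate=10, steps=5)

# CPU slower than GPU: the queue, and the memory behind it, grows every step.
mismatched = backlog_over_time(produce_rate=10, consume_rate=4, steps=5)
```

When the consumer keeps up, the backlog stays at zero; when it does not, the backlog grows linearly, which would look like a leak from the outside even though nothing is technically lost.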

Zoroaster97 · 2020-12-09T12:24:35Z

With 32 CPUs & 4 GPUs, 16 CPUs & 2 GPUs, and 8 CPUs & 1 GPU, the speed is about 15 it/s, 7.4 it/s, and 3.7 it/s respectively, with no memory leakage. So if you have enough CPU cores, or choose an appropriate match between CPUs and GPUs, the current method can be fast and safe. Unfortunately, an appropriate match cannot be detected automatically, so we have decided to keep your solution as the safe method. Thanks again!

jhliu17 · 2020-12-09T12:33:01Z

😊 Thanks for the experiments, which verify that the real cause of the memory leakage is the mismatch in computing capability. In my extraction run I used 4 CPUs & 3 GPUs, which is why memory usage grew so quickly.
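As a quick summary of the configurations reported in this thread, the snippet below tabulates the CPU-to-GPU ratio for each one. It assumes that the 8-CPU run that showed growing memory still used all 4 GPUs, which the comment above implies but does not state. It only restates the observations here and is not a rule for picking a safe configuration.

```python
# (cpus, gpus, outcome) as reported in this conversation.
reported = [
    (32, 4, "stable"),
    (16, 2, "stable"),
    (8, 1, "stable"),
    (8, 4, "memory grows"),  # assumption: all 4 GPUs were still in use
    (4, 3, "memory grows"),
]


def cpus_per_gpu(cpus, gpus):
    """Number of CPU cores available per GPU."""
    return cpus / gpus


ratios = [(cpus_per_gpu(c, g), outcome) for c, g, outcome in reported]
stable = [r for r, outcome in ratios if outcome == "stable"]
growing = [r for r, outcome in ratios if outcome == "memory grows"]
```

Every stable run had 8 CPU cores per GPU, while both runs with growing memory had 2 or fewer.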

unmatched parentheses

Zoroaster97

fix possible memory leakage problem

debug: memory continual grow up

326e6e5

Zoroaster97 suggested changes Dec 9, 2020


unmatched parentheses

116d2de

unmatched parentheses

Zoroaster97 approved these changes Dec 9, 2020


MIL-VLG merged commit 6b62524 into MILVLG:master Dec 9, 2020

Zoroaster97 mentioned this pull request Dec 9, 2020

Take the old version of extract features as a faster version #42

Merged
