
Classifier training - Mosaic data augmentation #4432

Closed · AlexeyAB opened this issue Dec 2, 2019 · 31 comments
@AlexeyAB (Owner) commented Dec 2, 2019

Related to: #4264

Use for Classifier training:

Run training with the flag -show_imgs to see how the images are changed (shown in separate windows and saved to files aug_... .jpg) and how the labels are changed (see the console).
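For example, a classifier training run with this flag might look like the following sketch (the .data/.cfg file names here are placeholders, not taken from this thread):

    ./darknet classifier train cfg/imagenet1k.data cfg/darknet19.cfg -show_imgs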

[image]

@AlexeyAB (Owner, Author) commented Dec 2, 2019

@WongKinYiu I also implemented Mosaic data augmentation for the Classifier.

@WongKinYiu (Collaborator) commented Dec 2, 2019

@AlexeyAB No available GPU 🐧🐧🐧
Which combination of data augmentations is suggested?

@Look4-you commented Dec 3, 2019

@AlexeyAB Hi.

  1. During training, is one of these data augmentation methods (like hue, mosaic, saturation) randomly selected to process each batch of images?

  2. Besides, jitter is set at each [yolo] layer; does it apply to the feature map?

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@WongKinYiu

Maybe cutmix=1 mosaic=1


I would recommend training the smallest model (for quick comparison) with each approach in turn and comparing the %Top1 accuracy gain.
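For reference, a minimal sketch of the corresponding [net]-section options in the cfg (all other [net] parameters omitted):

    [net]
    cutmix=1
    mosaic=1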

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@Look4-you

During training, is one of these data augmentation methods (like hue, mosaic, saturation) randomly selected to process each batch of images?

Every data augmentation method occurs randomly if enabled.


Besides, jitter is set at each [yolo] layer; does it apply to the feature map?

random= and jitter= are used only from the last [yolo] layer:
jitter - resizes the input image
random - resizes the network
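As a sketch, both options go in the last [yolo] section of the cfg; the values below are the common YOLOv3 defaults, shown only for illustration:

    [yolo]
    ...
    jitter=.3    # randomly crop/resize the input image during training
    random=1     # resize the whole network (see below: every 10 iterations, from 1/1.4 to 1.4)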

@Look4-you

@AlexeyAB Thanks a lot!

@WongKinYiu (Collaborator)

I would recommend training the smallest model (for quick comparison) with each approach in turn and comparing the %Top1 accuracy gain.

@AlexeyAB OK,

I will do experiments with the smallest model first.
Thanks for your suggestion.

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@WongKinYiu Also, you can try to train with a large mini_batch (if you have 32-256 GB of CPU RAM): #4386

@WongKinYiu (Collaborator) commented Dec 3, 2019

@AlexeyAB Hmm... I think my GPU schedule is full until next year.

I will do more comparisons if I can borrow more GPUs/machines.
Currently I have borrowed a Titan RTX and a Tesla V100 to compare with the results of my Titan X and 1080 Ti at different mini_batch sizes.

I think I can do an experiment with a large mini_batch using a single 1080 Ti and 64 GB of CPU RAM next week.

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@WongKinYiu If you use CPU RAM #4386 to increase the mini_batch size, then the bottleneck is PCIe, so it doesn't require a high-end GPU; you can use a GTX 1060/1070. So if you have 128-256 GB of CPU RAM, you can set a mini_batch size 4-8x larger than on a Titan RTX 24 GB or a Tesla V100 16/32 GB.
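In darknet cfgs, mini_batch = batch / subdivisions, so increasing mini_batch means raising batch or lowering subdivisions. A sketch with illustrative values only (optimized_memory is the cfg option from #4386, discussed later in this thread):

    [net]
    batch=128
    subdivisions=4        # mini_batch = batch / subdivisions = 32
    optimized_memory=3    # keep most activations in CPU RAM (see #4386)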

@WongKinYiu (Collaborator)

@AlexeyAB I know, but I would like it to have only one control factor.

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@WongKinYiu

I know, but I would like it to have only one control factor.

What is the control factor?

@WongKinYiu (Collaborator) commented Dec 3, 2019

@AlexeyAB I hope the only difference is the mini_batch size, so I want to run the experiment on the same machine, same GPU, ... (maybe it is a controlled variable?)

@AlexeyAB (Owner, Author) commented Dec 3, 2019

@WongKinYiu

I hope the only difference is the mini_batch size, so I want to run the experiment on the same machine, same GPU, ... (maybe it is a controlled variable?)

Yes.
Also, if you use CPU-RAM + GPU-processing, then this is still a controllable factor.

@AlexeyAB (Owner, Author) commented Dec 4, 2019

@WongKinYiu Also, I added blur=1 for Classifier training: #3320 (comment)

@Look4-you commented Dec 6, 2019

@AlexeyAB Hi.
How does random work to resize the network from the last [yolo] layer?
I know that random=1 randomly resizes the network every 10 iterations from 1/1.4 to 1.4 (the data augmentation parameter is used only from the last layer), but how?

@AlexeyAB (Owner, Author) commented Dec 6, 2019

@Look4-you This thread is about Mosaic data augmentation for the Classifier, not about random=, which can be used only for the Detector. Please create a new issue.

@WongKinYiu (Collaborator) commented Dec 22, 2019

@AlexeyAB

Can I set the following?

mosaic=1
cutmix=1
blur=1
label_smooth_eps=0.1

@AlexeyAB (Owner, Author)

@WongKinYiu Yes.

You can set:

  • for Classifier:
    mosaic=1
    cutmix=1
    blur=1
    label_smooth_eps=0.1

  • for Detector:
    mosaic=1
    blur=1
    label_smooth_eps=0.1

@WongKinYiu (Collaborator)

@AlexeyAB Thank you very much.

@AlexeyAB (Owner, Author)

@WongKinYiu

Also try to train the Classifier with

[net]
mosaic=1

but change this line:

d.y.vals[i][j] = d.y.vals[i][j] * s1 + d2.y.vals[i][j] * s2 + d3.y.vals[i][j] * s3 + d4.y.vals[i][j] * s4;

to these 2 lines and recompile:

const float max_s = max_val_cmp(s1, max_val_cmp(s2, max_val_cmp(s3, s4)));
d.y.vals[i][j] = d.y.vals[i][j] * s1 / max_s + d2.y.vals[i][j] * s2 / max_s + d3.y.vals[i][j] * s3 / max_s + d4.y.vals[i][j] * s4 / max_s;
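The effect of the change: in the original line the four tile labels are blended by their area fractions s1..s4 (which sum to 1), while the modified line rescales by the largest fraction so the dominant tile keeps a label weight of 1.0. A standalone sketch of the arithmetic (not darknet code; the s1..s4 values are made up for illustration):

    #include <stdio.h>

    /* two-way float max, mirroring darknet's max_val_cmp() */
    static float max2(float a, float b) { return a > b ? a : b; }

    int main(void) {
        float s1 = 0.40f, s2 = 0.30f, s3 = 0.20f, s4 = 0.10f; /* tile area fractions */
        const float max_s = max2(s1, max2(s2, max2(s3, s4)));
        /* original soft-label weight of a tile is s_k; modified weight is s_k / max_s */
        printf("tile1: %.2f -> %.2f\n", s1, s1 / max_s); /* 0.40 -> 1.00 */
        printf("tile2: %.2f -> %.2f\n", s2, s2 / max_s); /* 0.30 -> 0.75 */
        printf("tile3: %.2f -> %.2f\n", s3, s3 / max_s); /* 0.20 -> 0.50 */
        printf("tile4: %.2f -> %.2f\n", s4, s4 / max_s); /* 0.10 -> 0.25 */
        return 0;
    }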

@WongKinYiu (Collaborator)

@AlexeyAB

Hello,

The memory leak problem is very serious when I set

mosaic=1
cutmix=1
blur=1
label_smooth_eps=0.1

or even

mosaic=1
cutmix=1

or even with OPENCV disabled together with the above settings.

I tried to modify the code at https://github.com/AlexeyAB/darknet/blob/master/src/data.c#L1510,
but it did not solve the problem.

@WongKinYiu (Collaborator)

https://github.com/AlexeyAB/darknet/blob/master/src/data.c#L1531
This line checks whether the mixup mode is set as:

  • mosaic = 1 (mixup mode 3)
  • mosaic = 1, cutmix = 1 (mixup mode 4)

However, https://github.com/AlexeyAB/darknet/blob/master/src/data.c#L1548
re-assigns the mixup mode:

  • mosaic = 1 (mixup mode 3)
  • cutmix = 1 (mixup mode 2)

https://github.com/AlexeyAB/darknet/blob/master/src/data.c#L1640
So the free_data in this line does not behave as expected:

  • d3 and d4 have content even though the mixup mode has been re-assigned to 2

So I changed this part from

        if (mixup == 3) {
            free_data(d3);
            free_data(d4);
        }

to

        free_data(d3);
        free_data(d4);

but I think the better way is to keep a copy of the original mixup mode (see the sketch below).
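A sketch of that suggested fix, assuming the surrounding code in data.c roughly matches the description above (mixup, d3, d4, and free_data are the names discussed in this thread; the mixup_orig name is hypothetical):

    /* keep the originally requested mode before any per-batch re-assignment */
    const int mixup_orig = mixup;   /* 3 = mosaic, 4 = mosaic + cutmix */

    /* ... mixup may later be re-assigned to 2 (cutmix) for this batch ... */

    /* free d3/d4 whenever mosaic was originally enabled, regardless of
       which mode was actually chosen for this particular batch */
    if (mixup_orig == 3 || mixup_orig == 4) {
        free_data(d3);
        free_data(d4);
    }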

@AlexeyAB (Owner, Author)

@WongKinYiu Hi, Thanks!

I fixed this bug: b8605bd

It seems that mosaic gives a Top1/Top5 improvement: https://github.com/WongKinYiu/CrossStagePartialNetworks

Also, did you try to use mosaic with such a modification? #4432 (comment)

@WongKinYiu (Collaborator)

Not yet; I'm trying to solve the problem with optimized_memory = 1.

The memory usage with optimized_memory = 1 is as follows:
[image]
However, the expected usage is:
[image]

@AlexeyAB (Owner, Author)

@WongKinYiu

Are you planning to use optimized_memory = 1 or optimized_memory = 3?


I didn't find a very simple solution for this.

  • either we should pass the net.optimized_memory parameter to the make_...() function of each layer (make_convolutional(), make_shortcut(), ...) to suppress memory allocation for output_gpu, delta_gpu, activation_input_gpu

  • or we can just free these arrays after these make_...() functions - but then there will be such a surge in memory consumption:

    darknet/src/parser.c

    Lines 1263 to 1294 in b8605bd

    // futher GPU-memory optimization: net.optimized_memory == 2
    if (net.optimized_memory >= 2 && params.train && l.type != DROPOUT)
    {
        l.optimized_memory = net.optimized_memory;
        if (l.output_gpu) {
            cuda_free(l.output_gpu);
            //l.output_gpu = cuda_make_array_pinned(l.output, l.batch*l.outputs); // l.steps
            l.output_gpu = cuda_make_array_pinned_preallocated(NULL, l.batch*l.outputs); // l.steps
        }
        if (l.activation_input_gpu) {
            cuda_free(l.activation_input_gpu);
            l.activation_input_gpu = cuda_make_array_pinned_preallocated(NULL, l.batch*l.outputs); // l.steps
        }
        if (l.x_gpu) {
            cuda_free(l.x_gpu);
            l.x_gpu = cuda_make_array_pinned_preallocated(NULL, l.batch*l.outputs); // l.steps
        }
        // maximum optimization
        if (net.optimized_memory >= 3 && l.type != DROPOUT) {
            if (l.delta_gpu) {
                cuda_free(l.delta_gpu);
                //l.delta_gpu = cuda_make_array_pinned_preallocated(NULL, l.batch*l.outputs); // l.steps
                //printf("\n\n PINNED DELTA GPU = %d \n", l.batch*l.outputs);
            }
        }
        if (l.type == CONVOLUTIONAL) {
            set_specified_workspace_limit(&l, net.workspace_size_limit); // workspace size limit 1 GB
        }
    }

@WongKinYiu (Collaborator)

@AlexeyAB Thanks,

Currently I also cannot find a good way to deal with it.
OK, I'll take a look at the modified mosaic first.

@AlexeyAB (Owner, Author)

@WongKinYiu

optimized_memory = 1 optimizes memory consumption very poorly.

Anyway, for significant optimization you should use optimized_memory = 3; in that case the CPU-memory consumption will matter much more than the GPU-memory consumption, and this issue (the surge in consumption) will not be so significant.
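A sketch of enabling the aggressive mode in the [net] section; optimized_memory is confirmed by this thread, while the workspace-limit key name is an assumption based on the net.workspace_size_limit field in the parser code quoted above:

    [net]
    optimized_memory=3            # maximum optimization: also frees delta_gpu (per the parser.c excerpt above)
    workspace_size_limit_MB=1024  # cap the convolutional workspace (assumed key name)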

@WongKinYiu (Collaborator)

#4432 (comment) got NaN.

@BernoGreyling

Hi @AlexeyAB,

I think there might be a bug with the mosaic flag on a Detector with the combination of settings I am using. The bounding boxes overlap from one image to another, for example:

[image]

My config has the following [net] params:

    [net]
    # Testing
    #batch=1
    #subdivisions=1
    # Training
    batch=64
    subdivisions=16
    width=736
    height=1280
    channels=3
    momentum=0.9
    decay=0.0005
    angle=0
    saturation = 1.5
    exposure = 1.5
    hue=.1
    mixup=0
    mosaic=1
    #blur=1
    letter_box=1

Is mosaic supposed to work with Detectors, or only with Classifiers? From the discussion at the top it looks like it should work?

Thanks!

@AlexeyAB (Owner, Author) commented Jan 4, 2020

@BernoGreyling

  • Do you use the latest repository?
  • Do you get this issue for all images or only for some images?
  • mosaic=1 is supported for both the Classifier and the Detector, and it improves accuracy; there is a separate issue for mosaic for the Detector: Detector - Mosaic data augmentation #4264
