Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Az/custom annotation #233

Merged
merged 42 commits into from
Dec 26, 2018
Merged
Show file tree
Hide file tree
Changes from 13 commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
176d632
initial version of custom annotation application
Dec 10, 2018
98760d1
added readme for custom annotation
Dec 13, 2018
20b5d01
Update Readme
azhavoro Dec 13, 2018
b5d90eb
Update README
azhavoro Dec 13, 2018
fdb1d51
update README
Dec 13, 2018
145beea
Merge branch 'az/custom_annotation' of https://github.com/opencv/cvat…
Dec 13, 2018
e81bbbb
minor fixes
Dec 13, 2018
897ad63
custom annotation -> auto annotation
Dec 14, 2018
095bfbb
fixed typos
Dec 14, 2018
8d608db
remove unused method
Dec 14, 2018
039d70f
fixed indents
Dec 14, 2018
0f19d63
restricting usage of built-ins in user's code
Dec 14, 2018
95ac2d9
updted README
Dec 14, 2018
515aafb
fixed typos
Dec 17, 2018
ae3e565
fixed typo
Dec 17, 2018
96b57cc
fixed comments from Boris
Dec 17, 2018
f6a9563
switch from OpenCV to IE to infer directly
Dec 20, 2018
78d19a4
fixed some codacy issues
Dec 25, 2018
ba53010
updated README
Dec 25, 2018
f3e33a1
fix codacy issue
Dec 25, 2018
0240061
fix codacy issues
Dec 25, 2018
dd7ae04
fix codacy issues
Dec 25, 2018
284df75
updated readme
Dec 25, 2018
1d7204d
Update README.md
azhavoro Dec 25, 2018
565d442
Update README.md
azhavoro Dec 26, 2018
8c0c8fd
fixed some typos
Dec 26, 2018
c854dfb
fixed some typos
Dec 26, 2018
6dc611b
Moved information about components into root README.md
Dec 26, 2018
9142fd8
Merge remote-tracking branch 'origin/develop' into az/custom_annotation
Dec 26, 2018
b2805b2
Slightly improved documentation.
Dec 26, 2018
1daf222
Fix typo
nmanovic Dec 26, 2018
f0ff2f1
added public results class to interact with interpretation script
Dec 26, 2018
4f1dc0d
attributes support
Dec 26, 2018
e98080f
fixed codacy issues
Dec 26, 2018
4c20011
added several points for Point shape support
Dec 26, 2018
cd68501
added polylines and polygons support
Dec 26, 2018
0254757
rename add_point -> add_points
Dec 26, 2018
28d86e2
updated readme
Dec 26, 2018
8370bb7
Merge remote-tracking branch 'origin/develop' into az/custom_annotation
Dec 26, 2018
fde65d8
Update CHANGELOG.md
Dec 26, 2018
015571c
support OpenVINO R5
Dec 26, 2018
6cc79f5
Merge branch 'az/custom_annotation' of https://github.com/opencv/cvat…
Dec 26, 2018
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
124 changes: 124 additions & 0 deletions cvat/apps/auto_annotation/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,124 @@
## Auto annotation

### Description

This application will be enabled automatically if OpenVINO component was installed and allows to use custom detection models for preannotation.
azhavoro marked this conversation as resolved.
Show resolved Hide resolved
Supported frameworks:
* DLDT form OpenVINO toolkit
azhavoro marked this conversation as resolved.
Show resolved Hide resolved

Application uses OpenCV dnn module with DLDT backed for inference.
azhavoro marked this conversation as resolved.
Show resolved Hide resolved

### Usage
To annotate task with custom model you need prepare 4 files:
azhavoro marked this conversation as resolved.
Show resolved Hide resolved
1. **Model config** - a text file that contains network configuration. It could be a file with the following extension:
azhavoro marked this conversation as resolved.
Show resolved Hide resolved
* *.xml (DLDT)
1. **Model weights** - a binary file that contains trained weights. The following file extension is expected:
* *.bin (DLDT)
1. **Preprocessing configuration and label map** - simple json file that describes image dimensions and preprocessing options. For more details please view [OpenCV](https://docs.opencv.org/3.4/d6/d0f/group__dnn.html#ga0b7b7c3c530b747ef738178835e1e70f) documentation.
Label values in label_map should be exactly equal to labels wich task was created, otherwise it will be ignored.
Example:
```json
{
"blob_params": {
"width": 300,
"height":300,
"mean": "127.5, 127.5, 127.5",
"scalefactor": 0.0078431372549,
"swapRB": false,
"crop": false
},
"label_map": {
"0": "background",
"1": "aeroplane",
"2": "bicycle",
"3": "bird",
"4": "boat",
"5": "bottle",
"6": "bus",
"7": "car",
"8": "cat",
"9": "chair",
"10": "cow",
"11": "diningtable",
"12": "dog",
"13": "horse",
"14": "motorbike",
"15": "person",
"16": "pottedplant",
"17": "sheep",
"18": "sofa",
"19": "train",
"20": "tvmonitor"
}
}
```
1. **Interpretation script** - python file that used to convert output results from net to CVAT format. This code running inside restricted environment.
List of builtins functions that available to use:
nmanovic marked this conversation as resolved.
Show resolved Hide resolved
* **str**
* **int**
* **float**
* **max**
* **min**
* **range**

Also two variables are available in scope:
* **detections** list with detection results(see description below)
* **results** dictionary where convertation results shoud be added (see examples below for details)

`detection` is a python's list of dictionaries that represent detections for each frame of task with folloing keys:
* frame_id - frame number
* frame_height - frame height
* frame_width - frame width
* detections - output blob (See [cv::dnn::Net::forward](https://docs.opencv.org/3.4/db/d30/classcv_1_1dnn_1_1Net.html#a98ed94cb6ef7063d3697259566da310b) for details).

`results` is dictionary with structure:
```python
{
"boxes": [],
"polygons": [],
"polylines": [],
"points": [],
"box_paths": [],
"polygon_paths": [],
"polyline_paths": [],
"points_paths": [],
}
```

Example for SSD based network
```python
def process_results(detections, results):
def clip(value):
return max(min(1.0, value), 0.0)

boxes = results['boxes']

for frame_results in detections:
nmanovic marked this conversation as resolved.
Show resolved Hide resolved
frame_height = frame_results['frame_height']
frame_width = frame_results['frame_width']
frame_number = frame_results['frame_id']

for i in range(frame_results['detections'].shape[2]):
confidence = frame_results['detections'][0, 0, i, 2]
if confidence < 0.4: continue
class_id = str(int(frame_results['detections'][0, 0, i, 1]))
xtl = '{:.2f}'.format(clip(frame_results['detections'][0, 0, i, 3]) * frame_width)
ytl = '{:.2f}'.format(clip(frame_results['detections'][0, 0, i, 4]) * frame_height)
xbr = '{:.2f}'.format(clip(frame_results['detections'][0, 0, i, 5]) * frame_width)
ybr = '{:.2f}'.format(clip(frame_results['detections'][0, 0, i, 6]) * frame_height)

boxes.append({
'label': class_id,
'frame': frame_number,
'xtl': xtl,
'ytl': ytl,
'xbr': xbr,
'ybr': ybr,
'attributes': {
'confidence': '{:.2f}'.format(confidence),
}
})

process_results(detections, results)

```
8 changes: 8 additions & 0 deletions cvat/apps/auto_annotation/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@

# Copyright (C) 2018 Intel Corporation
#
# SPDX-License-Identifier: MIT

from cvat.settings.base import JS_3RDPARTY

JS_3RDPARTY['dashboard'] = JS_3RDPARTY.get('dashboard', []) + ['auto_annotation/js/auto_annotation.js']
9 changes: 9 additions & 0 deletions cvat/apps/auto_annotation/admin.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@

# Copyright (C) 2018 Intel Corporation
#
# SPDX-License-Identifier: MIT

from django.contrib import admin

# Register your models here.

11 changes: 11 additions & 0 deletions cvat/apps/auto_annotation/apps.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@

# Copyright (C) 2018 Intel Corporation
#
# SPDX-License-Identifier: MIT

from django.apps import AppConfig


class AutoAnnotationConfig(AppConfig):
name = 'auto_annotation'

19 changes: 19 additions & 0 deletions cvat/apps/auto_annotation/image_loader.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
import cv2

class ImageLoader():
def __init__(self, image_list):
self.image_list = image_list

def __getitem__(self, i):
return self.image_list[i]

def __iter__(self):
for imagename in self.image_list:
yield imagename, self._load_image(imagename)

def __len__(self):
return len(self.image_list)

@staticmethod
def _load_image(path_to_image):
return cv2.imread(path_to_image)
5 changes: 5 additions & 0 deletions cvat/apps/auto_annotation/migrations/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@

# Copyright (C) 2018 Intel Corporation
#
# SPDX-License-Identifier: MIT

51 changes: 51 additions & 0 deletions cvat/apps/auto_annotation/model_loader.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,51 @@
import cv2
azhavoro marked this conversation as resolved.
Show resolved Hide resolved
import json

class BlobParamaters():
def __init__(self, scalefactor, input_size, mean, swapRB, crop):
self.scalefactor = scalefactor
self.input_size = input_size
self.mean = mean
self.swapRB = swapRB
self.crop = crop

class ModelLoader():
def __init__(self, path_to_model, blob_params):
self.path_to_model = path_to_model
self.blob_params = blob_params

def load(self):
self.net = cv2.dnn.readNet(*self.path_to_model)
self.net.setPreferableTarget(cv2.dnn.DNN_TARGET_CPU)
self.net.setPreferableBackend(cv2.dnn.DNN_BACKEND_DEFAULT)

def setInput(self, images):
blob = cv2.dnn.blobFromImages(images,
self.blob_params.scalefactor,
self.blob_params.input_size,
self.blob_params.mean,
self.blob_params.swapRB,
self.blob_params.crop)

self.net.setInput(blob)

def forward(self):
return self.net.forward()

def read_model_config(config_path):
with open(config_path, 'r') as f:
return json.load(f)

def get_blob_props(config):
blob_params = config['blob_params']
return BlobParamaters(
scalefactor=blob_params['scalefactor'] if 'scalefactor' in blob_params else 1.0,
input_size=(blob_params['height'], blob_params['width']) if 'height' in blob_params and 'width' in blob_params else (),
mean=tuple(float(v) for v in blob_params['mean'].split(',')) if 'mean' in blob_params else tuple(),
swapRB=blob_params['swapRB'] if 'swapRB' in blob_params else False,
crop=blob_params['crop'] if 'crop' in blob_params else False,
)

def get_model_label_map(config):
return config['label_map']

9 changes: 9 additions & 0 deletions cvat/apps/auto_annotation/models.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@

# Copyright (C) 2018 Intel Corporation
#
# SPDX-License-Identifier: MIT

from django.db import models

# Create your models here.

Loading