Skip to content

Commit

Permalink
Merge branch 'develop' into zm/clean-download-filenames
Browse files Browse the repository at this point in the history
  • Loading branch information
zhiltsov-max authored Jul 21, 2023
2 parents eef60f7 + be81d28 commit 90d4b65
Show file tree
Hide file tree
Showing 5 changed files with 97 additions and 34 deletions.
131 changes: 97 additions & 34 deletions site/content/en/docs/manual/advanced/automatic-annotation.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,54 +2,117 @@
title: 'Automatic annotation'
linkTitle: 'Automatic annotation'
weight: 16
description: 'Guide to using the automatic annotation of tasks.'
description: 'Automatic annotation of tasks'
---

Automatic Annotation is used for creating preliminary annotations.
To use Automatic Annotation you need a DL model that can be deployed by a CVAT administrator.
You can find the list of available models in the `Models` section.
Automatic annotation in CVAT is a tool that you can use
to automatically pre-annotate your data with pre-trained models.

1. To launch automatic annotation, you should open the dashboard and find a task which you want to annotate.
Then click the `Actions` button and choose option `Automatic Annotation` from the dropdown menu.
CVAT can use models from the following sources:

- [Pre-installed models](#models).
- Models integrated from [Hugging Face and Roboflow](#adding-models-from-hugging-face-and-roboflow).
- [Self-hosted models deployed with Nuclio](/docs/manual/advanced/serverless-tutorial/).

The following table describes the available options:

| | Self-hosted | Cloud |
| ------------------------------------------- | ---------------------- | ------------------------------------------------ |
| **Price** | Free | See [Pricing](https://www.cvat.ai/pricing/cloud) |
| **Models** | You have to add models | You can use pre-installed models |
| **Hugging Face & Roboflow <br>integration** | Not supported | Supported |

See:

- [Running Automatic annotation](#running-automatic-annotation)
- [Labels matching](#labels-matching)
- [Models](#models)
- [Adding models from Hugging Face and Roboflow](#adding-models-from-hugging-face-and-roboflow)

## Running Automatic annotation

To start automatic annotation, do the following:

1. On the top menu, click **Tasks**.
1. Find the task you want to annotate and click **Action** > **Automatic annotation**.

![](/images/image119_detrac.jpg)

1. In the dialog window select a model you need. DL models are created for specific labels, e.g.
the Crossroad model was taught using footage from cameras located above the highway and it is best to
use this model for the tasks with similar camera angles.
If it's necessary select the `Clean old annotations` checkbox.
Adjust the labels so that the task labels will correspond to the labels of the DL model.
For example, let’s consider a task where you have to annotate labels “car” and “person”.
You should connect the “person” label from the model to the “person” label in the task.
As for the “car” label, you should choose the most fitting label available in the model - the “vehicle” label.
If the chosen model supports automatic attributes detection
(like facial expressions, for example: ``serverless/openvino/omz/intel/face-detection-0205``),
you can also map attributes between the DL model and your task.
The task requires to annotate cars only and choosing the “vehicle” label implies annotation of all vehicles,
in this case using auto annotation will help you complete the task faster.
Click `Submit` to begin the automatic annotation process.
1. In the Automatic annotation dialog, from the drop-down list, select a [model](#models).
1. [Match the labels](#labels-matching) of the model and the task.
1. (Optional) In case you need the model to return masks as polygons, switch toggle **Return masks as polygons**.
1. (Optional) In case you need to remove all previous annotations, switch toggle **Clean old annotations**.

![](/images/image120.jpg)

1. At runtime - you can see the percentage of completion.
You can cancel the automatic annotation by clicking on the `Cancel`button.
1. Click **Annotate**.

CVAT will show the progress of annotation on the progress bar.

![Progress bar](/images/image121_detrac.jpg)

You can stop the automatic annotation at any moment by clicking cancel.

## Labels matching

Each model is trained on a dataset and supports only the dataset's labels.

For example:

- DL model has the label `car`.
- Your task (or project) has the label `vehicle`.

To annotate, you need to match these two labels to give
CVAT a hint that, in this case, `car` = `vehicle`.

If you have a label that is not on the list
of DL labels, you will not be able to
match them.

For this reason, supported DL models are suitable only
for certain labels.

To check the list of labels for each model, see [Models](#models)
papers and official documentation.

## Models

Automatic annotation uses pre-installed and added models.

> For self-hosted solutions,
> you need to [install Automatic Annotation first](/docs/administration/advanced/installation_automatic_annotation/)
> and [add models](/docs/manual/advanced/models/).
List of pre-installed models:

<!--lint disable maximum-line-length-->

| Model | Description |
| ------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Attributed face detection | Three OpenVINO models work together: <br><br><li> [Face Detection 0205](https://docs.openvino.ai/2022.3/omz_models_model_face_detection_0205.html): face detector based on MobileNetV2 as a backbone with a FCOS head for indoor and outdoor scenes shot by a front-facing camera. <li>[Emotions recognition retail 0003](https://docs.openvino.ai/2022.3/omz_models_model_emotions_recognition_retail_0003.html#emotions-recognition-retail-0003): fully convolutional network for recognition of five emotions (‘neutral’, ‘happy’, ‘sad’, ‘surprise’, ‘anger’). <li>[Age gender recognition retail 0013](https://docs.openvino.ai/2022.3/omz_models_model_age_gender_recognition_retail_0013.html): fully convolutional network for simultaneous Age/Gender recognition. The network can recognize the age of people in the [18 - 75] years old range; it is not applicable for children since their faces were not in the training set. |
| RetinaNet R101 | RetinaNet is a one-stage object detection model that utilizes a focal loss function to address class imbalance during training. Focal loss applies a modulating term to the cross entropy loss to focus learning on hard negative examples. RetinaNet is a single, unified network composed of a backbone network and two task-specific subnetworks. <br><br>For more information, see: <li>[Site: RetinaNET](https://paperswithcode.com/lib/detectron2/retinanet) |
| Text detection | Text detector based on PixelLink architecture with MobileNetV2, depth_multiplier=1.4 as a backbone for indoor/outdoor scenes. <br><br> For more information, see: <li>[Site: OpenVINO Text detection 004](https://docs.openvino.ai/2022.3/omz_models_model_text_detection_0004.html) |
| YOLO v3 | YOLO v3 is a family of object detection architectures and models pre-trained on the COCO dataset. <br><br> For more information, see: <li>[Site: YOLO v3](https://docs.openvino.ai/2022.3/omz_models_model_yolo_v3_tf.html) |
| YOLO v5 | YOLO v5 is a family of object detection architectures and models based on the Pytorch framework. <br><br> For more information, see: <li>[GitHub: YOLO v5](https://github.com/ultralytics/yolov5) <li>[Site: YOLO v5](https://docs.ultralytics.com/#yolov5) |
| YOLO v7 | YOLOv7 is an advanced object detection model that outperforms other detectors in terms of both speed and accuracy. It can process frames at a rate ranging from 5 to 160 frames per second (FPS) and achieves the highest accuracy with 56.8% average precision (AP) among real-time object detectors running at 30 FPS or higher on the V100 graphics processing unit (GPU). <br><br> For more information, see: <li>[GitHub: YOLO v7](https://github.com/WongKinYiu/yolov7) <li>[Paper: YOLO v7](https://arxiv.org/pdf/2207.02696.pdf) |

<!--lint enable maximum-line-length-->

![](/images/image121_detrac.jpg)
## Adding models from Hugging Face and Roboflow

1. The end result of an automatic annotation is an annotation with separate rectangles (or other shapes)
In case you did not find the model you need, you can add a model
of your choice from [Hugging Face](https://huggingface.co/)
or [Roboflow](https://roboflow.com/).

![](/images/gif014_detrac.gif)
> **Note**, that you cannot add models from Hugging Face and Roboflow to self-hosted CVAT.
1. You can combine separate bounding boxes into tracks using the `Person reidentification ` model.
To do this, click on the automatic annotation item in the action menu again and select the model
of the `ReID` type (in this case the `Person reidentification` model).
You can set the following parameters:
<!--lint disable maximum-line-length-->

- Model `Threshold` is a maximum cosine distance between objects’ embeddings.
- `Maximum distance` defines a maximum radius that an object can diverge between adjacent frames.
For more information,
see [Streamline annotation by integrating Hugging Face and Roboflow models](https://www.cvat.ai/post/integrating-hugging-face-and-roboflow-models).

![](/images/image133.jpg)
This video demonstrates the process:

1. You can remove false positives and edit tracks using `Split` and `Merge` functions.
<iframe width="560" height="315" src="https://www.youtube.com/embed/SbU3aB65W5s" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>

![](/images/gif015_detrac.gif)
<!--lint enable maximum-line-length-->
Binary file removed site/content/en/images/gif014_detrac.gif
Binary file not shown.
Binary file removed site/content/en/images/gif015_detrac.gif
Binary file not shown.
Binary file modified site/content/en/images/image120.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file removed site/content/en/images/image133.jpg
Binary file not shown.

0 comments on commit 90d4b65

Please sign in to comment.