Object Detection model#
This tutorial provides a step-by-step guide — from installation to model training — for the object detection task using a specific example.
To learn more about the object detection task, refer to Object Detection.
In this tutorial, we demonstrate how to train and validate the ATSS model on the publicly available WGISD dataset. For details on how to export, optimize, and deploy the trained model, refer to Deploy & Demo.
To provide a concrete example, all commands in this tutorial use the ATSS model — a medium-sized architecture that offers a good trade-off between accuracy and inference speed.
This process has been tested with the following configuration:
Ubuntu 20.04
NVIDIA GeForce RTX 3090
Intel(R) Core(TM) i9-11900
CUDA Toolkit 11.8
Setup virtual environment#
1. You can follow the installation process from the quick start guide to create a universal virtual environment for OpenVINO™ Training Extensions.
2. Activate your virtual environment:
source .otx/bin/activate
# or with this line, if you created the environment using tox
. venv/otx/bin/activate
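Once the environment is activated, you can quickly verify that OpenVINO™ Training Extensions is importable. This is a minimal sanity check, assuming the otx package exposes a __version__ attribute (recent releases do):
# Minimal sanity check that the environment is active.
import otx
print(otx.__version__)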
Dataset preparation#
1. Clone the repository with the WGISD dataset.
mkdir data ; cd data
git clone https://github.com/thsant/wgisd.git
cd wgisd
git checkout 6910edc5ae3aae8c20062941b1641821f0c30127
This dataset contains images of grapevines with annotations for different varieties of grapes:
- CDY - Chardonnay
- CFR - Cabernet Franc
- CSV - Cabernet Sauvignon
- SVB - Sauvignon Blanc
- SYH - Syrah
It’s a great example to start with. The model achieves high accuracy right from the beginning of training because the objects are relatively large and in focus. The objects are also easy for a person to distinguish, so we can check the inference results just by looking at the images.

2. To run the training using the auto-configuration feature, we need to reformat the dataset according to this structure:
wgisd
├── annotations/
│   ├── instances_train.json
│   ├── instances_val.json
│   └── instances_test.json
└── images/
    ├── train
    ├── val
    └── test
We can do that by running these commands:
# format images folder
mv data images
# format annotations folder
mv coco_annotations annotations
# rename annotations to meet *_train.json pattern
mv annotations/train_bbox_instances.json annotations/instances_train.json
mv annotations/test_bbox_instances.json annotations/instances_val.json
cp annotations/instances_val.json annotations/instances_test.json
cd ../..
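Optionally, we can sanity-check the reformatted dataset before training. The snippet below is a minimal sketch using only the Python standard library; it assumes the COCO-style annotation files and the images folder follow the structure shown above.
import json
from pathlib import Path

root = Path("data/wgisd")

# The auto-configuration looks for COCO-style annotation files named instances_*.json.
for split in ("train", "val", "test"):
    ann_file = root / "annotations" / f"instances_{split}.json"
    with ann_file.open() as f:
        coco = json.load(f)
    print(
        f"{split}: {len(coco['images'])} images, "
        f"{len(coco['annotations'])} boxes, "
        f"{len(coco['categories'])} categories"
    )

# The images themselves live under data/wgisd/images.
print("images folder present:", (root / "images").exists())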
Training#
1. First of all, you need to choose which object detection model you want to train. The list of supported recipes for object detection can be printed with the command below.
Note
The characteristics and a detailed comparison of the models can be found in the Explanation section.
(otx) ...$ otx find --task DETECTION --pattern atss
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Task ┃ Model Name ┃ Recipe Path ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│ DETECTION │ atss_mobilenetv2_tile │ src/otx/recipe/detection/atss_mobilenetv2_tile.yaml │
│ DETECTION │ atss_resnext101 │ src/otx/recipe/detection/atss_resnext101.yaml │
│ DETECTION │ atss_resnext101_tile │ src/otx/recipe/detection/atss_resnext101_tile.yaml │
│ DETECTION │ atss_mobilenetv2 │ src/otx/recipe/detection/atss_mobilenetv2.yaml │
└───────────┴───────────────────────┴────────────────────────────────────────────────────────────────┘
from otx.backend.native.cli.utils import list_models
model_lists = list_models(task="DETECTION", pattern="atss")
print(model_lists)
'''
[
    'atss_mobilenetv2',
    'atss_mobilenetv2_tile',
    'atss_resnext101',
    'atss_resnext101_tile',
]
'''
2. In this step, we will set up the configuration with:
- all necessary configs for atss_mobilenetv2
- train/validation sets, based on the provided annotations
It may be counterintuitive, but for --data_root we need to pass the path to the dataset root folder (in our case, data/wgisd) instead of the folder with validation images. This is because the auto-configuration detects the annotations and images according to the expected folder structure we created above.
Let’s check the object detection configuration running the following command:
# or its config path
(otx) ...$ otx train --config src/otx/recipe/detection/atss_mobilenetv2.yaml \
--data_root data/wgisd \
--work_dir otx-workspace \
--print_config
...
data_root: data/wgisd
work_dir: otx-workspace
callback_monitor: val/map_50
disable_infer_num_classes: false
engine:
  task: DETECTION
  device: auto
data:
...
Note
If you want to save the configuration as a YAML file, use the --print_config parameter and redirect the output to a file with > configs.yaml.
(otx) ...$ otx train --config src/otx/recipe/detection/atss_mobilenetv2.yaml --data_root data/wgisd --print_config > configs.yaml
# Update configs.yaml & Train configs.yaml
(otx) ...$ otx train --config configs.yaml
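If you prefer to edit the dumped configuration programmatically rather than by hand, the following is a minimal sketch of the "Update configs.yaml" step. It assumes PyYAML is installed; work_dir appears in the printed configuration above, and data.train_subset.batch_size mirrors the CLI override shown in step 4 below.
# A sketch of editing configs.yaml, assuming PyYAML is installed.
import yaml

with open("configs.yaml") as f:
    cfg = yaml.safe_load(f)

# Tweak values that appear in the dumped configuration.
cfg["work_dir"] = "otx-workspace"
cfg["data"]["train_subset"]["batch_size"] = 4

with open("configs.yaml", "w") as f:
    yaml.safe_dump(cfg, f, sort_keys=False)

# Then train with the updated file:
#   (otx) ...$ otx train --config configs.yaml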
3. otx train trains a model (a particular model recipe) on a dataset and produces the outputs below. Here are the main outputs you can expect with the CLI:
- {work_dir}/{timestamp}/checkpoints/epoch_*.ckpt - a model checkpoint file.
- {work_dir}/{timestamp}/configs.yaml - the configuration file used in the training, which can be reused to reproduce the training.
- {work_dir}/.latest - soft-links to the results of the most recently executed subcommands. This allows you to point to the workspace instead of specifying the checkpoint and config file explicitly.
(otx) ...$ otx train --data_root data/wgisd
(otx) ...$ otx train --config src/otx/recipe/detection/atss_mobilenetv2.yaml --data_root data/wgisd
from otx.backend.native.engine import OTXEngine

data_root = "data/wgisd"
recipe = "src/otx/recipe/detection/atss_mobilenetv2.yaml"

engine = OTXEngine.from_config(
    config_path=recipe,
    data_root=data_root,
    work_dir="otx-workspace",
)

# it is also possible to pass a config as a model to the Engine directly
engine = OTXEngine(
    model=recipe,
    data=data_root,
    work_dir="otx-workspace",
)

# one more possibility to obtain the right engine by the given model/dataset
from otx.engine import create_engine

engine = create_engine(
    model=recipe,
    data=data_root,
    work_dir="otx-workspace",
)

engine.train(...)
from otx.backend.native.engine import OTXEngine
from otx.backend.native.models import ATSS

data_root = "data/wgisd"

model = ATSS(
    model_name="atss_mobilenetv2",
    label_info={
        "label_names": ["Chardonnay", "Cabernet Franc", "Cabernet Sauvignon", "Sauvignon Blanc", "Syrah"],
        "label_id": [0, 1, 2, 3, 4],
        "label_groups": [["Chardonnay", "Cabernet Franc", "Cabernet Sauvignon", "Sauvignon Blanc", "Syrah"]],
    },
    data_input_params={
        "input_size": [800, 992],
        "mean": [0.0, 0.0, 0.0],
        "std": [255.0, 255.0, 255.0],
    },
)

engine = OTXEngine(
    model=model,
    data_root=data_root,
    work_dir="otx-workspace",
)

# one more possibility to obtain the right engine by the given model/dataset
# using the "create_engine" function
from otx.engine import create_engine

engine = create_engine(
    model=model,
    data=data_root,
    work_dir="otx-workspace",
)

engine.train(...)
4. (Optional)
Additionally, we can tune training parameters such as batch size, learning rate, patience epochs or warm-up iterations.
Learn more about specific parameters using otx train --help -v
or otx train --help -vv
.
For example, to decrease the batch size to 4 and fix the number of epochs to 100, extend the command line above with the following parameters:
(otx) ...$ otx train ... --data.train_subset.batch_size 4 \
--max_epochs 100
from otx.config.data import SubsetConfig
from otx.data.module import OTXDataModule
from otx.backend.native.engine import OTXEngine
datamodule = OTXDataModule(..., train_subset=SubsetConfig(..., batch_size=4))
engine = OTXEngine(..., datamodule=datamodule)
engine.train(max_epochs=100)
5. The resulting checkpoints/*.ckpt files are located in the {work_dir} folder, while the training logs can be found in the {work_dir}/{timestamp} directory.
Note
We can also visualize the training using TensorBoard, as the logs are located in {work_dir}/{timestamp}/tensorboard.
otx-workspace
├── 20240403_134256/
│   ├── csv/
│   ├── checkpoints/
│   │   └── epoch_*.ckpt
│   ├── tensorboard/
│   └── configs.yaml
└── .latest
    └── train/
...
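If you do not want to hard-code the timestamped folder name, the latest artifacts can be located programmatically from the workspace layout above. This is a small sketch using pathlib only; it assumes the .latest/train soft-link created by the CLI points at the most recent training run.
from pathlib import Path

work_dir = Path("otx-workspace")

# .latest/train is assumed to soft-link to the most recent training run.
latest_train = (work_dir / ".latest" / "train").resolve()

# Zero-padded epoch numbers make the lexicographic sort pick the last checkpoint.
checkpoints = sorted((latest_train / "checkpoints").glob("epoch_*.ckpt"))
print("latest run:       ", latest_train)
print("last checkpoint:  ", checkpoints[-1] if checkpoints else None)
print("tensorboard logs: ", latest_train / "tensorboard")
TensorBoard can then be pointed at that directory, for example with tensorboard --logdir otx-workspace/20240403_134256/tensorboard.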
The training time depends heavily on the hardware; for example, on a single NVIDIA GeForce RTX 3090 the training took about 3 minutes.
After that, we have a PyTorch object detection model trained with OpenVINO™ Training Extensions, which we can use for evaluation, export, optimization, and deployment.
6. It is also possible to resume training from the last checkpoint. To do this, pass the path to the checkpoint file via the --checkpoint parameter and set --resume to True.
(otx) ...$ otx train --config src/otx/recipe/detection/atss_mobilenetv2.yaml \
                     --data_root data/wgisd \
                     --checkpoint otx-workspace/20240403_134256/checkpoints/epoch_014.ckpt \
                     --resume True
engine.train(resume=True,
checkpoint="otx-workspace/20240403_134256/checkpoints/epoch_014.ckpt")
Evaluation#
1. otx test runs the evaluation of a trained model on a particular dataset. The test function receives the test annotation information and the model snapshot trained in the previous step. The default metric is mAP_50.
2. Here is how we can evaluate the snapshot in the otx-workspace folder on the WGISD dataset and save the results to otx-workspace:
(otx) ...$ otx test --work_dir otx-workspace
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Test metric ┃ DataLoader 0 ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│ test/data_time │ 0.025369757786393166 │
│ test/map_50 │ 0.8693901896476746 │
│ test/iter_time │ 0.08180806040763855 │
└───────────────────────────┴───────────────────────────┘
(otx) ...$ otx test --config src/otx/recipe/detection/atss_mobilenetv2.yaml \
--data_root data/wgisd \
--checkpoint otx-workspace/20240312_051135/checkpoints/epoch_033.ckpt
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Test metric ┃ DataLoader 0 ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│ test/data_time │ 0.025369757786393166 │
│ test/map_50 │ 0.8693901896476746 │
│ test/iter_time │ 0.08180806040763855 │
└───────────────────────────┴───────────────────────────┘
engine.test()
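If you want to evaluate a specific snapshot from the Python API instead of the workspace default, you can pass it explicitly. This is a sketch under the assumption that engine.test() accepts a checkpoint argument mirroring the CLI --checkpoint option:
# Assumed to mirror the CLI --checkpoint option; adjust the path to your run.
engine.test(checkpoint="otx-workspace/20240312_051135/checkpoints/epoch_033.ckpt")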
3. The output file {work_dir}/{timestamp}/csv/version_0/metrics.csv contains the target metric names and their values.
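A quick way to read that file back is sketched below, using the standard csv module and assuming a CSV layout where each metric is a column (matching the metric names printed by otx test above); replace the timestamp with the one from your test run.
import csv

# Path format: {work_dir}/{timestamp}/csv/version_0/metrics.csv; use your test run's timestamp.
metrics_path = "otx-workspace/20240403_134256/csv/version_0/metrics.csv"

with open(metrics_path) as f:
    rows = list(csv.DictReader(f))

# Print the metric of interest from the last logged row.
last = rows[-1]
print("test/map_50:", last.get("test/map_50"))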
The next tutorial on how to export, optimize, and deploy the model is available at Deploy & Demo.