Skip to content

Commit 02a599f

Browse files
author
Anna Grebneva
authored
Updated input/output description of intel models (openvinotoolkit#3425)
1 parent 544689f commit 02a599f

File tree

54 files changed

+88
-87
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

54 files changed

+88
-87
lines changed

models/intel/age-gender-recognition-retail-0013/README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ applicable for children since their faces were not in the training set.
3838

3939
## Inputs
4040

41-
Image, name: `input`, shape: `1, 3, 62, 62` in `1, C, H, W` format, where:
41+
Image, name: `data`, shape: `1, 3, 62, 62` in `1, C, H, W` format, where:
4242

4343
- `C` - number of channels
4444
- `H` - image height
@@ -48,7 +48,7 @@ Expected color order is `BGR`.
4848

4949
## Outputs
5050

51-
1. Name: `age_conv3`, shape: `1, 1, 1, 1` - Estimated age divided by 100.
51+
1. Name: `fc3_a`, shape: `1, 1, 1, 1` - Estimated age divided by 100.
5252
2. Name: `prob`, shape: `1, 2, 1, 1` - Softmax output across 2 type classes [0 - female, 1 - male].
5353

5454
## Demo usage

models/intel/face-detection-0200/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2626

2727
## Inputs
2828

29-
Image, name: `input`, shape: `1, 3, 256, 256` in the format `B, C, H, W`, where:
29+
Image, name: `image`, shape: `1, 3, 256, 256` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels

models/intel/face-detection-0202/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2626

2727
## Inputs
2828

29-
Image, name: `input`, shape: `1, 3, 384, 384` in the format `B, C, H, W`, where:
29+
Image, name: `image`, shape: `1, 3, 384, 384` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels

models/intel/face-detection-0204/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2626

2727
## Inputs
2828

29-
Image, name: `input`, shape: `1, 3, 448, 448` in the format `B, C, H, W`, where:
29+
Image, name: `image`, shape: `1, 3, 448, 448` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels

models/intel/face-detection-0205/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2525

2626
## Inputs
2727

28-
Image, name: `input`, shape: `1, 3, 416, 416` in the format `B, C, H, W`, where:
28+
Image, name: `image`, shape: `1, 3, 416, 416` in the format `B, C, H, W`, where:
2929

3030
- `B` - batch size
3131
- `C` - number of channels

models/intel/face-detection-0206/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2525

2626
## Inputs
2727

28-
Image, name: `input`, shape: `1, 3, 640, 640` in the format `B, C, H, W`, where:
28+
Image, name: `image`, shape: `1, 3, 640, 640` in the format `B, C, H, W`, where:
2929

3030
- `B` - batch size
3131
- `C` - number of channels

models/intel/face-detection-adas-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ curve. Numbers are on
3030

3131
## Inputs
3232

33-
Image, name: `input`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
33+
Image, name: `data`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
3434

3535
- `B` - batch size
3636
- `C` - number of channels

models/intel/face-detection-retail-0004/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2727

2828
## Inputs
2929

30-
Image, name: `input`, shape: `1, 3, 300, 300` in the format `B, C, H, W`, where:
30+
Image, name: `data`, shape: `1, 3, 300, 300` in the format `B, C, H, W`, where:
3131

3232
- `B` - batch size
3333
- `C` - number of channels

models/intel/face-detection-retail-0005/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ curve. All numbers were evaluated by taking into account only faces bigger than
2626

2727
## Inputs
2828

29-
Image, name: `input`, shape: `1, 3, 300, 300` in the format `B, C, H, W`, where:
29+
Image, name: `input.1`, shape: `1, 3, 300, 300` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels

models/intel/face-reidentification-retail-0095/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ To align the face, use a landmarks regression model: using regressed points and
3434

3535
## Inputs
3636

37-
Input image, name: `data` , shape: `1, 3, 128, 128` in the format `B, C, H, W`, where:
37+
Input image, name: `0` , shape: `1, 3, 128, 128` in the format `B, C, H, W`, where:
3838

3939
- `B` - batch size
4040
- `C` - number of channels

models/intel/faster-rcnn-resnet101-coco-sparse-60-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ See Average Precision metric description at [COCO: Common Objects in Context](ht
2121

2222
## Inputs
2323

24-
Image, name: `input`, shape: `1, 800, 1280, 3` in the format `B, H, W, C`, where:
24+
Image, name: `image`, shape: `1, 800, 1280, 3` in the format `B, H, W, C`, where:
2525

2626
- `B` - batch size
2727
- `H` - image height

models/intel/head-pose-estimation-adas-0001/README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -46,9 +46,9 @@ Expected color order is `BGR`.
4646
Each output contains one float value that represents value in Tait-Bryan angles
4747
(yaw, pitch or roll).
4848

49-
1. name: `angle_y_fc`, shape: `1, 1` - Estimated yaw (in degrees).
50-
2. name: `angle_p_fc`, shape: `1, 1` - Estimated pitch (in degrees).
51-
3. name: `angle_r_fc`, shape: `1, 1` - Estimated roll (in degrees).
49+
1. name: `fc_y`, shape: `1, 1` - Estimated yaw (in degrees).
50+
2. name: `fc_p`, shape: `1, 1` - Estimated pitch (in degrees).
51+
3. name: `fc_r`, shape: `1, 1` - Estimated roll (in degrees).
5252

5353
## Demo usage
5454

models/intel/horizontal-text-detection-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ The key benefit of this model compared to the [base model](../text-detection-000
2121

2222
## Inputs
2323

24-
Image, name: `input`, shape: `1, 3, 704, 704` in the format `1, C, H, W`, where:
24+
Image, name: `image`, shape: `1, 3, 704, 704` in the format `1, C, H, W`, where:
2525

2626
- `C` - number of channels
2727
- `H` - image height

models/intel/human-pose-estimation-0005/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ Average Precision metric described in [COCO Keypoint Evaluation site](https://co
2323

2424
## Inputs
2525

26-
Image, name: `input`, shape: `1, 3, 288, 288` in the `B, C, H, W` format, where:
26+
Image, name: `image`, shape: `1, 3, 288, 288` in the `B, C, H, W` format, where:
2727

2828
- `B` - batch size
2929
- `C` - number of channels

models/intel/human-pose-estimation-0006/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ Average Precision metric described in [COCO Keypoint Evaluation site](https://co
2323

2424
## Inputs
2525

26-
Image, name: `input`, shape: `1, 3, 352, 352` in the `B, C, H, W` format, where:
26+
Image, name: `image`, shape: `1, 3, 352, 352` in the `B, C, H, W` format, where:
2727

2828
- `B` - batch size
2929
- `C` - number of channels

models/intel/human-pose-estimation-0007/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ Average Precision metric described in [COCO Keypoint Evaluation site](https://co
2323

2424
## Inputs
2525

26-
Image, name: `input`, shape: `1, 3, 448, 448` in the `B, C, H, W` format, where:
26+
Image, name: `image`, shape: `1, 3, 448, 448` in the `B, C, H, W` format, where:
2727

2828
- `B` - batch size
2929
- `C` - number of channels

models/intel/landmarks-regression-retail-0009/README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ where N is the number of landmarks, _p_-hat and _p_ are, correspondingly, the pr
2626

2727
## Inputs
2828

29-
Image, name: `data`, shape: `1, 3, 48, 48` in the format `B, C, H, W`, where:
29+
Image, name: `0`, shape: `1, 3, 48, 48` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels
@@ -37,7 +37,7 @@ The expected color order is `BGR`.
3737

3838
## Outputs
3939

40-
The net outputs a blob with the shape: `1, 10`, containing a row-vector of 10 floating point values
40+
The net outputs a blob with the shape: `1, 10, 1, 1`, containing a row-vector of 10 floating point values
4141
for five landmarks coordinates in the form (x0, y0, x1, y1, ..., x4, y4).
4242
All the coordinates are normalized to be in range [0, 1].
4343

models/intel/license-plate-recognition-barrier-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ thoroughly. Other types of license plates may underperform.
4646

4747
## Outputs
4848

49-
Encoded vector of floats, name: `decode`, shape: `1, 88, 1, 1`. Each float
49+
Encoded vector of floats, name: `dec`, shape: `1, 88, 1, 1`. Each float
5050
is an integer number encoding a character according to this dictionary:
5151

5252
0 0

models/intel/pedestrian-and-vehicle-detector-adas-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ Tested on challenging internal datasets with 1001 pedestrian and 12585 vehicles
2727

2828
## Inputs
2929

30-
Image, name: `input`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
30+
Image, name: `data`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
3131

3232
- `B` - batch size
3333
- `C` - number of channels

models/intel/pedestrian-detection-adas-0002/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ Tested on an internal dataset with 1001 pedestrian to detect.
2626

2727
## Inputs
2828

29-
Image, name: `input`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
29+
Image, name: `data`, shape: `1, 3, 384, 672` in the format `B, C, H, W`, where:
3030

3131
- `B` - batch size
3232
- `C` - number of channels

models/intel/person-attributes-recognition-crossroad-0230/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ This model presents a person attributes classification algorithm analysis scenar
3939

4040
## Inputs
4141

42-
Image, name: `input`, shape: `1, 3, 160, 80` in the format `1, C, H, W`, where:
42+
Image, name: `0`, shape: `1, 3, 160, 80` in the format `1, C, H, W`, where:
4343

4444
- `C` - number of channels
4545
- `H` - image height

models/intel/person-detection-0106/README.md

+5-3
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ curve.
2424

2525
## Inputs
2626

27-
Image, name: `input`, shape: `1, 3, 800, 1344` in the format `B, C, H, W`, where:
27+
Image, name: `image`, shape: `1, 3, 800, 1344` in the format `B, C, H, W`, where:
2828

2929
- `B` - batch size
3030
- `C` - number of channels
@@ -35,15 +35,17 @@ Expected color order is `BGR`.
3535

3636
## Outputs
3737

38-
1. The `boxes` is a blob with the shape `100, 5` in the format `N, 5`, where `N` is the number of detected
38+
Model has outputs with dynamic shapes.
39+
40+
1. The `boxes` is a blob with the shape `-1, 5` in the format `N, 5`, where `N` is the number of detected
3941
bounding boxes. For each detection, the description has the format
4042
[`x_min`, `y_min`, `x_max`, `y_max`, `conf`], where:
4143

4244
- (`x_min`, `y_min`) - coordinates of the top left bounding box corner
4345
- (`x_max`, `y_max`) - coordinates of the bottom right bounding box corner
4446
- `conf` - confidence for the predicted class
4547

46-
2. The `labels` is a blob with the shape `100` in the format `N`, where `N` is the number of detected
48+
2. The `labels` is a blob with the shape `-1` in the format `N`, where `N` is the number of detected
4749
bounding boxes. It contains predicted class ID (0 - person) per each detected box.
4850

4951
## Demo usage

models/intel/person-detection-0200/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ curve.
2525

2626
## Inputs
2727

28-
Image, name: `input`, shape: `1, 3, 256, 256` in the format `B, C, H, W`, where:
28+
Image, name: `image`, shape: `1, 3, 256, 256` in the format `B, C, H, W`, where:
2929

3030
- `B` - batch size
3131
- `C` - number of channels

models/intel/person-detection-0201/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ curve.
2525

2626
## Inputs
2727

28-
Image, name: `input`, shape: `1, 3, 384, 384` in the format `B, C, H, W`, where:
28+
Image, name: `image`, shape: `1, 3, 384, 384` in the format `B, C, H, W`, where:
2929

3030
- `B` - batch size
3131
- `C` - number of channels

models/intel/person-detection-0202/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ curve.
2525

2626
## Inputs
2727

28-
Image, name: `input`, shape: `1, 3, 512, 512` in the format `B, C, H, W`, where:
28+
Image, name: `image`, shape: `1, 3, 512, 512` in the format `B, C, H, W`, where:
2929

3030
- `B` - batch size
3131
- `C` - number of channels

models/intel/person-detection-0203/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ curve.
2424

2525
## Inputs
2626

27-
Image, name: `input`, shape: `1, 3, 480, 864` in the format `B, C, H, W`, where:
27+
Image, name: `image`, shape: `1, 3, 480, 864` in the format `B, C, H, W`, where:
2828

2929
- `B` - batch size
3030
- `C` - number of channels

models/intel/person-detection-0301/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ curve.
2424

2525
## Inputs
2626

27-
Image, name: `input`, shape: `1, 3, 800, 1344` in the format `B, C, H, W`, where:
27+
Image, name: `image`, shape: `1, 3, 800, 1344` in the format `B, C, H, W`, where:
2828

2929
- `B` - batch size
3030
- `C` - number of channels

models/intel/person-detection-0302/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ curve.
2424

2525
## Inputs
2626

27-
Image, name: `input`, shape: `1, 3, 720, 1280` in the format `B, C, H, W`, where:
27+
Image, name: `image`, shape: `1, 3, 720, 1280` in the format `B, C, H, W`, where:
2828

2929
- `B` - batch size
3030
- `C` - number of channels

models/intel/person-detection-0303/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ curve.
2424

2525
## Inputs
2626

27-
Image, name: `input`, shape: `1, 3, 720, 1280` in the format `B, C, H, W`, where:
27+
Image, name: `image`, shape: `1, 3, 720, 1280` in the format `B, C, H, W`, where:
2828

2929
- `B` - batch size
3030
- `C` - number of channels

models/intel/person-detection-action-recognition-0005/README.md

+5-5
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ curve.
2727

2828
## Inputs
2929

30-
Image, name: `input`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
30+
Image, name: `data`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
3131

3232
- `B` - batch size
3333
- `C` - number of channels
@@ -43,10 +43,10 @@ The net outputs four branches:
4343
1. name: `mbox_loc1/out/conv/flat`, shape: `b, num_priors*4` - Box coordinates in SSD format
4444
2. name: `mbox_main_conf/out/conv/flat/softmax/flat`, shape: `b, num_priors*2` - Detection confidences
4545
3. name: `mbox/priorbox`, shape: `1, 2, num_priors*4` - Prior boxes in SSD format
46-
4. name: `out/anchor1`, shape: `b, 3, h, w` - Action confidences
47-
5. name: `out/anchor2`, shape: `b, 3, h, w` - Action confidences
48-
6. name: `out/anchor3`, shape: `b, 3, h, w` - Action confidences
49-
7. name: `out/anchor4`, shape: `b, 3, h, w` - Action confidences
46+
4. name: `out/anchor1`, shape: `b, h, w, 3` - Action confidences
47+
5. name: `out/anchor2`, shape: `b, h, w, 3` - Action confidences
48+
6. name: `out/anchor3`, shape: `b, h, w, 3` - Action confidences
49+
7. name: `out/anchor4`, shape: `b, h, w, 3` - Action confidences
5050

5151
Where:
5252

models/intel/person-detection-action-recognition-teacher-0002/README.md

+5-5
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ curve.
2727

2828
## Inputs
2929

30-
Image, name: `input`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
30+
Image, name: `data`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
3131

3232
- `B` - batch size
3333
- `C` - number of channels
@@ -43,10 +43,10 @@ The net outputs four branches:
4343
1. name: `mbox_loc1/out/conv/flat`, shape: `b, num_priors*4` - Box coordinates in SSD format
4444
2. name: `mbox_main_conf/out/conv/flat/softmax/flat`, shape: `b, num_priors*2` - Detection confidences
4545
3. name: `mbox/priorbox`, shape: `1, 2, num_priors*4` - Prior boxes in SSD format
46-
4. name: `out/anchor1`, shape: `b, 3, h, w` - Action confidences
47-
5. name: `out/anchor2`, shape: `b, 3, h, w` - Action confidences
48-
6. name: `out/anchor3`, shape: `b, 3, h, w` - Action confidences
49-
7. name: `out/anchor4`, shape: `b, 3, h, w` - Action confidences
46+
4. name: `out/anchor1`, shape: `b, h, w, 3` - Action confidences
47+
5. name: `out/anchor2`, shape: `b, h, w, 3` - Action confidences
48+
6. name: `out/anchor3`, shape: `b, h, w, 3` - Action confidences
49+
7. name: `out/anchor4`, shape: `b, h, w, 3` - Action confidences
5050

5151
Where:
5252

models/intel/person-detection-asl-0001/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ Average Precision (AP) is defined as an area under the [precision/recall](https:
2222

2323
## Inputs
2424

25-
Image, name: `input`, shape: `1, 3, 320, 320` in the format `1, C, H, W`, where:
25+
Image, name: `image`, shape: `1, 3, 320, 320` in the format `1, C, H, W`, where:
2626

2727
- `C` - number of channels
2828
- `H` - image height

models/intel/person-detection-raisinghand-recognition-0001/README.md

+5-5
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ curve.
2727

2828
## Inputs
2929

30-
Image, name: `input`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
30+
Image, name: `data`, shape: `1, 3, 400, 680` in the format `B, C, H, W`, where:
3131

3232
- `B` - batch size
3333
- `C` - number of channels
@@ -43,10 +43,10 @@ The net outputs four branches:
4343
1. name: `mbox_loc1/out/conv/flat`, shape: `b, num_priors*4` - Box coordinates in SSD format
4444
2. name: `mbox_main_conf/out/conv/flat/softmax/flat`, shape: `b, num_priors*2` - Detection confidences
4545
3. name: `mbox/priorbox`, shape: `1, 2, num_priors*4` - Prior boxes in SSD format
46-
4. name: `out/anchor1`, shape: `b, 2, h, w` - Action confidences
47-
5. name: `out/anchor2`, shape: `b, 2, h, w` - Action confidences
48-
6. name: `out/anchor3`, shape: `b, 2, h, w` - Action confidences
49-
7. name: `out/anchor4`, shape: `b, 2, h, w` - Action confidences
46+
4. name: `out/anchor1`, shape: `b, h, w, 2` - Action confidences
47+
5. name: `out/anchor2`, shape: `b, h, w, 2` - Action confidences
48+
6. name: `out/anchor3`, shape: `b, h, w, 2` - Action confidences
49+
7. name: `out/anchor4`, shape: `b, h, w, 2` - Action confidences
5050

5151
Where:
5252

models/intel/person-detection-retail-0002/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ curve. Validation dataset consists of ~50K of images from ~100 different scenes.
3636

3737
The expected channel order is `BGR`.
3838

39-
2. name: `im_info`, shape: `1x6` - An image information
39+
2. name: `im_info`, shape: `1, 6` - An image information
4040
[544, 992, 992/`frame_width`, 544/`frame_height`, 992/`frame_width`, 544/`frame_height`]
4141

4242
## Outputs

0 commit comments

Comments
 (0)