kprokofi
diff --git a/‎README.md
+7-5 b/‎README.md
diff --git a/‎compute_mean_std.py
+14-23 b/‎compute_mean_std.py
diff --git a/‎configs/config.py
+3-4 b/‎configs/config.py
diff --git a/‎configs/config_large_075.py
+76 b/‎configs/config_large_075.py
diff --git a/‎configs/config_small.py
+76 b/‎configs/config_small.py
diff --git a/‎configs/config_small_075.py
+76 b/‎configs/config_small_075.py
diff --git a/‎conversion_checker.py
+13-22 b/‎conversion_checker.py
@@ -2,6 +2,8 @@
 Towards solving the anti-spoofing problem on RGB-only data.
 ## Introduction
 This repository contains a training and evaluation pipeline with different regularization methods for a face anti-spoofing network. There are a few models available for training purposes, based on MobileNetv2 (MN2) and MobileNetv3 (MN3). The project natively supports three datasets: [CelebA Spoof](https://github.com/Davidzhangyuanhan/CelebA-Spoof), [LCC FASD](https://csit.am/2019/proceedings/PRIP/PRIP3.pdf), [CASIA-SURF CeFA](https://arxiv.org/pdf/2003.05136.pdf). You may also train or validate with your own data. The final model is based on MN3 and trained on the CelebA Spoof dataset. The model has 3.72 times fewer parameters and 24.3 times fewer GFlops than AENET from the original paper, and at the same time MN3 generalizes better across domains. The code contains a demo that you can launch in real time with your webcam or on a provided video. You can check out a short video on how it works on [Google Drive](https://drive.google.com/drive/u/0/folders/1A6wa3AlrdjyNPkXT81knIzXxR7SAYm1q). The code also supports conversion to the ONNX format.
+You can follow the links to the configuration files with smaller models to train them as-is and obtain metrics below.
+
 | model name | dataset | AUC | EER% | APCER% | BPCER% | ACER% | MParam | GFlops | Link to snapshot |
 | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
 | MN3_large |CelebA-Spoof| 0.998 | 2.26 | 0.69 | 6.92 | 3.8 | 3.02 |  0.15 | [snapshot](https://drive.google.com/drive/u/0/folders/1A6wa3AlrdjyNPkXT81knIzXxR7SAYm1q) |
@@ -70,11 +72,11 @@ The script for training and inference uses a configuration file. This is [defaul
 * **resize** - resize of the image
 * **checkpoint** - the name of the checkpoint to save and the path to the experiment folder where checkpoint, tensorboard logs and eval metrics will be kept
 * **loss** - two losses are available: `amsoftmax` with `cos`, `arcos`, `cross_entropy` margins and `soft_triple` with a different number of inner classes. For more details about the soft triple loss, see the [paper](https://arxiv.org/pdf/1909.05235.pdf)
-* **loss.amsoftmax.ratio**a  - there is availability to use different m for different classes. The ratio is the weights on which provided `m` will be divided for a specific class. For example ratio = [1,2] means that m for the first class will equal to m, but for the second will equal to m/2
+* **loss.amsoftmax.ratio** - allows a different margin `m` for each class. The ratio lists the per-class divisors applied to the provided `m`. For example, ratio = [1,2] means that `m` for the first class equals m, while for the second class it equals m/2
 * **loss.amsoftmax.gamma** - if this constant differs from 0 then the focal loss will be switched on with the corresponding gamma
-* **For soft triple loss**: `Cn` - number of classes, `K` - number of proxies for each class, `tau` - parameter for regularisation number of proxies
-* **model** - there are parameters concern model. `pretrained` means that you want to train with the imagenet weights (you can download weights from [google drive](https://drive.google.com/drive/u/0/folders/1A6wa3AlrdjyNPkXT81knIzXxR7SAYm1q) and specify the path to it in the `imagenet weights` parameter. **model_type** - type of the model, 'Mobilenet3' and 'Mobilenet2' are available. **size** param means the size of the mobilenetv3, there are 'large' and 'small' options. Note that this will change mobilenev3 only. **embeding_dim** - the size of the embeding (vector of features after average pooling). **width_mult** - the width scaling parameter of the model. Note, that you will need the appropriate imagenet weights if you want to train your model with transfer learning. On google drive weights with 0.75, 1.0 value of this parameter is available
-* **aug** - advanced augmentation, appropriate value for type is 'cutmix' or 'mixup. lambda = BetaDistribution(alpha, beta), cutmix_prob - probability of applying cutmix on image.
+* **For soft triple loss**: `cN` - number of classes, `K` - number of proxies for each class, `tau` - regularization parameter for the number of proxies
+* **model** - parameters concerning the model. `pretrained` means that you want to train with the ImageNet weights (you can download the weights from [google drive](https://drive.google.com/drive/u/0/folders/1A6wa3AlrdjyNPkXT81knIzXxR7SAYm1q) and specify the path to them in the `imagenet_weights` parameter). **model_type** - type of the model; 'Mobilenet3' and 'Mobilenet2' are available. **model_size** - the size of MobileNetV3; 'large' and 'small' options are available. Note that this affects MobileNetV3 only. **embeding_dim** - the size of the embedding (vector of features after average pooling). **width_mult** - the width scaling parameter of the model. Note that you will need the appropriate ImageNet weights if you want to train your model with transfer learning. Weights for the values 0.75 and 1.0 of this parameter are available on google drive
+* **aug** - advanced augmentations are available. You can specify `cutmix` or `mixup` and the appropriate parameters for them. `alpha` and `beta` are used for choosing `lambda` from the beta distribution; `aug_prob` is the probability of applying the augmentation to an image.
 * **curves** - you can specify the name of the curves, then set option `--draw_graph` to `True` when evaluating with eval_protocol.py script
 * **dropout** - `bernoulli` and `gaussian` dropouts are available with respective parameters
 * **data_parallel** - you can train your network on several GPUs
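
The **aug** parameters described above can be illustrated with a small, standalone sketch of mixup. This is not the repository's implementation: the function names (`sample_lambda`, `mixup_pair`, `maybe_mixup`) are hypothetical, images are represented as flat lists of floats for brevity, and it is an assumption that `lambda` is drawn once per pair of samples.

```python
import random


def sample_lambda(alpha=0.5, beta=0.5, rng=random):
    """Draw the mixing coefficient lambda from Beta(alpha, beta)."""
    return rng.betavariate(alpha, beta)


def mixup_pair(x1, x2, y1, y2, lam):
    """Blend two flattened images and their one-hot labels with weight lam."""
    mixed_x = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    mixed_y = [lam * a + (1.0 - lam) * b for a, b in zip(y1, y2)]
    return mixed_x, mixed_y


def maybe_mixup(x1, x2, y1, y2, alpha=0.5, beta=0.5, aug_prob=0.7, rng=random):
    """Apply mixup with probability aug_prob; otherwise return the first sample unchanged."""
    if rng.random() >= aug_prob:
        # Augmentation skipped for this sample.
        return list(x1), list(y1)
    lam = sample_lambda(alpha, beta, rng)
    return mixup_pair(x1, x2, y1, y2, lam)
```

With `alpha = beta = 0.5`, as in the default config, the beta distribution concentrates near 0 and 1, so most mixed samples stay close to one of the two originals.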
@@ -120,7 +122,7 @@ You will see the mean difference (L1 metric distance) on the first and second pr
 
 ## Demo
 ![demo.png](./demo/demo.png)
-To start demo you need to [download] OpenVINO™ face detector model. Concretely, you will need `face-detection-0100` version. 
+To start the demo you need to [download] the OpenVINO™ face detector model. Specifically, you will need the `face-detection-0100` version.
 On [google drive](https://drive.google.com/drive/u/0/folders/1A6wa3AlrdjyNPkXT81knIzXxR7SAYm1q) you will see a trained antispoofing model that you can download and run, or choose your own trained model. Use OpenVINO™ format to obtain the best performance speed, but PyTorch format will work as well.
 
 After preparation start demo by running:
 
@@ -1,29 +1,20 @@
-'''MIT License
-
-Copyright (C) 2020 Prokofiev Kirill
-
-Permission is hereby granted, free of charge, to any person obtaining a copy
-of this software and associated documentation files (the "Software"),
-to deal in the Software without restriction, including without limitation
-the rights to use, copy, modify, merge, publish, distribute, sublicense,
-and/or sell copies of the Software, and to permit persons to whom
-the Software is furnished to do so, subject to the following conditions:
-
-The above copyright notice and this permission notice shall be included
-in all copies or substantial portions of the Software.
-
-THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS
-OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL
-THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES
-OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
-ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE
-OR OTHER DEALINGS IN THE SOFTWARE.'''
+"""
+ Copyright (c) 2020 Intel Corporation
+ Licensed under the Apache License, Version 2.0 (the "License");
+ you may not use this file except in compliance with the License.
+ You may obtain a copy of the License at
+      http://www.apache.org/licenses/LICENSE-2.0
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+"""
 
 import argparse
 
 import albumentations as A
-import cv2
+import cv2 as cv
 import torch
 from torch.utils.data import DataLoader
 from tqdm import tqdm
@@ -41,7 +32,7 @@ def main():
     args = parser.parse_args()
     # transform image
     transforms = A.Compose([
-                                A.Resize(*args.img_size, interpolation=cv2.INTER_CUBIC),
+                                A.Resize(*args.img_size, interpolation=cv.INTER_CUBIC),
                                 A.Normalize(mean=[0, 0, 0], std=[1, 1, 1])
                                 ])
     root_folder = args.root
 
@@ -21,7 +21,7 @@
 
 scheduler = dict(milestones=[20,40], gamma=0.2)
 
-data = dict( batch_size=256,
+data = dict(batch_size=256,
             data_loader_workers=4,
             sampler=None,
             pin_memory=True)
@@ -39,7 +39,6 @@
                            smoothing=0.1,
                            ratio=[1,1],
                            gamma=0),
-
             soft_triple=dict(cN=2, K=10, s=1, tau=.2, m=0.35))
 
 epochs = dict(start_epoch=0, max_epoch=71)
@@ -54,13 +53,13 @@
 aug = dict(type_aug=None,
             alpha=0.5,
             beta=0.5,
-            cutmix_prob=0.7)
+            aug_prob=0.7)
 
 curves = dict(det_curve='det_curve_0.png',
               roc_curve='roc_curve_0.png')
 
 dropout = dict(prob_dropout=0.1,
-               classifier=0.5,
+               classifier=0.35,
                type='bernoulli',
                mu=0.5,
                sigma=0.3)
 
@@ -0,0 +1,76 @@
+exp_num = 0
+
+dataset = 'celeba_spoof'
+
+multi_task_learning = True
+
+evaluation = True
+
+test_steps = None
+
+datasets = dict(LCCFASD_root='./LCC_FASDcropped',
+                Celeba_root='./CelebA_Spoof',
+                Casia_root='./CASIA')
+
+external = dict(train=dict(), val=dict(), test=dict())
+
+img_norm_cfg = dict(mean=[0.5931, 0.4690, 0.4229],
+                    std=[0.2471, 0.2214, 0.2157])
+
+optimizer = dict(lr=0.005, momentum=0.9, weight_decay=5e-4)
+
+scheduler = dict(milestones=[20,50], gamma=0.2)
+
+data = dict(batch_size=256,
+            data_loader_workers=4,
+            sampler=None,
+            pin_memory=True)
+
+resize = dict(height=128, width=128)
+
+checkpoint = dict(snapshot_name="MobileNet3.pth.tar",
+                  experiment_path='./logs')
+
+loss = dict(loss_type='amsoftmax',
+            amsoftmax=dict(m=0.5,
+                           s=1,
+                           margin_type='cross_entropy',
+                           label_smooth=False,
+                           smoothing=0.1,
+                           ratio=[1,1],
+                           gamma=0),
+            soft_triple=dict(cN=2, K=10, s=1, tau=.2, m=0.35))
+
+epochs = dict(start_epoch=0, max_epoch=71)
+
+model = dict(model_type='Mobilenet3',
+             model_size='large',
+             width_mult=0.75,
+             pretrained=True,
+             embeding_dim=1280,
+             imagenet_weights='./pretrained/mobilenetv3-large-0.75-9632d2a8.pth')
+
+aug = dict(type_aug=None,
+            alpha=0.5,
+            beta=0.5,
+            aug_prob=0.7)
+
+curves = dict(det_curve='det_curve_0.png',
+              roc_curve='roc_curve_0.png')
+
+dropout = dict(prob_dropout=0.1,
+               classifier=0.3,
+               type='bernoulli',
+               mu=0.5,
+               sigma=0.3)
+
+data_parallel = dict(use_parallel=False,
+                     parallel_params=dict(device_ids=[0,1], output_device=0))
+
+RSC = dict(use_rsc=False,
+           p=0.333,
+           b=0.333)
+
+test_dataset = dict(type='LCC_FASD')
+
+conv_cd = dict(theta=0)
@@ -0,0 +1,76 @@
+exp_num = 0
+
+dataset = 'celeba_spoof'
+
+multi_task_learning = True
+
+evaluation = True
+
+test_steps = None
+
+datasets = dict(LCCFASD_root='./LCC_FASDcropped',
+                Celeba_root='./CelebA_Spoof',
+                Casia_root='./CASIA')
+
+external = dict(train=dict(), val=dict(), test=dict())
+
+img_norm_cfg = dict(mean=[0.5931, 0.4690, 0.4229],
+                    std=[0.2471, 0.2214, 0.2157])
+
+optimizer = dict(lr=0.005, momentum=0.9, weight_decay=5e-4)
+
+scheduler = dict(milestones=[20,50], gamma=0.2)
+
+data = dict(batch_size=256,
+            data_loader_workers=4,
+            sampler=None,
+            pin_memory=True)
+
+resize = dict(height=128, width=128)
+
+checkpoint = dict(snapshot_name="MobileNet3.pth.tar",
+                  experiment_path='./logs')
+
+loss = dict(loss_type='amsoftmax',
+            amsoftmax=dict(m=0.5,
+                           s=1,
+                           margin_type='cross_entropy',
+                           label_smooth=False,
+                           smoothing=0.1,
+                           ratio=[1,1],
+                           gamma=0),
+            soft_triple=dict(cN=2, K=10, s=1, tau=.2, m=0.35))
+
+epochs = dict(start_epoch=0, max_epoch=71)
+
+model = dict(model_type='Mobilenet3',
+             model_size='small',
+             width_mult=1.0,
+             pretrained=True,
+             embeding_dim=1024,
+             imagenet_weights='./pretrained/mobilenetv3-small-55df8e1f.pth')
+
+aug = dict(type_aug=None,
+            alpha=0.5,
+            beta=0.5,
+            aug_prob=0.7)
+
+curves = dict(det_curve='det_curve_0.png',
+              roc_curve='roc_curve_0.png')
+
+dropout = dict(prob_dropout=0.1,
+               classifier=0.1,
+               type='bernoulli',
+               mu=0.5,
+               sigma=0.3)
+
+data_parallel = dict(use_parallel=False,
+                     parallel_params=dict(device_ids=[0,1], output_device=0))
+
+RSC = dict(use_rsc=False,
+           p=0.333,
+           b=0.333)
+
+test_dataset = dict(type='LCC_FASD')
+
+conv_cd = dict(theta=0)
@@ -0,0 +1,76 @@
+exp_num = 0
+
+dataset = 'celeba_spoof'
+
+multi_task_learning = True
+
+evaluation = True
+
+test_steps = None
+
+datasets = dict(LCCFASD_root='./LCC_FASDcropped',
+                Celeba_root='./CelebA_Spoof',
+                Casia_root='./CASIA')
+
+external = dict(train=dict(), val=dict(), test=dict())
+
+img_norm_cfg = dict(mean=[0.5931, 0.4690, 0.4229],
+                    std=[0.2471, 0.2214, 0.2157])
+
+optimizer = dict(lr=0.005, momentum=0.9, weight_decay=5e-4)
+
+scheduler = dict(milestones=[20,50], gamma=0.2)
+
+data = dict(batch_size=256,
+            data_loader_workers=4,
+            sampler=None,
+            pin_memory=True)
+
+resize = dict(height=128, width=128)
+
+checkpoint = dict(snapshot_name="MobileNet3.pth.tar",
+                  experiment_path='./logs')
+
+loss = dict(loss_type='amsoftmax',
+            amsoftmax=dict(m=0.5,
+                           s=1,
+                           margin_type='cross_entropy',
+                           label_smooth=False,
+                           smoothing=0.1,
+                           ratio=[1,1],
+                           gamma=0),
+            soft_triple=dict(cN=2, K=10, s=1, tau=.2, m=0.35))
+
+epochs = dict(start_epoch=0, max_epoch=71)
+
+model = dict(model_type='Mobilenet3',
+             model_size='small',
+             width_mult=0.75,
+             pretrained=True,
+             embeding_dim=1024,
+             imagenet_weights='./pretrained/mobilenetv3-small-0.75-86c972c3.pth')
+
+aug = dict(type_aug=None,
+            alpha=0.5,
+            beta=0.5,
+            aug_prob=0.7)
+
+curves = dict(det_curve='det_curve_0.png',
+              roc_curve='roc_curve_0.png')
+
+dropout = dict(prob_dropout=0.1,
+               classifier=0.35,
+               type='bernoulli',
+               mu=0.5,
+               sigma=0.3)
+
+data_parallel = dict(use_parallel=False,
+                     parallel_params=dict(device_ids=[0,1], output_device=0))
+
+RSC = dict(use_rsc=False,
+           p=0.333,
+           b=0.333)
+
+test_dataset = dict(type='LCC_FASD')
+
+conv_cd = dict(theta=0)
@@ -1,24 +1,15 @@
-'''MIT License
-
-Copyright (C) 2020 Prokofiev Kirill
-
-Permission is hereby granted, free of charge, to any person obtaining a copy
-of this software and associated documentation files (the "Software"),
-to deal in the Software without restriction, including without limitation
-the rights to use, copy, modify, merge, publish, distribute, sublicense,
-and/or sell copies of the Software, and to permit persons to whom
-the Software is furnished to do so, subject to the following conditions:
-
-The above copyright notice and this permission notice shall be included
-in all copies or substantial portions of the Software.
-
-THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS
-OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL
-THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES
-OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE,
-ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE
-OR OTHER DEALINGS IN THE SOFTWARE.'''
+"""
+ Copyright (c) 2020 Intel Corporation
+ Licensed under the Apache License, Version 2.0 (the "License");
+ you may not use this file except in compliance with the License.
+ You may obtain a copy of the License at
+      http://www.apache.org/licenses/LICENSE-2.0
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+"""
 
 import argparse
 import inspect
@@ -67,7 +58,7 @@ def pred_spoof(batch, spoof_model_torch, spoof_model_openvino):
     return output1, output2
 
 def check_accuracy(torch_pred, openvino_pred):
-    diff = abs(np.array(openvino_pred) - np.array(torch_pred))
+    diff = np.abs(openvino_pred - torch_pred)
     avg = diff.mean(axis=0)
     return avg
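
Note that the new `np.abs(openvino_pred - torch_pred)` relies on both arguments already being NumPy arrays, since plain Python lists do not support `-`. The diff does not show what `pred_spoof` returns, so the following is only a defensive sketch, assuming the predictions might arrive as nested lists; the `np.asarray` conversion is an addition that is not in the commit.

```python
import numpy as np


def check_accuracy(torch_pred, openvino_pred):
    """Return the per-column mean absolute (L1) difference between two prediction sets."""
    # np.asarray accepts lists or arrays (assumption: inputs may be either),
    # so the subtraction below is elementwise in both cases.
    torch_arr = np.asarray(torch_pred, dtype=np.float64)
    openvino_arr = np.asarray(openvino_pred, dtype=np.float64)
    diff = np.abs(openvino_arr - torch_arr)
    # Average over the batch axis, leaving one value per output column.
    return diff.mean(axis=0)
```

If both inputs are guaranteed to be arrays, the shorter form in the commit behaves the same and skips the conversion.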