This is a step-by-step guide on how to deploy OpenVINO™ Model Server on Linux, using Docker.
Before you start, make sure you have:
- Docker Engine installed
- Intel® Core™ processor (6-13th gen.) or Intel® Xeon® processor (1st to 4th gen.)
- Linux, macOS or Windows via WSL
- (optional) AI accelerators supported by OpenVINO. Accelerators are tested only on bare-metal Linux hosts.
This example shows how to launch the model server with a ResNet50 image classification model downloaded from cloud storage:
Pull an image from Docker Hub:

```bash
docker pull openvino/model_server:latest
```

or from the Red Hat Ecosystem Catalog:

```bash
docker pull registry.connect.redhat.com/intel/openvino-model-server:latest
```

Download the model files into a `models` directory:

```bash
wget https://storage.openvinotoolkit.org/repositories/open_model_zoo/2022.1/models_bin/2/resnet50-binary-0001/FP32-INT1/resnet50-binary-0001.{xml,bin} -P models/resnet50/1
```
Start the container serving the model:

```bash
docker run -u $(id -u) -v $(pwd)/models:/models -p 9000:9000 openvino/model_server:latest \
--model_name resnet --model_path /models/resnet50 \
--layout NHWC:NCHW --port 9000
```
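The `--layout NHWC:NCHW` flag tells the server to accept input in NHWC order (batch, height, width, channels) while the model itself expects NCHW. A minimal numpy sketch of that axis reordering, for illustration only (the server performs this internally):

```python
import numpy as np

# Hypothetical 224x224 RGB image batch in NHWC order (batch, height, width, channels)
nhwc = np.zeros((1, 224, 224, 3), dtype=np.float32)

# Transpose to NCHW order (batch, channels, height, width), as the model expects
nchw = nhwc.transpose(0, 3, 1, 2)
print(nchw.shape)  # (1, 3, 224, 224)
```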
Download a sample image and the ImageNet class labels, then install the Python client:

```bash
wget https://raw.githubusercontent.com/openvinotoolkit/model_server/main/demos/common/static/images/zebra.jpeg
wget https://raw.githubusercontent.com/openvinotoolkit/model_server/main/demos/common/python/classes.py
pip3 install ovmsclient
```
Create a `predict.py` script with the following content:

```python
import numpy as np
from classes import imagenet_classes
from ovmsclient import make_grpc_client

client = make_grpc_client("localhost:9000")

with open("zebra.jpeg", "rb") as f:
    img = f.read()

output = client.predict({"0": img}, "resnet")
result_index = np.argmax(output[0])
print(imagenet_classes[result_index])
```
Run the script:

```bash
python predict.py
```

If everything is set up correctly, you will see the 'zebra' prediction in the output:

```
zebra
```
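The argmax decoding in `predict.py` can be illustrated without a running server. Assuming the model returns a (1, 1000) score array, the top class index is recovered like this (mock data standing in for the server response):

```python
import numpy as np

# Mock (1, 1000) score vector standing in for the server response;
# class 340 ("zebra" in the ImageNet labels) is given the highest score.
output = np.zeros((1, 1000), dtype=np.float32)
output[0, 340] = 0.99

# Same decoding step as in predict.py
result_index = np.argmax(output[0])
print(result_index)  # 340
```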
If you want to try out features that have not been released yet, you can build the image from the source code yourself:
```bash
git clone https://github.com/openvinotoolkit/model_server.git
cd model_server
make release_image GPU=1
```
It will create an image named `openvino/model_server:latest`.

Note: This operation might take 40 minutes or more, depending on your build host.

Note: The `GPU=1` parameter in the image build command is needed to include the dependencies for the GPU device.

Note: The public image from the last release might not be compatible with models exported using the latest export script. We recommend using the export script and the Docker image from the same release to avoid compatibility issues.