Skip to content

Commit a369eff

Browse files
committed
add more details in sagemaker model for batch inference
Signed-off-by: Xun Zhang <xunzh@amazon.com>
1 parent 8ca3495 commit a369eff

File tree

1 file changed

+12
-0
lines changed

1 file changed

+12
-0
lines changed

docs/remote_inference_blueprints/batch_inference_sagemaker_connector_blueprint.md

+12
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,18 @@ Read more details on https://opensearch.org/docs/latest/ml-commons-plugin/remote
55
Integrate the SageMaker Batch Transform API using the connector below with a new action type "batch_predict".
66
For more details to use batch transform to run inference with Amazon SageMaker, please refer to https://docs.aws.amazon.com/sagemaker/latest/dg/batch-transform.html.
77

8+
SageMaker uses your pre-created model to execute the batch transform job. For creating your model in SageMaker
9+
that supports batch transform, please refer to https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateModel.html. In this example, the following primary
10+
container is used to create the text-embedding DJL model in SageMaker.
11+
```json
12+
"ModelName": "DJL-Text-Embedding-Model-imageforjsonlines",
13+
"PrimaryContainer": {
14+
"Environment": {
15+
"SERVING_LOAD_MODELS" : "djl://ai.djl.huggingface.pytorch/sentence-transformers/all-MiniLM-L6-v2"
16+
},
17+
"Image": "763104351884.dkr.ecr.us-east-1.amazonaws.com/djl-inference:0.22.1-cpu-full"
18+
}
19+
```
820
#### 1. Create your Model connector and Model group
921

1022
##### 1a. Register Model group

0 commit comments

Comments
 (0)