add tutorial for RAG with DeepSeek R1 model on Sagemaker #3455

Merged

ylwu-amzn merged 5 commits into opensearch-project:main from ylwu-amzn:ds_doc

Jan 29, 2025

Collaborator

ylwu-amzn commented Jan 29, 2025

Description

This tutorial shows how to build RAG in OpenSearch using a DeepSeek model deployed on SageMaker.

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.


          add tutorial for RAG with DeepSeek R1 model on Sagemaker

207bf8e

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn requested review from b4sjoo, dhrubo-os, mingshl, jngz-es, model-collapse, rbhavna, zane-neo, Zhangxunmt, austintlee, HenryL27 and xinyual as code owners

January 29, 2025 08:29

ylwu-amzn had a problem deploying to ml-commons-cicd-env

January 29, 2025 08:30

— with

GitHub Actions Failure


          add tutorials for RAG with DeepSeek Chat model

b028302

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn temporarily deployed to ml-commons-cicd-env

January 29, 2025 09:33

— with

GitHub Actions Inactive

ylwu-amzn had a problem deploying to ml-commons-cicd-env

January 29, 2025 09:33

— with

GitHub Actions Failure

nathaliellenaa reviewed

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

Zhangxunmt reviewed

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Sagemaker.md Outdated


		### Deploy DeepSeek R1 model to Sagemaker

		Follow this [blog](https://community.aws/content/2sG84dNUCFzA9z4HdfqTI0tcvKP/deploying-deepseek-r1-on-amazon-sagemaker) to deploy DeepSeek R1 model to Sagemaker.

Collaborator

Zhangxunmt Jan 29, 2025

This is interesting. Have you followed the steps and deployed successfully in SageMaker?

Collaborator Author

ylwu-amzn Jan 29, 2025

Yes, the results are from a test cluster, not fabricated.

pyek-bot reviewed

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

Comment on lines 153 to 163

+              Add DeepSeek API endpoint to trusted URLs
+              ```
+              PUT /_cluster/settings
+              {
+                  "persistent": {
+                      "plugins.ml_commons.trusted_connector_endpoints_regex": [
+                        "^https://api\\.deepseek\\.com/.*$"
+                      ]
+                  }
+              }
+              ```

Contributor

pyek-bot Jan 29, 2025

This will not be needed from v2.19 onward, so should we mention that here?

Collaborator Author

ylwu-amzn Jan 29, 2025

We can add that after 2.19.
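
For readers following the thread, here is a minimal sketch of what the quoted setting admits. It parses the settings body from the diff and checks a few URLs against the pattern. Two assumptions: Python's `re` is used as a stand-in for the regex engine OpenSearch uses (the two agree for a pattern this simple), and whole-string matching is assumed. All example URLs are hypothetical.

```python
import json
import re

# Settings body as quoted in the review thread. In JSON, "\\." decodes to the
# regex token "\." (a literal dot).
SETTINGS_JSON = r"""
{
  "persistent": {
    "plugins.ml_commons.trusted_connector_endpoints_regex": [
      "^https://api\\.deepseek\\.com/.*$"
    ]
  }
}
"""

SETTING_KEY = "plugins.ml_commons.trusted_connector_endpoints_regex"


def trusted_patterns(settings_json):
    """Return the list of trusted-endpoint regexes from a cluster settings body."""
    return json.loads(settings_json)["persistent"][SETTING_KEY]


def is_trusted(url, patterns):
    """Assumption: an endpoint is trusted if any pattern matches the whole URL."""
    return any(re.fullmatch(p, url) is not None for p in patterns)


if __name__ == "__main__":
    patterns = trusted_patterns(SETTINGS_JSON)
    for url in (
        "https://api.deepseek.com/any/path",       # hypothetical path
        "http://api.deepseek.com/any/path",        # wrong scheme
        "https://api.deepseek.com.example.org/x",  # look-alike host
    ):
        print(url, is_trusted(url, patterns))
```

Escaping the dots and anchoring with `^` and `$` is what keeps look-alike hosts from matching.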

pyek-bot reviewed

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

+                "status": "COMPLETED"
+              }
+              ```
+. Predict

Contributor

pyek-bot Jan 29, 2025

Should we link the DeepSeek Chat API here, so users know which API we are calling internally?

Collaborator Author

ylwu-amzn Jan 29, 2025

The first sentence already mentions the API:

This tutorial introduces how to build RAG in Amazon managed OpenSearch with DeepSeek Chat Model.

pyek-bot reviewed

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Sagemaker.md

+              ```
+. Predict
+              ```
+              POST /_plugins/_ml/models/Sym9sJQBts7fa6byEh1-/_predict

Contributor

pyek-bot Jan 29, 2025

Same here: is it possible to provide a reference so users understand how these parameters work?

Collaborator Author

ylwu-amzn Jan 29, 2025 (edited)

This is not easy. The SageMaker team owns how the API works; I don't have a clear view of it and found no docs.
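
As a rough illustration of the request shape under discussion, the sketch below assembles the Predict call quoted in the diff. The path and model ID come from the diff; the `parameters` wrapper is the ML Commons convention for remote models, whose values are substituted into the connector's request template. The parameter name `inputs` and the question text are hypothetical placeholders, since the thread notes that the SageMaker-side parameters are undocumented.

```python
import json


def build_predict_request(model_id, parameters):
    """Return (method, path, body) for an ML Commons Predict call.

    The keys in `parameters` must match whatever placeholders the model's
    connector defines; this helper does not validate them.
    """
    if not model_id:
        raise ValueError("model_id must be non-empty")
    path = f"/_plugins/_ml/models/{model_id}/_predict"
    body = {"parameters": dict(parameters)}
    return "POST", path, body


if __name__ == "__main__":
    method, path, body = build_predict_request(
        "Sym9sJQBts7fa6byEh1-",             # model ID from the quoted diff
        {"inputs": "What is OpenSearch?"},  # hypothetical parameter name and value
    )
    print(method, path)
    print(json.dumps(body, indent=2))
```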


          add tutorial for RAG with DeepSeek R1 model on Sagemaker

c4b65a8

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn had a problem deploying to ml-commons-cicd-env

January 29, 2025 20:33

— with

GitHub Actions Failure

kolchfa-aws reviewed

Contributor

kolchfa-aws left a comment

I made suggestions for the first 2 files. The 3rd one is almost the same, so please apply my suggestions to that one as well. Thanks.

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

docs/tutorials/aws/RAG_with_DeepSeek_Chat_model.md Outdated

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Bedrock.md Outdated

+              ## 5. RAG
+              ### 5.1 create search pipeline
+              Create search pipeline with [RAG processor](https://opensearch.org/docs/latest/search-plugins/search-pipelines/rag-processor/).

Contributor

kolchfa-aws Jan 29, 2025

Suggested change

      
-              Create search pipeline with [RAG processor](https://opensearch.org/docs/latest/search-plugins/search-pipelines/rag-processor/).
+              Create search pipeline with a [RAG processor](https://opensearch.org/docs/latest/search-plugins/search-pipelines/rag-processor/):
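
For context, a search pipeline with a RAG processor might be assembled as in the sketch below. The processor name `retrieval_augmented_generation` and its `model_id`, `context_field_list`, and `system_prompt` fields follow the linked RAG processor documentation. The pipeline name, model ID placeholder, field name `text`, and prompt are illustrative assumptions, not values from the tutorial under review.

```python
import json


def build_rag_pipeline(model_id, context_fields,
                       system_prompt="You are a helpful assistant."):
    """Build the body of a search pipeline with one RAG response processor."""
    return {
        "response_processors": [
            {
                "retrieval_augmented_generation": {
                    "description": "Demo RAG pipeline",      # free-text label
                    "model_id": model_id,                    # ID of the LLM model
                    "context_field_list": list(context_fields),
                    "system_prompt": system_prompt,
                }
            }
        ]
    }


def pipeline_request_line(pipeline_name):
    """Return the REST request line that creates the named search pipeline."""
    return f"PUT /_search/pipeline/{pipeline_name}"


if __name__ == "__main__":
    # "my-rag-pipeline" and "<llm-model-id>" are hypothetical placeholders.
    print(pipeline_request_line("my-rag-pipeline"))
    print(json.dumps(build_rag_pipeline("<llm-model-id>", ["text"]), indent=2))
```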

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Bedrock.md Outdated

+              }
+              ```
+              ### 5.2 create vector database
+              Follow this [neural search tutorial](https://opensearch.org/docs/latest/search-plugins/neural-search-tutorial/) to create embedding model, K-NN index `my-nlp-index`, and ingest data

Contributor

kolchfa-aws Jan 29, 2025

Suggested change

      
-              Follow this [neural search tutorial](https://opensearch.org/docs/latest/search-plugins/neural-search-tutorial/) to create embedding model, K-NN index `my-nlp-index`, and ingest data
+              Follow the [neural search tutorial](https://opensearch.org/docs/latest/search-plugins/neural-search-tutorial/) to create an embedding model and a k-NN index. Then ingest data into the index:

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Bedrock.md Outdated

		```


		### 5.3 search

Contributor

kolchfa-aws Jan 29, 2025

Suggested change

      
-              ### 5.3 search
+              ### 5.3 Search the index

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Bedrock.md Outdated



		### 5.3 search
		Run neural search to retrieve documents from vector database, then use DeepSeek model to do RAG.

Contributor

kolchfa-aws Jan 29, 2025

Suggested change

      
-              Run neural search to retrieve documents from vector database, then use DeepSeek model to do RAG.
+              Run vector search to retrieve documents from the vector database, then use the DeepSeek model for RAG:
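
The search step described here could look roughly like the following sketch: a `neural` query retrieves documents, and `ext.generative_qa_parameters` passes the question to the RAG processor in the search pipeline. The index name `my-nlp-index` appears earlier in this review. The vector field `passage_embedding`, the source field `text`, the pipeline name, the model ID placeholder, and the numeric values are assumptions, and the tutorial's actual request may set additional fields.

```python
import json


def build_rag_search(question, embedding_model_id,
                     vector_field="passage_embedding", k=5):
    """Build a neural search body that also asks the RAG processor a question."""
    return {
        "query": {
            "neural": {
                vector_field: {
                    "query_text": question,          # text to embed and search for
                    "model_id": embedding_model_id,  # embedding model, not the LLM
                    "k": k,
                }
            }
        },
        "size": k,
        "_source": ["text"],
        "ext": {
            "generative_qa_parameters": {
                "llm_question": question,  # question forwarded to the LLM
                "context_size": k,         # number of hits passed as context
                "timeout": 15,             # seconds; illustrative value
            }
        },
    }


def search_request_line(index, pipeline_name):
    """Return the REST request line that runs the search through the pipeline."""
    return f"GET /{index}/_search?search_pipeline={pipeline_name}"


if __name__ == "__main__":
    # The pipeline name and model ID are hypothetical placeholders.
    print(search_request_line("my-nlp-index", "my-rag-pipeline"))
    body = build_rag_search("What is OpenSearch?", "<embedding-model-id>")
    print(json.dumps(body, indent=2))
```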

docs/tutorials/aws/RAG_with_DeepSeek_R1_model_on_Bedrock.md Outdated

+                }
+              }
+              ```
+              Response

Contributor

kolchfa-aws Jan 29, 2025

Suggested change

      
-              Response
+              The response contains the matching documents:


          Apply suggestions from code review

a56f88c

Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn had a problem deploying to ml-commons-cicd-env

January 29, 2025 22:51

— with

GitHub Actions Failure


          apply comments to sagemkaer tutorial

df96b04

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

ylwu-amzn had a problem deploying to ml-commons-cicd-env

January 29, 2025 23:07

— with

GitHub Actions Failure

jngz-es approved these changes

b4sjoo approved these changes

ylwu-amzn merged commit 8830a43 into opensearch-project:main

5 of 7 checks passed

ylwu-amzn added the backport 2.x label

opensearch-trigger-bot bot pushed a commit that referenced this pull request


          add tutorial for RAG with DeepSeek R1 model on Sagemaker (#3455)

dc12a14

* add tutorial for RAG with DeepSeek R1 model on Sagemaker

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* add tutorials for RAG with DeepSeek Chat model

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* add tutorial for RAG with DeepSeek R1 model on Sagemaker

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* Apply suggestions from code review

Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* apply comments to sagemkaer tutorial

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

---------

Signed-off-by: Yaliang Wu <ylwu@amazon.com>
Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
(cherry picked from commit 8830a43)

opensearch-trigger-bot bot mentioned this pull request

[Backport 2.x] add tutorial for RAG with DeepSeek R1 model on Sagemaker #3462

Merged

ylwu-amzn added a commit that referenced this pull request


          add tutorial for RAG with DeepSeek R1 model on Sagemaker (#3455) (#3462)

23ad63b

* add tutorial for RAG with DeepSeek R1 model on Sagemaker

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* add tutorials for RAG with DeepSeek Chat model

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* add tutorial for RAG with DeepSeek R1 model on Sagemaker

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* Apply suggestions from code review

Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
Signed-off-by: Yaliang Wu <ylwu@amazon.com>

* apply comments to sagemkaer tutorial

Signed-off-by: Yaliang Wu <ylwu@amazon.com>

---------

Signed-off-by: Yaliang Wu <ylwu@amazon.com>
Co-authored-by: kolchfa-aws <105444904+kolchfa-aws@users.noreply.github.com>
(cherry picked from commit 8830a43)

Co-authored-by: Yaliang Wu <ylwu@amazon.com>


Reviewers

Zhangxunmt left review comments

kolchfa-aws left review comments

nathaliellenaa left review comments

pyek-bot left review comments

jngz-es approved these changes

b4sjoo approved these changes

dhrubo-os: awaiting requested review (code owner)

mingshl: awaiting requested review (code owner)

model-collapse: awaiting requested review (code owner)

rbhavna: awaiting requested review (code owner)

zane-neo: awaiting requested review (code owner)

austintlee: awaiting requested review (code owner)

HenryL27: awaiting requested review (code owner)

xinyual: awaiting requested review (code owner)
