
Commit 7716e54

Author: Varun Shenoy, authored Aug 16, 2023
Fixed vLLM documentation (#573)
* fixed vllm docs
* fix toml
* revert
* revert x2
1 parent 96d13ae commit 7716e54

1 file changed, +2 -2 lines changed


‎docs/examples/performance/vllm-server.mdx

@@ -20,7 +20,7 @@ This example will cover:
 Get started by creating a new Truss:
 
 ```sh
-truss init opt125
+truss init --backend VLLM opt125
 ```
 
 You're going to see a couple of prompts. Follow along with the instructions below:
@@ -69,7 +69,7 @@ truss push
 You can invoke the model with:
 
 ```sh
-truss predict -d '{"prompt": "What is a large language model?"}'
+truss predict -d '{"prompt": "What is a large language model?", "model": "facebook/opt-125M"}' --published
 ```
 
 <RequestExample>
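
The updated `truss predict` call now sends a `model` field next to `prompt`. As a rough sketch only, the request body could be built from shell variables before it is passed to the CLI. The variable names `PROMPT`, `MODEL`, and `PAYLOAD` are hypothetical and not part of the commit, and the `truss predict` line is left commented out because it assumes a Truss has already been pushed, as in the documented example.

```sh
# Hypothetical variable names; the values are the ones used in the diff above.
PROMPT='What is a large language model?'
MODEL='facebook/opt-125M'

# Assemble the JSON body. printf does not escape quotes or backslashes,
# so this only works for values that contain no JSON special characters.
PAYLOAD=$(printf '{"prompt": "%s", "model": "%s"}' "$PROMPT" "$MODEL")

# Show the body that would be sent.
echo "$PAYLOAD"

# Invocation as written in the updated docs (requires a pushed Truss):
# truss predict -d "$PAYLOAD" --published
```

For prompts that may contain quotes or newlines, a JSON-aware tool would be a safer way to build the body than `printf`.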
