This repository has been archived.
DigitalOcean has acquired Paperspace. We recommend exploring DigitalOcean GPU Droplets, DigitalOcean Managed Kubernetes with GPU Worker Nodes, or the DigitalOcean Gen AI Platform as robust, fully supported alternatives. These platforms offer comprehensive solutions tailored to modern GPU-based workloads.
Here's a relatively easy way to get your own OpenAI compatible API endpoint running on Paperspace.
- Create a Paperspace account, go to Gradient > Deployments > Create
- Select a GPU, such as P4000 $0.51/hr (can be started/stopped at any time)
- For Docker image, you can use
ollama/ollama:latest
- details here: https://github.com/ollama/ollama - Ports: 11434
- Once deployed, you will be given a new https endpoint.
- Pull a new image such as
llama3
usingcurl https://<yourendpoint>.paperspacegradient.com/api/pull -d '{"name": "llama3"}'
- That’s it! you’ve now got an OpenAI compatible API endpoint available at your Gradient URL, it will work with OpenAI compatible tools.
Paperspace deployment config json:
{
"apiVersion": "v1",
"image": "ollama/ollama:latest",
"name": "ollama",
"enabled": false,
"resources": {
"machineType": "RTX4000",
"replicas": 1,
"ports": [
11434
]
}
}