Inference Fails on 16GB GPU even in Low-VRAM Mode #32

Open
rachelkluu opened this issue Jan 21, 2025 · 3 comments

Comments

@rachelkluu

I'm running into issues trying to run run.py on the demo files. Command:

python run.py demo_files/examples/fish.png --output-dir output/ --low-vram-mode

Output:
RuntimeError: CUDA driver error: out of memory

CUDA Available: True
CUDA Version: 12.4
PyTorch Version: 2.5.1+cu124
GPU 0: Quadro RTX 5000
GPU 0 Memory: 16.11 GB

I already tried rebuilding the venv twice, but it's still not working. It works when I run it on my CPU, but ideally I'd like to get it running on my GPU. Thanks!
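
For context, the CUDA diagnostics quoted above can be reproduced with a short PyTorch snippet; this is just a minimal sketch, assuming only that torch is installed:

import torch

# Print the same CUDA / PyTorch diagnostics as in the report above.
print("CUDA Available:", torch.cuda.is_available())
print("CUDA Version:", torch.version.cuda)
print("PyTorch Version:", torch.__version__)
for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    print(f"GPU {i}: {props.name}")
    print(f"GPU {i} Memory: {props.total_memory / 1024**3:.2f} GB")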

@rachelkluu changed the title from "Inference Fails on 16GB GPU in Low-VRAM Mode" to "Inference Fails on 16GB GPU even in Low-VRAM Mode" on Jan 21, 2025
@jammm
Collaborator

jammm commented Jan 23, 2025

Hmm, that's weird. It ran fine (without low VRAM mode) on my 4080 mobile, and that one has 12GB of VRAM. It should use ~10GB of memory without low VRAM mode and around 6GB with low VRAM mode.

Are you sure that there's nothing else running on the GPU when you run the model? What does nvidia-smi show?
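
(For reference, one way to watch memory usage over time while the model runs, assuming an nvidia-smi recent enough to support the loop flag, is:

nvidia-smi --loop=1

which re-prints the usual table once per second.)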

@rachelkluu
Author

Yeah, it's odd. It happens both with and without low VRAM mode.

Here's nvidia-smi when I run without low VRAM mode, and yes, it stays at ~10GB for some time:

+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.134                Driver Version: 553.35         CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  Quadro RTX 4000                On  |   00000000:04:00.0 Off |                  N/A |
| 30%   32C    P8              1W /  125W |      15MiB /  8192MiB  |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
|   1  Quadro RTX 5000                On  |   00000000:65:00.0  On |                    0 |
| 33%   51C    P2            196W /  230W |  10477MiB / 15360MiB   |    100%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|    0   N/A  N/A              16    G   /Xwayland                                  N/A   |
|    1   N/A  N/A              16    G   /Xwayland                                  N/A   |
+-----------------------------------------------------------------------------------------+

But then, all of a sudden, it'll spike to 15GB+ and crash:

[Screenshot attached showing the memory spike before the crash]

@jammm
Collaborator

jammm commented Jan 29, 2025

Can you try setting CUDA_VISIBLE_DEVICES=1 and running again?
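
Just as a sketch of what that could look like, assuming a POSIX shell (e.g. inside WSL) and the same invocation from the issue description:

CUDA_VISIBLE_DEVICES=1 python run.py demo_files/examples/fish.png --output-dir output/ --low-vram-mode

That should make only GPU 1 (the 16GB Quadro RTX 5000 from the nvidia-smi output above) visible to the process.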
