[llm bench] Move calculation of memory consumption to memory_monitor tool #1937

sbalandi · 2025-03-18T20:04:43Z

memory_monitor.py is taken from https://github.com/openvinotoolkit/nncf/blob/develop/tools/memory_monitor.py
Two custom lines were added because of an issue with tkinter, found on the text2image pipeline with stable-diffusion-v2-1 and the PyTorch framework:

import matplotlib
# CUSTOM FIX TO AVOID ISSUE: RuntimeError: main thread is not in main loop
matplotlib.use('Agg')

Task: CVS-164392 CVS-157590

sbalandi · 2025-03-18T20:28:41Z

To discuss. Is it okay that:

A delay was added, because compilation and generation can sometimes be too fast and the measurement would be 0 in such cases: interval 0.01, delay 0.03.
The reported memory consumption no longer includes the full memory the process consumes, only the memory consumed by the measured code snippet:
before, consumption from start + generate:
[warm-up][P0] Max rss memory cost: 5113.64MBytes
now, just generate:
[warm-up][P0] Max rss memory cost: 3991.55MBytes
As a result, the generation step right after warm-up shows very low consumption:
before:
[1][P0] Max rss memory cost: 5124.81MBytes
now:
[1][P0] Max rss memory cost: 0.51MBytes
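
To make the two points above concrete, here is a minimal sketch of snippet-scoped sampling. It is not the code from memory_monitor.py: the class name `SnippetMemorySampler`, the `read_mb` callable and the helper `read_rss_mb` are hypothetical, and treating interval and delay as seconds is an assumption. The sampler records a baseline on entry, polls in a background thread every `interval`, and waits `delay` before stopping so that a very fast snippet still gets sampled.

```python
import threading
import time


def read_rss_mb():
    """Optional real reader (assumption: psutil is installed)."""
    import psutil  # imported lazily so the sketch runs without it
    return psutil.Process().memory_info().rss / (1024 * 1024)


class SnippetMemorySampler:
    """Hypothetical helper, not the memory_monitor.py implementation."""

    def __init__(self, read_mb, interval=0.01, delay=0.03):
        self.read_mb = read_mb      # callable returning current memory in MB
        self.interval = interval    # polling period (assumed seconds)
        self.delay = delay          # extra wait before stopping (assumed seconds)
        self.samples = []
        self._stop = threading.Event()
        self._thread = None

    def _run(self):
        # Poll until asked to stop; wait() doubles as an interruptible sleep.
        while not self._stop.is_set():
            self.samples.append(self.read_mb())
            self._stop.wait(self.interval)

    def __enter__(self):
        self.samples = [self.read_mb()]  # baseline taken before the snippet
        self._stop.clear()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()
        return self

    def __exit__(self, *exc):
        time.sleep(self.delay)           # give a fast snippet time to be sampled
        self._stop.set()
        self._thread.join()
        self.samples.append(self.read_mb())  # final reading after the snippet
        return False

    @property
    def baseline(self):
        return self.samples[0]

    @property
    def max_total(self):
        return max(self.samples)

    @property
    def max_increase(self):
        return self.max_total - self.baseline


# Deterministic demo with a fake reader instead of real RSS.
fake = {"mb": 100.0}
with SnippetMemorySampler(lambda: fake["mb"]) as sampler:
    fake["mb"] = 150.0  # stands in for memory allocated by the snippet

print(f"full: {sampler.max_total:.2f}MBytes, increase: {sampler.max_increase:.2f}MBytes")
```

With this kind of measurement, a step that runs after warm-up starts from an already high baseline, which is consistent with the very small value reported for iteration 1.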

sbalandi · 2025-03-19T13:36:29Z

Resolution on the points raised in the previous comment:

The delay is OK.
Keep printing the full memory and add a print of the increase.
Move the content of memory_profiling.py to memory_monitor.py.
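
A rough sketch of what printing both values could look like. The function name `format_memory_lines` and the wording of the second line ("Max rss memory increase") are assumptions for illustration only; only the first line follows the log format quoted in this thread. The numbers are reused from the warm-up logs above purely as sample input.

```python
def format_memory_lines(iter_label, proc_id, max_rss_mb, rss_increase_mb):
    """Build one line for the full process memory and one for the increase.

    Hypothetical helper: the second line's wording is not taken from llm_bench.
    """
    prefix = f"[{iter_label}][P{proc_id}]"
    return [
        f"{prefix} Max rss memory cost: {max_rss_mb:.2f}MBytes",
        f"{prefix} Max rss memory increase: {rss_increase_mb:.2f}MBytes",
    ]


for line in format_memory_lines("warm-up", 0, 5113.64, 3991.55):
    print(line)
```

Keeping the full value first leaves the existing log line unchanged for anything that already parses it.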


github-actions bot added the category: llm_bench label Mar 18, 2025

sbalandi force-pushed the mem_mon branch 2 times, most recently from 04f7441 to e338fb6 Compare March 18, 2025 20:46

sbalandi requested a review from eaidova March 18, 2025 21:56

sbalandi marked this pull request as ready for review March 18, 2025 21:56

ilya-lavrenov assigned eaidova Mar 19, 2025

[llm bench] Move calculation of memory consumption to memory_monitor tool

c1fc02a

sbalandi force-pushed the mem_mon branch 2 times, most recently from 991a7f1 to f37e4ec Compare March 20, 2025 22:38

update

f37e4ec
