How to make the performance breakdown like the picture in Fig. 3? #10

chenhongyu2048 · 2023-08-16T08:27:19Z

While reading this article, I noticed that the time breakdown in Figure 3 is very accurate. Which tool did you use to take the time measurements?

yzhaiustc · 2023-08-16T16:15:16Z

I measured the time manually by placing a tic and a toc around each segment, which gives T1, T2, ..., Tn.
T_total = T1 + T2 + ... + Tn, so the percentage of each segment can be computed as T1 / T_total * 100%, and so on.
This measurement makes sense because BERT inference is a single-stream computational pipeline.
Alternatively, you could use Nsight Systems to measure the elapsed time, and I would expect you to see a similar result.
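
For readers who want to reproduce this kind of breakdown, here is a minimal sketch of the tic/toc approach in Python. It is not the code used for Figure 3. The segment names and the `fake_*` functions are hypothetical stand-ins for real pipeline stages, and `time_segments` and `breakdown` are illustrative helper names. One caveat when timing GPU work: kernel launches are asynchronous, so the device has to be synchronized before each clock read (for example with `torch.cuda.synchronize()` if you use PyTorch, which is an assumption here), otherwise the timer only captures launch overhead.

```python
import time


def time_segments(segments):
    """Run each (name, fn) pair once, in order, and return elapsed seconds per segment."""
    timings = {}
    for name, fn in segments:
        # For GPU code, synchronize the device here before reading the clock.
        tic = time.perf_counter()
        fn()
        # For GPU code, synchronize again here so the segment has really finished.
        toc = time.perf_counter()
        timings[name] = toc - tic
    return timings


def breakdown(timings):
    """Convert per-segment times T1..Tn into percentages of T_total."""
    total = sum(timings.values())
    if total <= 0:
        raise ValueError("total elapsed time must be positive")
    return {name: 100.0 * t / total for name, t in timings.items()}


# Hypothetical stand-ins for real pipeline segments (assumptions, not from the paper).
def fake_attention():
    time.sleep(0.02)


def fake_feed_forward():
    time.sleep(0.03)


def fake_layernorm():
    time.sleep(0.01)


if __name__ == "__main__":
    timings = time_segments([
        ("attention", fake_attention),
        ("feed_forward", fake_feed_forward),
        ("layernorm", fake_layernorm),
    ])
    for name, pct in breakdown(timings).items():
        print(f"{name}: {pct:.1f}%")
```

Summing the segment times is only meaningful when the segments run back to back on one stream, as described above. In practice you would likely also add a few warm-up iterations and average over many runs before computing the percentages.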

chenhongyu2048 · 2023-08-17T03:45:57Z

Thank you for your reply. By "single-stream computational pipeline", do you mean that the time spent loading model weights from HBM into the cache is counted as part of the computation time? Or is the loading time overlapped with computation?
