
Modify p4studio interactive to use limited parallel jobs #78

Conversation

jafingerhut
Contributor

No description provided.

@jafingerhut
Contributor Author

I have tested this, and right at the beginning I do see the two lines of output from the function that calculates the number of parallel jobs to use, so these changes are tested and working.

@@ -166,7 +166,8 @@ def create_default_profile() -> Profile:
 
 
 def build_sde(context: Context, profile: Profile) -> None:
-    plan = ProfileExecutionPlan(profile, None, os.cpu_count())
+    jobs = calculate_jobs_from_available_cpus_and_memory()
+    plan = ProfileExecutionPlan(profile, None, jobs)
Contributor

Probably should be an argument from the previous calculation?

Contributor Author

I am sorry, but I do not understand the comment. The new first line of build_sde calculates and stores a value in variable jobs. The second line calls ProfileExecutionPlan, with the 3rd parameter being jobs instead of os.cpu_count().

What are you recommending I change?

Contributor

jobs = calculate_jobs_from_available_cpus_and_memory()

this calculation is already done here, no? Ideally, we should only do this once and then reuse the result in other contexts.

Contributor Author

In case it is not obvious: the code in this file is run if you do the command p4studio interactive, which is what the install.sh bash script invokes. If you do this, the other Python source file that has the definition of calculate_jobs_from_available_cpus_and_memory has none of its code executed at all, unless it is explicitly called from somewhere in this file. That is what this change is adding -- an explicit call from this file into the function defined in the other file that calculates a reasonable number of parallel jobs to use.

This is not a repeated call of that function in this code path. It is the only one.

Contributor

@fruffy Feb 19, 2025

What is the common ancestor between the profile and the interactive command? I was thinking that this ancestor should call the function, then pass the result to either one.

Contributor Author

I think the common ancestor is some click module Python code that parses command line arguments and decides, via click annotations on Python methods, which function to invoke. If you do p4studio profile <some args> on the command line, it invokes a method in the file profile_command.py. If you do p4studio interactive, it invokes a method in this file being modified.

Contributor Author

Note this line early in this file:

from profile.profile_command import execute_plan, calculate_jobs_from_available_cpus_and_memory

There is an execute_plan function called from both profile_command.py and interactive_command.py that is doing most of the work of building and installing the software. Whoever wrote this code decided to define it in profile_command.py and call it from here, rather than putting execute_plan in some common third file and calling it from both places. I'm not adding new cross-file code flows that didn't already exist here; I'm adding one more cross-file call where there was already a more significant one before.
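
To make the structure concrete, here is a rough sketch of how interactive_command.py ties these pieces together after this change. It is only an illustration: project-internal imports such as Context, Profile, and ProfileExecutionPlan are omitted, and the exact arguments passed to execute_plan are assumed, not copied from the real file.

    # interactive_command.py -- simplified sketch, not the actual file contents
    from profile.profile_command import (
        execute_plan,
        calculate_jobs_from_available_cpus_and_memory,
    )

    def build_sde(context: Context, profile: Profile) -> None:
        # Ask the shared helper for a job count bounded by CPUs and memory,
        # instead of passing os.cpu_count() directly.
        jobs = calculate_jobs_from_available_cpus_and_memory()
        plan = ProfileExecutionPlan(profile, None, jobs)
        # execute_plan (defined in profile_command.py) does most of the work of
        # building and installing the software; its argument list here is assumed.
        execute_plan(context, plan)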

Contributor

@fruffy Feb 19, 2025

Hmm, I think we can leave it there then. It seems like checks like these should be set from both workflows, but it may be that the interactive command has fewer restrictions and doesn't even invoke system checks or common code.

@fruffy
Contributor

fruffy commented Feb 19, 2025

One thing, for CI we should try to use max CPUs. Simply to speed things up. If it locks up we can revert that change.

@jafingerhut
Contributor Author

> One thing, for CI we should try to use max CPUs. Simply to speed things up. If it locks up we can revert that change.

I can create a separate PR for that. It is just adding a command line option like '--jobs nproc' to the CI file that runs the p4studio profile ... command now.

Or I can just toss that one-line change into this PR. Give me a minute.

@fruffy
Contributor

fruffy commented Feb 19, 2025

Yeah, I just noted that because I saw current CI uses 3 jobs instead of 4.

@fruffy
Contributor

fruffy commented Feb 19, 2025

Feel free to do this in a separate PR. Either works for me.

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>
@jafingerhut
Contributor Author

> Yeah, I just noted that because I saw current CI uses 3 jobs instead of 4.

If it is a 4vCPU 16GB RAM instance with no swap, it could end up significantly slower, or failing. At least that is what Vlad saw on an AWS instance he tried this on a while ago.

The reason is that the Tofino back end unified build uses somewhere around 5-6 GB of RAM on a very few individual processes. Vlad suggested the idea of disabling the unified build option -- more processes to run, but no individual huge-memory ones that cause you to want to reduce the number of parallel jobs.
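
To illustrate the kind of calculation involved, here is a minimal sketch of a job count bounded by both CPUs and memory. This is not the actual calculate_jobs_from_available_cpus_and_memory() implementation in profile_command.py; the 6 GiB-per-job figure is an assumption based on the 5-6 GB numbers above, and it assumes a Linux host.

    import os

    GIB = 1024 ** 3
    # Assumption for illustration: one unified-build compile job may need ~6 GiB.
    ASSUMED_BYTES_PER_JOB = 6 * GIB

    def estimate_parallel_jobs() -> int:
        # Hypothetical helper; not the real p4studio calculation.
        cpus = os.cpu_count() or 1
        try:
            # Total physical RAM on Linux: page size times number of pages.
            total_ram = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
        except (ValueError, OSError, AttributeError):
            return cpus  # fall back to CPU count if RAM cannot be determined
        jobs_by_memory = max(1, total_ram // ASSUMED_BYTES_PER_JOB)
        return min(cpus, jobs_by_memory)

On a 4 vCPU / 16 GB instance, a formula of this shape lands at 2-3 jobs rather than 4, depending on the per-job figure assumed, which is consistent with CI picking 3 jobs instead of 4.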

@fruffy
Contributor

fruffy commented Feb 19, 2025

> Yeah, I just noted that because I saw current CI uses 3 jobs instead of 4.
>
> If it is a 4vCPU 16GB RAM instance with no swap, it could end up significantly slower, or failing. At least that is what Vlad saw on an AWS instance he tried this on a while ago.
>
> The reason is that the Tofino back end unified build uses somewhere around 5-6 GB of RAM on a very few individual processes. Vlad suggested the idea of disabling the unified build option -- more processes to run, but no individual huge-memory ones that cause you to want to reduce the number of parallel jobs.

In my experience a unified build outperforms even using multiple cores, but we can tweak the "chunk size" of the build to reduce the memory usage.

@vgurevich
Contributor

My suggestion is to have a swap file and then implement #72. Essentially, all we need is to take the min of 15% of RAM and 25% of swap and add that amount to the total size of RAM before calculating the amount of RAM per CPU. Having 15% of RAM in the swap does not usually impact the performance.

It also makes sense to have swap anyway, just for the sake of safety.
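
For concreteness, a small sketch of the adjustment described above (this is just one reading of the suggestion, not code from #72; the 15% and 25% figures come from the comment itself, and the 8 GiB swap size in the example is made up):

    GIB = 1024 ** 3

    def effective_ram_bytes(total_ram: int, total_swap: int) -> int:
        # Count at most min(15% of RAM, 25% of swap) of the swap as extra "RAM"
        # before dividing by the per-CPU (or per-job) memory requirement.
        usable_swap = min(0.15 * total_ram, 0.25 * total_swap)
        return int(total_ram + usable_swap)

    # Example: 16 GiB RAM + 8 GiB swap
    #   min(0.15 * 16 GiB, 0.25 * 8 GiB) = min(2.4 GiB, 2 GiB) = 2 GiB
    #   effective RAM = 16 GiB + 2 GiB = 18 GiB
    print(effective_ram_bytes(16 * GIB, 8 * GIB) / GIB)  # -> 18.0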

@jafingerhut
Contributor Author

I think this PR is ready to merge, if you approve.

All of the suggestions for detecting swap space and using it to determine the number of parallel jobs already have a separate issue created to track them. I am not planning to implement that in this PR.

@jafingerhut jafingerhut merged commit e01a187 into p4lang:main Feb 20, 2025
3 checks passed