Intel/CI: Add OneCCL CPU tests with DSA + shm to the CI #9904

Merged 2 commits on Mar 20, 2024

Juee14Desai
Contributor

  • Add a OneCCL DSA stage.
  • libfabric was being built with ze while OneCCL was built without it.
    This led to a failure and to the CCL transport being switched to MPI, because OneCCL looked for the library.
    To avoid this, an else case is added that passes --with-ze=no.
  • Update the OneCCL summarizer to support the DSA + shm summary.
  • Add logic to check whether DSA was actually used.
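
For illustration, the else case described above could look roughly like the following sketch. It is not the code from this PR: the function names and the `ze_dir` parameter are hypothetical, and the `--with-ze=<dir>` form for the enabled case is an assumption. Only `--with-ze=no` comes from the description.

```python
from typing import List, Optional


def ze_configure_args(ze_dir: Optional[str]) -> List[str]:
    """Return the configure argument that controls ze support.

    ze_dir is a hypothetical parameter: the ze install prefix when the
    build should use ze, or None for a build without it.
    """
    if ze_dir:
        # Assumed form for the enabled case: point configure at the prefix.
        return [f"--with-ze={ze_dir}"]
    # The else case from the description: disable ze explicitly
    # instead of leaving the option unset.
    return ["--with-ze=no"]


def configure_command(prefix: str, ze_dir: Optional[str] = None) -> List[str]:
    """Assemble a configure command line as an argument list."""
    return ["./configure", f"--prefix={prefix}", *ze_configure_args(ze_dir)]
```

Calling `configure_command("/opt/ofi")` yields a command that ends in `--with-ze=no`, so a build without ze never depends on what configure would otherwise decide by itself.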

Contributor

j-xiong commented Mar 18, 2024

Wait until the 'v1.21.x' branch is created.

Contributor

j-xiong commented Mar 19, 2024

The v1.21.x branch has been created.

Contributor

@zachdworkin zachdworkin left a comment

Looks good. We still need to wait for CloudBees to pass before merging.

- Add OneCCL DSA stage.
- There was an issue with libfabric being built with ze and oneccl built without.
This led to failure and CCL transport being switched to MPI because oneccl looked for the library.
To avoid this, an else case is added with --with-ze=no.

Signed-off-by: Juee Himalbhai Desai <juee.himalbhai.desai@intel.com>
Juee14Desai force-pushed the main branch 4 times, most recently from 05b79cc to c736ec8 (March 20, 2024 18:23)
- Update oneccl summarizer to support DSA + shm summary.
- Add the logic to check whether DSA was actually used or not.

Signed-off-by: Juee Himalbhai Desai <juee.himalbhai.desai@intel.com>
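
The check for whether DSA was actually used, mentioned in the commit message above, might be sketched as below. This is a guess at the general shape, not the summarizer code from this PR. The function names, the result labels, and the `marker` argument are all hypothetical, and the text that real logs print when DSA is active is not known here, so the caller must supply it.

```python
from typing import Iterable


def dsa_was_used(log_lines: Iterable[str], marker: str) -> bool:
    """Return True if any log line contains the given marker.

    `marker` is a placeholder for whatever text the real logs print
    when DSA is active; this sketch does not know that text.
    """
    return any(marker in line for line in log_lines)


def summarize(test_passed: bool, log_lines: Iterable[str], marker: str) -> str:
    """Classify a DSA + shm run; the result labels are hypothetical."""
    if not test_passed:
        return "fail"
    # A passing run only counts as a DSA pass if DSA was really exercised.
    if dsa_was_used(log_lines, marker):
        return "pass"
    return "pass-without-dsa"
```

Separating "passed" from "passed without DSA" keeps a run that fell back to another copy path from being reported as DSA coverage.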
@zachdworkin
Contributor

AWS CI does not need to finish, since these changes only affect Intel CI.

@zachdworkin zachdworkin merged commit e3fc8d7 into ofiwg:main Mar 20, 2024
16 of 17 checks passed
3 participants