Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prov/efa: released matched rxe before destroying the srx rx_pool #10830

Merged
merged 2 commits into from
Feb 28, 2025

Conversation

jiaxiyan
Copy link
Contributor

If the sender exits early, there may still be unreleased rxe in the srx->rx_pool during util_srx_close, which will cause an
assertion error when the pool is destroyed. Release the matched rxe before calling util_srx_close.

fabtests/efa: Add remote exit early test with post recv
When rdma read is available and message size >= 1M, long read or runt read protocol is used and server is expected to get a cq entry or cq error.
Otherwise, if long CTS is used and sender exits before sending CTS data, receiver is expected to timeout after sending the CTS packet without getting a cq entry or cq error.

If the sender exits early, there may still be unreleased rxe in
the srx->rx_pool during util_srx_close, which will cause an
assertion error when the pool is destroyed.
Release the matched rxe before calling util_srx_close.

Signed-off-by: Jessie Yang <jiaxiyan@amazon.com>
@jiaxiyan jiaxiyan requested a review from a team February 28, 2025 00:06
When rdma read is available and message size >= 1M, long read or
runt read protocol is used and server is expected to get a cq entry
or cq error.
Otherwise, if long CTS is used and sender exits before sending CTS
data, receiver is expected to timeout after sending the CTS packet
without getting a cq entry or cq error.

Signed-off-by: Jessie Yang <jiaxiyan@amazon.com>
@j-xiong j-xiong merged commit 9fd92a4 into ofiwg:main Feb 28, 2025
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants