Failing tests with Intel iGPU and its OpenCL driver #164
With #162:
|
With the brutal sync issue workaround for #152:
|
With #162, now down to
|
Iris Xe iGPU OpenCL (with #162 and @pvelesko's OpenCL event fix on top): The following tests FAILED:
Might be a single issue. @pvelesko @franz any bells ringing? |
interesting.. same ones as #146 |
Yep, seems like heap corruption that overwrites the malloc() bookkeeping structures or similar. Could be serious. Have you run Valgrind recently? |
This crashes both with the GPU and the CPU drivers. When I run it in valgrind with the CPU driver, it prints out:
Also:
When I run with the GPU driver under Valgrind, it heisenbugs away. There might still be a race in the runtime related to memory copies/allocations. Any quick guesses, @pvelesko, or should I dig deeper (to ensure this is not more serious)? |
I can't reproduce this on the OpenCL iGPU. On the OpenCL CPU side, the error seems to be coming from the OpenCL runtime since the test passes but it segfaults upon thread exit. @pjaaskel |
It might originate in the OpenCL runtime or merely surface there (memory corruption can show up that way). It fails for both the iGPU and the CPU here, but we can postpone this to 0.9.1. |
With the latest OpenCL fixes @pvelesko, this target regressed by one test:
|
ok let me check this. |
With the latest 0.9 branch:
|
|
OK. Did you report the issues to https://github.com/intel/compute-runtime/issues? |
That's for Level Zero, this is OpenCL, but maybe the same underlying issue. |
For laughs and while drunk |
At 3 failing tests with 7f20613.
|
@pjaaskel I can only reproduce the |
OK, let's see if it goes away with future driver upgrades. |
None of the 688 or so fail here anymore with OpenCL/iGPU. Well done! |
Still looking good, but one of the tests is timing out:
Label Time Summary:
Total Test time (real) = 960.48 sec
The following tests did not run:
The following tests FAILED:
|
Current (6407148) status:
The ABM cases fail because they request too large a work-group (WG) size (1000), while the iGPU supports at most 512. |
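For illustration only, one way a launch like this could be kept within the device limit is to clamp the requested work-group size and raise the group count to compensate. This is a minimal sketch, not the fix used in the project: `LaunchShape` and `clampLaunch` are hypothetical names, and it assumes a 1-D launch whose kernel bounds-checks its global index and does not depend on the original group size (for example through local memory or barriers). In OpenCL the limit itself would typically be queried with `clGetDeviceInfo` and `CL_DEVICE_MAX_WORK_GROUP_SIZE`.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical 1-D launch description: how many work-groups to launch
// and how many work-items each group contains.
struct LaunchShape {
  std::size_t numGroups;
  std::size_t groupSize;
};

// Hypothetical helper (not part of the project discussed here).
// Clamps the requested work-group size to the device maximum and rounds
// the number of groups up so that at least totalItems work-items run.
// The kernel must guard against indices >= totalItems, because
// numGroups * groupSize may exceed totalItems.
inline LaunchShape clampLaunch(std::size_t totalItems,
                               std::size_t requestedGroupSize,
                               std::size_t deviceMaxGroupSize) {
  assert(requestedGroupSize > 0 && deviceMaxGroupSize > 0);
  const std::size_t groupSize = requestedGroupSize < deviceMaxGroupSize
                                    ? requestedGroupSize
                                    : deviceMaxGroupSize;
  // Ceiling division: enough groups to cover every work-item.
  const std::size_t numGroups = (totalItems + groupSize - 1) / groupSize;
  return {numGroups, groupSize};
}
```

With the numbers from this comment, `clampLaunch(1000, 1000, 512)` yields two groups of 512 work-items instead of one group of 1000. Whether that is acceptable depends on the test: if the kernel logic relies on all 1000 work-items sharing a group, the test itself would need to change or be skipped on this device.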
Let's open a separate issue for each test we start to fix and assign it to whoever is working on it, to keep better track. |
With export OverrideDefaultFP64Settings=1 and export IGC_EnableDPEmulation=1 set to emulate doubles (#137).
Check the end of the thread for the current status.