Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

segfault in clIcdGetPlatformIDsKHR #754

Closed
cdecker08 opened this issue Aug 11, 2024 · 4 comments
Closed

segfault in clIcdGetPlatformIDsKHR #754

cdecker08 opened this issue Aug 11, 2024 · 4 comments
Labels
bug merged change was merged

Comments

@cdecker08
Copy link

I have seen reports of some of our users segfaulting when utilizing tonemapping functionality in ffmpeg. The users are inconsistent (2 users with the same distro/version and graphics cards in the same family are seeing 2 different results) but the same user will consistently run fine or segfault. I was able to get the following stacktrace for the crash.

* thread #1, name = 'Plex Transcoder', stop reason = signal SIGSEGV: invalid address (fault address: 0x0)
  * frame #0: 0x00007ffff28897df libigdrcl.so`NEO::GfxCoreHelperHw<NEO::XeHpgCoreFamily>::getGpgpuEngineInstances(this=0x00007ffff3c6a270, rootDeviceEnvironment=0x00007ffff7e61160) const at gfx_core_helper_xehp_and_later.inl:81:101
    frame #1: 0x00007ffff2a73777 libigdrcl.so`NEO::ExecutionEnvironment::calculateMaxOsContextCount(this=0x00007ffff14758d0) at execution_environment.cpp:95:47
    frame #2: 0x00007ffff2afc674 libigdrcl.so`NEO::DeviceFactory::prepareDeviceEnvironments(executionEnvironment=0x00007ffff14758d0) at device_factory.cpp:201:26
    frame #3: 0x00007ffff29b2107 libigdrcl.so`NEO::prepareDeviceEnvironmentsImpl(executionEnvironment=0x00007ffff14758d0) at create_command_stream_impl.cpp:57:16
    frame #4: 0x00007ffff21fb7b6 libigdrcl.so`NEO::prepareDeviceEnvironments(executionEnvironment=0x00007ffff14758d0, osPciPath="", rootDeviceIndex=0) at get_devices.cpp:20:23
    frame #5: 0x00007ffff21fbbb2 libigdrcl.so`NEO::prepareDeviceEnvironments(executionEnvironment=0x00007ffff14758d0) at get_devices.cpp:53:12
    frame #6: 0x00007ffff2afcafd libigdrcl.so`NEO::DeviceFactory::createDevices(executionEnvironment=0x00007ffff14758d0) at device_factory.cpp:242:10
    frame #7: 0x00007ffff21fdf43 libigdrcl.so`::clGetPlatformIDs(numEntries=0, platforms=0x0000000000000000, numPlatforms=0x00007fffffffa9cc) at api.cpp:101:31
    frame #8: 0x00007ffff21ff196 libigdrcl.so`::clIcdGetPlatformIDsKHR(numEntries=0, platforms=0x0000000000000000, numPlatforms=0x00007fffffffa9cc) at api.cpp:148:14
    frame #9: 0x00007ffff6ade2a0 libavutil.so.57`___lldb_unnamed_symbol1393 + 162
    frame #10: 0x00007ffff6ae097f libavutil.so.57`___lldb_unnamed_symbol1553 + 445
    frame #11: 0x00007ffff6ae0619 libavutil.so.57`___lldb_unnamed_symbol1550 + 352
    frame #12: 0x00007ffff6ae048c libavutil.so.57`___lldb_unnamed_symbol1549 + 50
@cdecker08
Copy link
Author

I actually tracked this down to the fact we disable AIL and submitted a fix here: #755

@JablonskiMateusz
Copy link
Contributor

Hi @chrisdecker08
Thanks for reporting the issue. Disabling AIL may in fact impact here as we missed some checks for nullptr after adding debug key to disable AIL.
I'll provide wider fix with corresponding ULTs

@cdecker08
Copy link
Author

@JablonskiMateusz can you please tag me in the wider fix so I can apply the patch to our version?

@JablonskiMateusz
Copy link
Contributor

@chrisdecker08 fix merged in f617093

@JablonskiMateusz JablonskiMateusz added merged change was merged and removed in queue labels Aug 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug merged change was merged
Projects
None yet
Development

No branches or pull requests

2 participants