Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[memory_checker] cgroup memory usage file 'memory.usage_in_bytes' does not exist on device #21880

Open
volodymyrsamotiy opened this issue Feb 27, 2025 · 2 comments
Labels
Issue for 202411 Triaged this issue has been triaged

Comments

@volodymyrsamotiy
Copy link
Collaborator

Description
Very rarely, during config reload or reboot, below error is observed:

ERR memory_checker: [memory_checker] cgroup memory usage file '/sys/fs/cgroup/memory/docker/3d4452045e8e85223825a45a913eb9ae041494be5702fa5b4c9059cd7d0b8b0b/memory.usage_in_bytes' of container '3d4452045e8e85223825a45a913eb9ae041494be5702fa5b4c9059cd7d0b8b0b' does not exist on device! Exiting ...

It happens on shutdown and it is not harmful in any way except just error in syslog.

It looks like race condition between memory_checker and docker shutdown flow.

memory_checker is running periodically and checks memory usage.
It checks memory for containers using this file '/sys/fs/cgroup/memory/docker/.../memory.usage_in_bytes'.
It could happen that container is already down which means 'memory.usage_in_bytes' file does not exist anymore.
But memory_checker was executed in parallel so it didn't know yet container is down, so still checked the file and printed an error that there is no such file.

Steps to reproduce the issue:

  1. Execute config reload or reboot (please note, as mentioned above reproduce rate is low)

Describe the results you received:
Very rarely, during config reload or reboot, below error is observed:

ERR memory_checker: [memory_checker] cgroup memory usage file '/sys/fs/cgroup/memory/docker/3d4452045e8e85223825a45a913eb9ae041494be5702fa5b4c9059cd7d0b8b0b/memory.usage_in_bytes' of container '3d4452045e8e85223825a45a913eb9ae041494be5702fa5b4c9059cd7d0b8b0b' does not exist on device! Exiting ...

Describe the results you expected:
No memory_checker related error during config reload and reboot.

@dgsudharsan
Copy link
Collaborator

@qiluo-msft Can you please investigate this issue?

@arlakshm
Copy link
Contributor

@qiluo-msft to check

@arlakshm arlakshm added the Triaged this issue has been triaged label Mar 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Issue for 202411 Triaged this issue has been triaged
Projects
None yet
Development

No branches or pull requests

3 participants