Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove connect/disconnect messages from the audit logs #1218

Merged
merged 2 commits into from
Feb 14, 2024

Conversation

oestrich
Copy link
Contributor

It's a very large portion of what's in the audit logs table. We should probably be storing these in another way because devices tend to flap online/offline enough with LTE connections.

It's a very large portion of what's in the audit logs table. We
should probably be storing these in another way because devices tend to
flap online/offline enough with LTE connections.
@oestrich oestrich requested a review from jjcarstens February 14, 2024 15:41
Copy link
Collaborator

@jjcarstens jjcarstens left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For those following - When scaling to lots of devices, this audit event bloats the table and is problematic when trying to load other pages that include audit logs.

Adding an identifier to the telemetry event at least allows some metric and introspection until we figure out how to best handle connect/disconnect

@oestrich oestrich merged commit 2a9be62 into main Feb 14, 2024
2 checks passed
@oestrich oestrich deleted the audit-logs-no-connect-disconnect branch February 14, 2024 21:41
@fhunleth
Copy link
Contributor

I'm debugging an issue right now where having a record of when the hub was online is really nice. I hope we can get this back soon. I can make the "Last Handshake" field to work in the interim.

@fhunleth
Copy link
Contributor

If anyone is following this, I've learned that there are just fundamental issues with how the audit log is stored and index. Jon alluded to it in his comment. I did not realize how unscaleable the audit log was right now. Manual intervention is necessary to prune logs aggressively to prevent timeouts. This PR provides breathing room to make a proper fix.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants