Cap concurrent requests to get Neptune schema #58

danielfinke · 2025-01-23T01:16:08Z

Issue #, if available: #20

Description of changes:

Previously, for a graph with n nodes or edges, there would be n concurrent requests made to Neptune via queryNeptune. Depending on your instance class or VPC settings, you could hit errors such as MemoryLimitExceededException or connection limits/sockets abruptly closing, with increasing likelihood as the size of the graph increased. Initially observed abrupt socket closing just like #20.

Now, the concurrent request Promises are resolved in batches using mapAll. For now, the batch size is hardcoded to 20. For very small instance classes (e.g. from the "Development and testing" template), MemoryLimitExceededExceptions are still likely.

Further improvements could include:

relating the batch size to the instance class
allowing the user to specify the batch size in the process args
exponential/dynamic backoff on transient errors as indicated in https://docs.aws.amazon.com/neptune/latest/userguide/errors-engine-codes.html#errors-query

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Previously, for a graph with `n` nodes or edges, there would be `n` concurrent requests made to Neptune via `queryNeptune`. Depending on your instance class or VPC settings, you could hit errors such as `MemoryLimitExceededException` or connection limits/sockets abruptly closing, with increasing likelihood as the size of the graph increased. Initially observed abrupt socket closing just like aws#20. Now, the concurrent request Promises are resolved in batches using `mapAll`. For now, the batch size is hardcoded to 20. For very small instance classes (e.g. from the "Development and testing" template), `MemoryLimitExceededException`s are still likely. Further improvements could include: - relating the batch size to the instance class - allowing the user to specify the batch size in the process args - exponential/dynamic backoff on transient errors as indicated in https://docs.aws.amazon.com/neptune/latest/userguide/errors-engine-codes.html#errors-query

src/util-promise.js

src/test/util-promise.test.js

andreachild · 2025-01-24T18:00:22Z

LGTM

Cole-Greer

Thanks for the contribution @danielfinke. LGTM

andreachild reviewed Jan 23, 2025

View reviewed changes

src/util-promise.js Outdated Show resolved Hide resolved

Add mapAll unit tests

c364fd4

andreachild reviewed Jan 24, 2025

View reviewed changes

src/test/util-promise.test.js Outdated Show resolved Hide resolved

Fix mapAll batch counter unit test

340352f

Cole-Greer approved these changes Jan 24, 2025

View reviewed changes

Cole-Greer merged commit 983c338 into aws:main Jan 24, 2025
2 checks passed

danielfinke deleted the cap-concurrent-requests-to-get-neptune-schema branch January 25, 2025 00:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cap concurrent requests to get Neptune schema #58

Cap concurrent requests to get Neptune schema #58

danielfinke commented Jan 23, 2025

andreachild commented Jan 24, 2025

Cole-Greer left a comment

Cap concurrent requests to get Neptune schema #58

Cap concurrent requests to get Neptune schema #58

Conversation

danielfinke commented Jan 23, 2025

andreachild commented Jan 24, 2025

Cole-Greer left a comment

Choose a reason for hiding this comment