-
Notifications
You must be signed in to change notification settings - Fork 207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make s3 partition size configurable and add unit test for S3 partition creator classes #4437
Conversation
…n creator classes Signed-off-by: Dinu John <[email protected]>
@JsonProperty("export") | ||
private boolean export; | ||
|
||
@JsonProperty("export_partition_size") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What does this mean exactly?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the number of records of chunk size the export partition worker will work on when it executes query to DocDB/Mongo. The export workers run in parallel and they work on independent chunk of data.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does it make sense to call it export_batch_size
instead? For two reasons:
- The "partition" here (chunk of data queried from DocDB) is different from the "partition" (folder) in
partition_count
defined below, which can cause confusion. - We have a corresponding
stream_batch_size
parameter.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
updated to export_batch_size
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. One comment about the parameter name above.
Signed-off-by: Dinu John <[email protected]>
…n creator classes (opensearch-project#4437) * Make s3 partition size configurable and add unit test for S3 partition creator classes Signed-off-by: Dinu John <[email protected]> * Rename export partition size to export batch size Signed-off-by: Dinu John <[email protected]> --------- Signed-off-by: Dinu John <[email protected]>
Make s3 partition size configurable and add unit test for S3 partition creator classes
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.