Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tweak _bulk flush threshold #46

Closed
alexdunnjpl opened this issue Jul 26, 2023 · 0 comments · Fixed by #48
Closed

Tweak _bulk flush threshold #46

alexdunnjpl opened this issue Jul 26, 2023 · 0 comments · Fixed by #48
Labels
B14.0 i&t.skip Skip I&T of this task/ticket task

Comments

@alexdunnjpl
Copy link
Contributor

💡 Description

Currently, _bulk updates are flushed every 5000 products.

Per Elastic, the actual metric of interest is size.

Start with a bulk size around 5–15 MB and slowly increase it until you do not see performance gains anymore. Then start increasing the concurrency of your bulk ingestion (multiple threads, and so forth).

Fine-tuning will be somewhat annoying and take time, but some coarse tuning should yield meaningful impact with minimal effort.

@alexdunnjpl alexdunnjpl added B14.0 i&t.skip Skip I&T of this task/ticket task labels Jul 26, 2023
alexdunnjpl added a commit that referenced this issue Jul 26, 2023
this replaces the size-independent update-count threshold previously used
fixes #46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
B14.0 i&t.skip Skip I&T of this task/ticket task
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants