[DOCS] Adds important admonition to handling delayed data page #97753
Pinging @elastic/ml-core (Team:ML)
Pinging @elastic/es-docs (Team:Docs)
+ " Consider increasing query_delay";
+ " Check if your ingest pipeline delays ingesting. If the pipeline works as expected, consider increasing query_delay.";
I have no strong feelings about this. But IDK how common it is for folks to have ingest pipelines.
Would a user be confused if they have no ingest pipelines?
If y'all think this is fine, I am cool with it. @droberts195
I think there's been a misunderstanding at some point. "Ingest pipeline" here really means "end to end ingestion process". So that includes everything from:
- Originating application creating message
- Data collector reading the message
- Data collector processing
- Data collector sending to Elasticsearch
- Elasticsearch processing
- Elasticsearch indexing
- Elasticsearch making searchable
Going by the dictionary definitions of the words, all of that could be described as an "ingest pipeline", but we shouldn't use the term here because it has a very specific meaning in Elasticsearch.
So basically "ingest pipeline" should be replaced with something else throughout this PR. Maybe "end to end ingestion process", "end to end ingestion flow", "ingestion flow from source to index", or something better.
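To make the distinction concrete, here is a minimal sketch of why the whole flow matters when choosing `query_delay`: the delay has to cover the sum of every stage listed above, not just the Elasticsearch ingest pipeline step. The class name, method names, and the per-stage durations in `main` are all hypothetical and chosen for illustration only; they are not taken from this PR or from the Elasticsearch codebase.

```java
import java.time.Duration;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Elasticsearch: models the end-to-end
// ingestion flow as named stages, each with an assumed delay.
final class IngestionLatencySketch {

    // Sum the delays of every stage, from the originating application
    // through to the document becoming searchable.
    static Duration endToEndLatency(Map<String, Duration> stageDelays) {
        Duration total = Duration.ZERO;
        for (Duration delay : stageDelays.values()) {
            total = total.plus(delay);
        }
        return total;
    }

    // A query_delay shorter than the end-to-end latency means the datafeed
    // may search a time range before all of its documents are searchable.
    static boolean queryDelayCoversLatency(Duration queryDelay, Map<String, Duration> stageDelays) {
        return queryDelay.compareTo(endToEndLatency(stageDelays)) >= 0;
    }

    public static void main(String[] args) {
        // Illustrative numbers only; measure your own flow.
        Map<String, Duration> stages = new LinkedHashMap<>();
        stages.put("data collector reading and processing", Duration.ofSeconds(10));
        stages.put("data collector sending to Elasticsearch", Duration.ofSeconds(5));
        stages.put("Elasticsearch processing and indexing", Duration.ofSeconds(2));
        stages.put("Elasticsearch making searchable", Duration.ofSeconds(1));

        Duration queryDelay = Duration.ofSeconds(60);
        System.out.println("End to end latency: " + endToEndLatency(stages));
        System.out.println("query_delay covers it: " + queryDelayCoversLatency(queryDelay, stages));
    }
}
```

If any one stage slows down, the total grows even when the Elasticsearch ingest pipeline itself is healthy, which is why the message should point at the whole flow.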
Thanks for the input, I rephrased the text via af50d7f.
LGTM
Thank you for looking at this. I added some details to the texts while keeping the size similar.
LGTM. Thank you 🙏
Overview
This PR:
- Adds an important admonition to the Handling delayed data page that hints that missed documents could be the result of an error with the ingestion process,