Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOCS] Adds important admonition to handling delayed data page #97753

Merged
merged 8 commits into from
Jul 19, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,12 @@ if it is set too high, analysis drifts farther away from real-time. The balance
that is struck depends upon each use case and the environmental factors of the
cluster.

IMPORTANT: If you get an error that says
`Datafeed missed XXXX documents due to ingest latency`, consider increasing
the value of `query_delay'. If it doesn't help, investigate the ingest latency and its
cause. You can do this by comparing event and ingest timestamps. High latency
is often caused by bursts of ingested documents, misconfiguration of the ingest
pipeline, or misalignment of system clocks.
== Why worry about delayed data?

If data are delayed randomly (and consequently are missing from analysis), the
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -154,7 +154,7 @@ public final class Messages {
public static final String JOB_AUDIT_DATAFEED_NO_DATA = "Datafeed has been retrieving no data for a while";
public static final String JOB_AUDIT_DATAFEED_MISSING_DATA =
"Datafeed has missed {0} documents due to ingest latency, latest bucket with missing data is [{1}]."
+ " Consider increasing query_delay";
+ " Consider increasing query_delay and investigate the cause of high latency in your ingestion process.";
public static final String JOB_AUDIT_DATAFEED_RECOVERED = "Datafeed has recovered data extraction and analysis";
public static final String JOB_AUDIT_DATAFEED_STARTED_FROM_TO = "Datafeed started (from: {0} to: {1}) with frequency [{2}]";
public static final String JOB_AUDIT_DATAFEED_STARTED_REALTIME = "Datafeed started in real-time";
Expand Down