[APM] Latency correlations meta issue #118009

walterra · 2021-11-09T12:54:39Z

Follow up to #109220.

Must have

Should have

Could have

Consolidate correlation / score / impact for both Correlations tabs, ideally showing [0-1] and an indicator of severity high | med | low | very low - [APM] Correlations: Consolidate correlation, score and impact columns #119617
Display at least 1 correlated term - We currently hide all results where the correlation is considered to be very low. However, this means users who wait for the analysis to complete can often end up seeing "No results". We should always show a result, but label it as very low. - [APM] Latency correlations: Always display at least one correlated term #119613
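
A minimal sketch of how the two items above could fit together, assuming the correlation value is already normalised to [0-1]. The type names, the function names and the threshold values are all hypothetical; neither linked issue is quoted here.

```typescript
// Hypothetical types and thresholds, for illustration only.
type Severity = 'high' | 'medium' | 'low' | 'very low';

interface CorrelatedTerm {
  fieldName: string;
  fieldValue: string;
  correlation: number; // assumed to be normalised to [0, 1]
}

// Assumed cut-offs; the issue does not specify them.
function getSeverity(correlation: number): Severity {
  if (correlation >= 0.6) return 'high';
  if (correlation >= 0.3) return 'medium';
  if (correlation >= 0.1) return 'low';
  return 'very low';
}

function getDisplayedTerms(
  terms: CorrelatedTerm[]
): Array<CorrelatedTerm & { severity: Severity }> {
  // Label every term and sort by correlation, strongest first.
  const labelled = terms
    .map((t) => ({ ...t, severity: getSeverity(t.correlation) }))
    .sort((a, b) => b.correlation - a.correlation);

  const significant = labelled.filter((t) => t.severity !== 'very low');

  // Fall back to the single strongest term instead of showing "No results".
  return significant.length > 0 ? significant : labelled.slice(0, 1);
}
```

With this shape, an analysis that only finds very low correlations still returns one row, labelled `very low`, and an empty input still returns an empty list.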

Backlog

Support metrics-based indices with a first pass to analyze metric data --> pending ability to self-select which fields can be summarised
Improve progress bar, e.g. show current service task instead of raw %
Deduplicate results - revisit initial design thoughts. --> pending further customer feedback
- Feedback on initial deduplication prototype: The display of duplicates feels a bit odd. The first-found field gets its own columns for name and value, whereas the duplicates are concatenated (name: value). The first-found field is likely to be from the priority list, but the choice is still a bit arbitrary and the duplicates might also be priority fields. I suspect it would look better as a table of name+value pairs within a row.
- There tend to be lots of duplicates in the data examples we have so far, e.g. kubernetes.pod.name and kubernetes.pod.uid. This seems more so for failures.
Field/value candidates prioritization based on user-selected fields (this could reuse the existing UI in the correlation tab where a user selects field candidates)
Caching results in tabs - Previous thoughts on caching in flyout: Clicking on a filter value should not take you directly back to the main page. If correlations take several minutes to calculate, the user should not lose this time because it is too easy to click on a filter.
Indicate percentile where 'slow' transactions begin (Need more thoughts from Steve D)
Should we continue to limit correlation analysis to a single named transaction? --> pending development of a "generic and transversal APM trace explorer"
Candidate terms selection - we have an optimization that works well on test data. --> pending performance assessment on large customer data
The trace samples page size is now 500, but with the compressed style of the EUI pagination control this means the user has to click 499 times to get to page 500. Could the compressed mode be removed? Otherwise, setting the page size to 500 seems to add very little value.
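
For the deduplication feedback above, a rough sketch of the "name+value pairs within a row" shape. It assumes, purely for illustration, that two results count as duplicates when they share the same document count and correlation value; the issue does not define the duplicate criterion, and all names here are hypothetical.

```typescript
interface FieldValuePair {
  fieldName: string;
  fieldValue: string;
}

interface TermResult extends FieldValuePair {
  correlation: number;
  docCount: number;
}

// One table row: every duplicate is an equal entry in `pairs`,
// so no field is singled out as the "first found" one.
interface DedupedRow {
  pairs: FieldValuePair[];
  correlation: number;
  docCount: number;
}

function groupDuplicates(results: TermResult[]): DedupedRow[] {
  const rows = new Map<string, DedupedRow>();
  for (const { fieldName, fieldValue, correlation, docCount } of results) {
    // Assumed duplicate criterion: identical docCount and correlation.
    const key = `${docCount}:${correlation}`;
    const row = rows.get(key);
    if (row) {
      row.pairs.push({ fieldName, fieldValue });
    } else {
      rows.set(key, { pairs: [{ fieldName, fieldValue }], correlation, docCount });
    }
  }
  return Array.from(rows.values());
}
```

Under that assumption, kubernetes.pod.name and kubernetes.pod.uid for the same pod would end up as two pairs in a single row.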

Tech Debt

Optimize ES queries (e.g. summarize field/value queries as part of nested aggs or multi search), investigate use of p-limit
Destructure arguments of query calls
[ML] APM Correlations: Chart for failed transactions correlations tab. #110172 (comment)
Revisit the naming of callbacks of CorrelationTable, for example setSelectedSignificantTerm still references significant terms which was used by the previous version of correlation analysis.
Revisit hooks that fetch correlations results and possibly consolidate duplicate code like fetching the overall histogram.
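
As an illustration of the "destructure arguments of query calls" item, a query helper could take one parameter object instead of positional arguments. This is only a sketch: `getTermQuery`, `TermQueryParams` and the chosen fields are hypothetical and not taken from the existing code.

```typescript
// Hypothetical parameter object; field names are illustrative.
interface TermQueryParams {
  fieldName: string;
  fieldValue: string;
  start: number; // epoch milliseconds
  end: number; // epoch milliseconds
}

// Destructuring in the signature keeps call sites self-describing
// and makes it harder to swap arguments of the same type.
function getTermQuery({ fieldName, fieldValue, start, end }: TermQueryParams) {
  return {
    bool: {
      filter: [
        { term: { [fieldName]: fieldValue } },
        { range: { '@timestamp': { gte: start, lte: end, format: 'epoch_millis' } } },
      ],
    },
  };
}
```

A call then reads `getTermQuery({ fieldName: 'kubernetes.pod.name', fieldValue: 'pod-1', start, end })`, so adding an optional parameter later does not change existing call sites.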

Research

Benchmark edge cluster results with a variant that uses a hard-coded list of fields instead of identifying all fields by itself


elasticmachine · 2021-11-09T12:54:41Z

Pinging @elastic/ml-ui (:ml)

walterra added Meta :ml apm:correlations labels Nov 9, 2021

walterra self-assigned this Nov 9, 2021

peteharverson unassigned walterra Dec 3, 2021

walterra mentioned this issue Aug 11, 2022

[ML] APM Correlations: Fix chart errors caused by inconsistent histogram range steps. #138259

Merged

3 tasks
