Implement/smarter index selection #5642

spalger · 2015-12-11T01:28:18Z

When using segmented fetch we currently assume that documents are stored in index patterns based on time, but this is no longer a safe assumption. Since documents can be in any order we need to fetch documents from every index and then sort them client side. Since fetching the documents can take some time, we are using the bounds of the time field in an index to try and identify indices which can't produce hits for a result set, and then not fetch their documents.

It works like this:

get the list of indices and the min/max of their timefield
fetch the entire sample size from each index until we have enough documents to satisfy the sample.
for each remaining index
- if the min/max of it's timefield overlaps the min/max of the fetched documents also fetch the documents for that index pattern
- otherwise fetch with size=0

Issues:

it seems that the loading indicator on the histogram is not happy

For #5605

… implement/smarterIndexSelection

When using segmented fetch we currently assume that documents are stored in index patterns based on time, but this is no long necessary with the use of the field stats API. Since documents can be in any order we need to fetch documents from every index and then sort them client side. Since fetching the documents can take some time, we are using the bounds of the time field in an index to try and identify indices which can't produce hits for a result set, and then not fetch their documents. It works like this: - get the list of indices and the min/max of their timefield - fetch the entire sample size from each index until we have enough documents to satisfy the sample. - for each remaining index pattern - if the min/max of it's timefield overlaps the min/max of the fetched documents also fetch the documents for that index pattern - otherwise fetch with size=0

epixa · 2015-12-11T16:48:23Z

This isn't for 4.2.2 since the field stats stuff was added in 4.3.0.

epixa · 2015-12-11T16:50:15Z

Please correct me if I'm wrong!

spalger · 2015-12-11T17:03:29Z

src/ui/public/courier/fetch/request/segmented.js

+      const hitWindow = this._hitWindow;
+
+      // the order of documents isn't important, just get us more
+      if (!this._sortFn) return Math.max(this._desiredSize - hitWindow.size, 0);


hitWindow is used before it is verified to exist

rashidkpc · 2015-12-11T17:06:52Z

Get rid of the mix of es5 and es6. Stick with es5 in this pull because it changes functionality. You're welcome to convert to es6 in a separate pull that only contains es6 changes.

spalger · 2015-12-11T17:08:08Z

src/ui/public/courier/fetch/request/segmented.js

+          .sort(this._sortFn)
+          .slice(0, this._desiredSize);
+        });
+      }


merged.hits.hits needs to be sliced even if this._sortFn is not defined

…rterIndexSelection

…xSelection

…rterIndexSelection

epixa · 2015-12-11T21:31:01Z

src/plugins/kibana/public/discover/controllers/discover.js


        notify.event('flatten hit and count fields', function () {
-          var counts = $scope.fieldCounts;
+          var counts = $scope.fieldCounts = (sortFn ? {} : $scope.fieldCounts) || {};


Doesn't need to block this PR, but this line is pretty complicated - two assignments, 2 conditionals, 3 branches. In the future we should try to err on the side of multiple lines for something like this.

I hoped no one would notice

epixa · 2015-12-11T22:45:53Z

I'm all for changing the default value of desiredSize to undefined, but what do you think about using isFinite() (either from lodash or Number) instead of dealing with null coercion in a bunch of places. It's a little more explicit and expressive.

Fixes #5642

When using segmented fetch we currently assume that documents are stored in index patterns based on time, but this is no long necessary with the use of the field stats API. Since documents can be in any order we need to fetch documents from every index and then sort them client side. Since fetching the documents can take some time, we are using the bounds of the time field in an index to try and identify indices which can't produce hits for a result set, and then not fetch their documents. It works like this: - get the list of indices and the min/max of their timefield - fetch the entire sample size from each index until we have enough documents to satisfy the sample. - for each remaining index pattern - if the min/max of it's timefield overlaps the min/max of the fetched documents also fetch the documents for that index pattern - otherwise fetch with size=0 Fixes #5642

Fixes #5642

When using segmented fetch we currently assume that documents are stored in index patterns based on time, but this is no long necessary with the use of the field stats API. Since documents can be in any order we need to fetch documents from every index and then sort them client side. Since fetching the documents can take some time, we are using the bounds of the time field in an index to try and identify indices which can't produce hits for a result set, and then not fetch their documents. It works like this: - get the list of indices and the min/max of their timefield - fetch the entire sample size from each index until we have enough documents to satisfy the sample. - for each remaining index pattern - if the min/max of it's timefield overlaps the min/max of the fetched documents also fetch the documents for that index pattern - otherwise fetch with size=0 Fixes #5642

elasticsearch-bot · 2015-12-11T23:22:09Z

Court Ewing merged this into the following branches!

Branch	Commits
4.3.1	`3a55e6c`, `7619843`, `b78e797`, `c23e839`, `6c9ac7b`, `66cb5e8`, `4bff792`, `875d445`, `4180d40`

Fixes #5642

spalger added 2 commits December 10, 2015 16:55

Merge branch 'implement/indexPatternToIndexListDetailedResponse' into…

4222033

… implement/smarterIndexSelection

spalger added v4.4.0 v5.0.0 v4.3.1 v4.2.2 labels Dec 11, 2015

epixa removed the v4.2.2 label Dec 11, 2015

spalger reviewed Dec 11, 2015
View reviewed changes

spalger added 6 commits December 11, 2015 11:47

Merge branch 'master' of github.com:elastic/kibana into implement/sma…

08c4d48

…rterIndexSelection

Merge branch 'rename/segmentedCreateQueue' into implement/smarterInde…

40064f3

…xSelection

[courier/segmented/createQueue] fix tests

76c5ff6

[es6] remove from files that are currently es5

4d53372

Merge branch 'master' of github.com:elastic/kibana into implement/sma…

a9a1aea

…rterIndexSelection

[courier/segmented] stub out the new toDetailedIndexList method

a4a12f1

spalger added the review label Dec 11, 2015

spalger added 4 commits December 11, 2015 12:06

[courier/segmented] verify hitWindow is defined if possible

ce89bd9

[courier/segmented] slice merged hits to size on any modification

86af24c

[discover] always tell segmented request our desired size

89a1fe6

[courier/segmented] don't assume that desiredSize will always be set

d134842

epixa reviewed Dec 11, 2015
View reviewed changes

spalger mentioned this pull request Dec 11, 2015

Resort even when sorting by time #5633

Closed

[courier/segmented][disocver] hygiene

e07f8ac

spalger force-pushed the implement/smarterIndexSelection branch from 24fd838 to e07f8ac Compare December 11, 2015 22:52

spalger pushed a commit that referenced this pull request Dec 11, 2015

verify hitWindow is defined if possible

cc3aaf7

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

slice merged hits to size on any modification

a6785ad

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

always tell segmented request our desired size

cf377fb

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

don't assume that desiredSize will always be set

27fd939

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

hygiene

c5aa38f

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

fix tests

f58b307

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

remove from files that are currently es5

bb957ad

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

stub out the new toDetailedIndexList method

c723e0d

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

verify hitWindow is defined if possible

ef23dc2

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

slice merged hits to size on any modification

3531da9

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

always tell segmented request our desired size

d0367b8

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

don't assume that desiredSize will always be set

726d2ba

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

hygiene

65bca7e

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

fix tests

7619843

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

remove from files that are currently es5

b78e797

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

stub out the new toDetailedIndexList method

c23e839

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

verify hitWindow is defined if possible

6c9ac7b

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

slice merged hits to size on any modification

66cb5e8

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

always tell segmented request our desired size

4bff792

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

don't assume that desiredSize will always be set

875d445

Fixes #5642

spalger pushed a commit that referenced this pull request Dec 11, 2015

hygiene

4180d40

Fixes #5642

epixa mentioned this pull request Dec 11, 2015

Search results always sorted by _index #5605

Closed

spalger deleted the implement/smarterIndexSelection branch February 25, 2016 22:48

epixa added v5.0.0-alpha1 and removed v5.0.0-alpha1 labels Mar 31, 2016

snyk-bot mentioned this pull request May 27, 2021

[Snyk] Fix for 1 vulnerabilities larrycameron80/kibana#87

Open

larrycameron80 mentioned this pull request Jan 10, 2023

[Snyk] Fix for 1 vulnerabilities larrycameron80/kibana#154

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement/smarter index selection #5642

Implement/smarter index selection #5642

spalger commented Dec 11, 2015

epixa commented Dec 11, 2015

epixa commented Dec 11, 2015

spalger Dec 11, 2015

rashidkpc commented Dec 11, 2015

spalger Dec 11, 2015

epixa Dec 11, 2015

spalger Dec 11, 2015

epixa commented Dec 11, 2015

elasticsearch-bot commented Dec 11, 2015

Implement/smarter index selection #5642

Implement/smarter index selection #5642

Conversation

spalger commented Dec 11, 2015

epixa commented Dec 11, 2015

epixa commented Dec 11, 2015

spalger Dec 11, 2015

Choose a reason for hiding this comment

rashidkpc commented Dec 11, 2015

spalger Dec 11, 2015

Choose a reason for hiding this comment

epixa Dec 11, 2015

Choose a reason for hiding this comment

spalger Dec 11, 2015

Choose a reason for hiding this comment

epixa commented Dec 11, 2015

elasticsearch-bot commented Dec 11, 2015