[Logs+] Implement Logs Dataset selector #159907

tonyghiani · 2023-06-19T10:44:20Z

📓 Summary

Closes https://github.com/elastic/observability-dev/issues/2655

This PR introduces a customized log consumption experience in the Discover plugin. By leveraging the new discover_log_explorer plugin and utilizing the discover.customize functionality, we have curated a more tailored user experience.

The key feature of this implementation is the DatasetSelector component, which replaces the original Discover DataViewPicker. It handles the retrieval, rendering, and navigation of integrations and data streams related to logs, providing an improved user interface.

This PR involves significant development efforts, including the creation of the discover_log_explorer plugin, implementation of services, state machines, custom hooks, and enhancements to presentational components. The following overview will help reviewers understand the responsibilities of each component in this implementation.

demo-selector-fhd.mov

DatasetsService & DatasetsClient

This PR introduces the DatasetsService, which mediates access to the newly implemented DatasetsClient. During the plugin's lifecycle, the DatasetsService exposes a client property through its start() method, providing convenient access to a DatasetsClient instance.

The DatasetsClient is responsible for abstracting the data fetching process for two endpoints: the integrations endpoint and the data streams listing endpoint. These endpoints are utilized to populate the selector options in the user interface. To facilitate this, the DatasetsClient exposes the findIntegrations and findDatasets methods, which handle the respective data fetching.
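
A minimal sketch of how this split might look. Only the DatasetsService, DatasetsClient, start(), client, findIntegrations and findDatasets names come from the description above; the `HttpGet` type, the endpoint paths, the query parameter names and the response shapes are assumptions made for illustration.

```typescript
// Hypothetical HTTP abstraction; the real plugin would receive Kibana's HTTP service.
type HttpGet = (path: string, query: Record<string, string>) => Promise<unknown>;

// Assumed endpoint paths, for illustration only.
const INTEGRATIONS_PATH = '/example/integrations';
const DATASETS_PATH = '/example/datasets';

// Assumed response item shapes.
interface Integration {
  name: string;
  datasets: string[];
}

interface Dataset {
  name: string;
}

class DatasetsClient {
  private readonly get: HttpGet;

  constructor(get: HttpGet) {
    this.get = get;
  }

  // Fetches the integrations shown in the first level of the selector.
  async findIntegrations(nameQuery = ''): Promise<Integration[]> {
    const response = (await this.get(INTEGRATIONS_PATH, { nameQuery })) as {
      items: Integration[];
    };
    return response.items;
  }

  // Fetches the data streams listing.
  async findDatasets(datasetQuery = ''): Promise<Dataset[]> {
    const response = (await this.get(DATASETS_PATH, { datasetQuery })) as {
      items: Dataset[];
    };
    return response.items;
  }
}

class DatasetsService {
  private client: DatasetsClient | undefined;

  // Called during the plugin's start phase; always returns the same client instance.
  start(deps: { get: HttpGet }): { client: DatasetsClient } {
    if (this.client === undefined) {
      this.client = new DatasetsClient(deps.get);
    }
    return { client: this.client };
  }
}
```

Keeping the HTTP dependency behind a constructor argument means the client can be exercised with a fake `get` function in tests, without a running server.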

Discover Customization

The critical part of this work is where the customization is applied.
Inside public/plugin.tsx, we lazy load and create the CustomDatasetSelector, injecting the required dependencies. The component already encapsulates all the logic required to make the selector work with the external APIs.
We kept the data fetching logic separate from how the selector works: all the data and event handlers are passed into the UI component as props.

discover.customize(
  DISCOVER_LOG_EXPLORER_PROFILE_ID,
  ({ customizations, stateContainer }) => {

    customizations.set({
      id: 'search_bar',
      CustomDataViewPicker: createLazyCustomDatasetSelector({
        datasetsClient: datasetsService.client,
        stateContainer,
      }),
    });
    ...

Data fetching state machines & custom hooks

To handle the data fetching of integrations and unmanaged data streams, we created two separate state machines, one per resource. Each machine handles the actions related to its resource, such as remote search, in-memory search, and error handling.

Integration machine and useIntegrations

The integrations state machine handles automatic data fetching of the resources and additionally provides transitions for loading more integrations, searching integrations by HTTP request, searching locally into integration streams, and all the related loading and error handling states.

It is then interpreted inside the useIntegrations custom hook, which exposes the fetched data and handlers for all the above-mentioned actions.
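
As a rough illustration of the transitions listed above, here is a sketch written as a plain transition function rather than with the state machine library the plugin uses. The state names, event names and context shape are hypothetical, and the sketch only covers the initial load, loading more entries, remote search and error handling.

```typescript
// Hypothetical state names; the plugin's machine may use different ones.
type StateValue = 'loading' | 'loaded' | 'loadingMore' | 'failed';

interface MachineContext {
  integrations: string[];
  search: string;
}

interface MachineState {
  value: StateValue;
  context: MachineContext;
  error?: string;
}

// Hypothetical events covering the transitions described in the text.
type MachineEvent =
  | { type: 'FETCH_SUCCEEDED'; integrations: string[] }
  | { type: 'FETCH_FAILED'; error: string }
  | { type: 'LOAD_MORE' }
  | { type: 'SEARCH'; search: string }
  | { type: 'RELOAD' };

// The machine starts fetching automatically, so its initial state is "loading".
const initialState: MachineState = {
  value: 'loading',
  context: { integrations: [], search: '' },
};

function transition(state: MachineState, event: MachineEvent): MachineState {
  const { context } = state;
  switch (event.type) {
    case 'FETCH_SUCCEEDED':
      if (state.value === 'loading') {
        // A first page (or a new search) replaces the list.
        return { value: 'loaded', context: { ...context, integrations: event.integrations } };
      }
      if (state.value === 'loadingMore') {
        // A subsequent page is appended to what is already loaded.
        const integrations = [...context.integrations, ...event.integrations];
        return { value: 'loaded', context: { ...context, integrations } };
      }
      return state;
    case 'FETCH_FAILED':
      return state.value === 'loading' || state.value === 'loadingMore'
        ? { value: 'failed', context, error: event.error }
        : state;
    case 'LOAD_MORE':
      return state.value === 'loaded' ? { value: 'loadingMore', context } : state;
    case 'SEARCH':
      // A remote search starts over from an empty list.
      return { value: 'loading', context: { integrations: [], search: event.search } };
    case 'RELOAD':
      return state.value === 'failed' ? { value: 'loading', context } : state;
    default:
      return state;
  }
}
```

Events that make no sense in the current state (for example LOAD_MORE while a request is in flight) return the state unchanged, which is what keeps the handlers exposed by the hook free of side effects.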

Datasets machine and useDatasets

This machine is similar to the integrations state machine, but simpler: the data streams search can only happen through HTTP requests, and there is no pagination, so it does not need to handle loading more entries.

It is interpreted inside the useDatasets custom hook, which also exposes the fetched data and handlers for the available actions.

DatasetSelector

The DatasetSelector component contains all the logic that manages the navigation and searches across the different panels that render integrations, integrations' streams or unmanaged streams.
As the datasets either come from different APIs or are already held in memory, the search works as follows:

When listing the integrations list (first level of the EuiContextMenu), the search is done with an HTTP request.
When listing the data streams list for a specific integration (second level of the EuiContextMenu), the search is done in-memory, filtering and sorting directly in the client.
When listing the unmanaged data streams list (second level of the EuiContextMenu), the search is done again with an HTTP request.
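
The three cases above can be sketched as a small function that decides, per panel, whether a search becomes an HTTP request or an in-memory filter. The panel identifiers, the `SearchPlan` type and the function names are hypothetical and only serve the illustration.

```typescript
// Hypothetical identifiers for the three panels of the selector.
type PanelId = 'integrations' | 'integrationStreams' | 'unmanagedStreams';
type SortOrder = 'asc' | 'desc';

// Either a description of the request to send, or results computed locally.
type SearchPlan =
  | { kind: 'remote'; endpoint: 'integrations' | 'datasets'; name: string; sortOrder: SortOrder }
  | { kind: 'local'; results: string[] };

// In-memory search: case-insensitive substring match, then alphabetical sort.
function filterAndSort(streams: readonly string[], name: string, sortOrder: SortOrder): string[] {
  const needle = name.trim().toLowerCase();
  const matches = streams.filter((stream) => stream.toLowerCase().includes(needle));
  matches.sort((a, b) => a.localeCompare(b));
  return sortOrder === 'asc' ? matches : matches.reverse();
}

function planSearch(
  panel: PanelId,
  name: string,
  sortOrder: SortOrder,
  loadedStreams: readonly string[] = []
): SearchPlan {
  if (panel === 'integrations') {
    // First level: integrations are searched with an HTTP request.
    return { kind: 'remote', endpoint: 'integrations', name, sortOrder };
  }
  if (panel === 'unmanagedStreams') {
    // Second level, unmanaged streams: also searched with an HTTP request.
    return { kind: 'remote', endpoint: 'datasets', name, sortOrder };
  }
  // Second level, streams of one integration: already loaded, so filter in the client.
  return { kind: 'local', results: filterAndSort(loadedStreams, name, sortOrder) };
}
```

For example, `planSearch('integrationStreams', 'acc', 'asc', streams)` never touches the network, while the same call with `'integrations'` only describes the request that should be sent.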

To handle these possible user journeys correctly without side effects, we created another state machine and exposed its actions with an internal useDatasetSelector custom hook.

Next steps

This component will change quite a lot until we reach a final design. As soon as a first solid MVP is defined for production, a complete test suite for the component will be implemented, along with a more generic functional test for the core customization features.

weltenwort

I left a few final comments below. Thanks for following up on the previous ones.

One thing I noticed during testing is that a user might want to reload the list of integrations or the list of uncategorized datasets even when no error occurred before. During testing, for example, I started a shipper in the background but its new data stream didn't show up until I navigated away and back. Any idea how we could allow triggering a reload without cluttering the UI?

...gins/discover_log_explorer/public/components/dataset_selector/state_machine/state_machine.ts

x-pack/plugins/discover_log_explorer/public/components/dataset_selector/utils.tsx

x-pack/plugins/discover_log_explorer/public/hooks/use_intersection_ref.ts

x-pack/plugins/discover_log_explorer/public/components/dataset_selector/dataset_selector.tsx

x-pack/plugins/discover_log_explorer/public/components/dataset_selector/constants.tsx

x-pack/test/functional/apps/discover_log_explorer/customization.ts

tonyghiani · 2023-06-27T17:09:28Z

> One thing I noticed during testing is that a user might want to reload the list of integrations or the list of uncategorized datasets even when no error occurred before. During testing, for example, I started a shipper in the background but its new data stream didn't show up until I navigated away and back. Any idea how we could allow triggering a reload without cluttering the UI?

The data is not reloaded after switching panels so that we can keep the loaded integrations in memory after some scrolling, without losing the loaded entries.
This also lets us keep the current search results while navigating between panels, without having to repeat the search each time, and it enables the client-side cache for those entries, which is pretty handy when performing multiple searches.

I honestly didn't think about the case where the user would like to see newly available datasets or integrations in near real time, since ideally the installed integrations and unmanaged datasets shouldn't change very often.
I think the trade-off of re-fetching the API results or reducing the cache TTL wouldn't favour the smooth user experience we provide now, compared to requiring the user to refresh the page (which is something they eventually do anyway when expecting to see new results). My 2 cents: I'm mostly talking from a developer perspective, but I'm sure that for other apps' use cases, having faster feedback on new content is critical 🤓.
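
To make the trade-off concrete, here is a rough sketch of a time-bounded client-side cache. It is not the cache the plugin actually uses: the `TtlCache` class, its `ttlMs` parameter and the injectable clock are invented for illustration.

```typescript
// Hypothetical cache keyed by search parameters, with a fixed time to live.
class TtlCache<V> {
  private readonly entries = new Map<string, { value: V; expiresAt: number }>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // The clock is injectable so that expiry can be exercised without waiting.
  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  // Returns the cached value, or undefined when it is missing or expired.
  get(key: string): V | undefined {
    const entry = this.entries.get(key);
    if (entry === undefined) {
      return undefined;
    }
    if (entry.expiresAt <= this.now()) {
      this.entries.delete(key);
      return undefined;
    }
    return entry.value;
  }

  set(key: string, value: V): void {
    this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs });
  }

  // Drops every entry, so that the next lookup has to hit the API again.
  clear(): void {
    this.entries.clear();
  }
}
```

A shorter `ttlMs` surfaces new entries sooner at the cost of more requests, while `clear()` is the kind of hook an explicit reload action could call without changing the TTL at all.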

If we still want to give the user a way to refresh the list without reloading the page, I can't think of any immediate behind-the-scenes solution, whereas a small call to action such as a button would give them this option straight away. Maybe something like this?

weltenwort · 2023-06-27T18:38:57Z

Yes, maybe my testing procedure was not exactly representative of normal usage. Once we have the selection state restoration in place a reload will be less jarring anyway.

weltenwort

Amazing work overall, very well done 👏

Aside from the great readability I was pleasantly surprised to see that you managed to not increase the size of any other bundle in any significant way. 🎉

As agreed upon via other channels, there are a few high-priority follow-ups that we will want to implement next:

The merge of this PR will unblock the implementation of

Thank you for all the effort and care you invested into this!

weltenwort · 2023-06-28T10:39:29Z

src/dev/storybook/aliases.ts

@@ -28,6 +28,7 @@ export const storybookAliases = {
  dashboard: 'src/plugins/dashboard/.storybook',
  data: 'src/plugins/data/.storybook',
  discover: 'src/plugins/discover/.storybook',
+  discover_log_explorer: 'x-pack/plugins/discover_log_explorer/.storybook',


Could we also add it to .buildkite/scripts/steps/storybooks/build_and_upload.ts as recently documented in #160473?

kibana-ci · 2023-06-28T13:15:19Z

💚 Build Succeeded

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`discoverLogExplorer`	-	132	+132

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`discover`	68	71	+3
`fleet`	1071	1073	+2
total			+5

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`discover`	529.8KB	529.8KB	-8.0B
`discoverLogExplorer`	-	184.2KB	+184.2KB
`fleet`	980.7KB	979.9KB	-813.0B
`lens`	1.3MB	1.3MB	-8.0B
`unifiedSearch`	212.9KB	212.9KB	+16.0B
total			+183.4KB

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id	before	after	diff
`discover`	8	14	+6
`fleet`	35	36	+1
total			+7

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`discoverLogExplorer`	-	4.3KB	+4.3KB
`fleet`	132.8KB	133.7KB	+883.0B
`stackAlerts`	19.1KB	19.1KB	-8.0B
total			+5.2KB

Unknown metric groups

API count

id	before	after	diff
`discover`	88	97	+9
`fleet`	1187	1189	+2
total			+11

async chunk count

id	before	after	diff
`discoverLogExplorer`	-	5	+5

ESLint disabled in files

id	before	after	diff
`discoverLogExplorer`	-	2	+2

ESLint disabled line counts

id	before	after	diff
`discoverLogExplorer`	-	3	+3
`enterpriseSearch`	14	16	+2
`securitySolution`	413	417	+4
total			+9

Total ESLint disabled count

id	before	after	diff
`discoverLogExplorer`	-	5	+5
`enterpriseSearch`	15	17	+2
`securitySolution`	492	496	+4
total			+11

History

💔 Build #138780 failed 033af2d
💚 Build #138728 succeeded b214654
💚 Build #138671 succeeded cbba119
💚 Build #138590 succeeded 8184fb9
💚 Build #138380 succeeded e8ec06e

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

## 📓 Summary

Closes #160425

After the [first implementation of the log-explorer profile](#159907), we wanted to restore the selection of the dataset for a user when landing on the Discover log-explorer profile.

Since we create an ad-hoc data view for Discover starting from the dataset details, we needed to develop a system for intercepting the `index` query parameter (which is used by Discover as the source of truth for restoring a data view), create our ad-hoc data view and store in the URL an encoded ID with the required details to restore the selection.

The following video shows the user journey for:

- Landing on the log-explorer profile with no index param, nothing to restore and fallback to All log datasets.
- Landing on the log-explorer profile invalid index param, notify about failure and fallback to All log datasets.
- Select a different dataset, applies the new data view and update the URL. When the URL is accessed directly, restore and initialize the data view for the selection.
- Navigate back and forth in the browser history, restoring the selection and data view on `index` param changes.

https://github.com/elastic/kibana/assets/34506779/37a212ee-08e4-4e54-8e42-1d739c38f164

## 💡 Reviewer hints

To have better control over the page selection and the restore process, we prepared the DatasetSelector component for [being controlled by the parent component](#160971). Having that ready, we now implemented a new top-level state machine with the following responsibilities:

- Re-initialize (decompress/decode) the dataset selection from the `index` query params.
- Derive and set into Discover state a new ad-hoc data view.
- Keep track of new dataset selection changes and update the URL state and the current data view.
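
The restore step could look roughly like the sketch below, which covers the three landing cases (no `index` param, invalid param, valid param). It skips compression and simply URI-encodes a JSON payload; the `DatasetSelection` shape, the fallback value and the function names are assumptions made for illustration, not the encoding this change actually uses.

```typescript
// Hypothetical details needed to rebuild the ad-hoc data view.
interface DatasetSelection {
  integration?: string;
  dataset: string;
}

// Assumed fallback standing in for "All log datasets".
const ALL_LOG_DATASETS: DatasetSelection = { dataset: 'logs-*-*' };

type RestoreResult =
  | { status: 'restored'; selection: DatasetSelection }
  | { status: 'empty'; selection: DatasetSelection }
  | { status: 'invalid'; selection: DatasetSelection };

// Produces the value stored in the `index` query parameter.
function encodeSelection(selection: DatasetSelection): string {
  return encodeURIComponent(JSON.stringify(selection));
}

function isDatasetSelection(value: unknown): value is DatasetSelection {
  if (typeof value !== 'object' || value === null) {
    return false;
  }
  const candidate = value as Record<string, unknown>;
  const integration = candidate['integration'];
  return (
    typeof candidate['dataset'] === 'string' &&
    (integration === undefined || typeof integration === 'string')
  );
}

// Decodes the `index` parameter, falling back to all log datasets when it is
// missing or cannot be decoded into a valid selection.
function restoreSelection(indexParam: string | null): RestoreResult {
  if (indexParam === null || indexParam === '') {
    return { status: 'empty', selection: ALL_LOG_DATASETS };
  }
  try {
    const parsed: unknown = JSON.parse(decodeURIComponent(indexParam));
    if (isDatasetSelection(parsed)) {
      return { status: 'restored', selection: parsed };
    }
  } catch {
    // Malformed encoding or JSON: handled by the fallback below.
  }
  return { status: 'invalid', selection: ALL_LOG_DATASETS };
}
```

The `invalid` status is what a caller would use to notify the user about the failed restore before falling back.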
<img width="1224" alt="log-explorer-machine" src="https://github.com/elastic/kibana/assets/34506779/67e3ff17-dc3f-4dcf-b6c0-f40dbbea2d44">

We found a race condition between the Discover URL initialization + data view initialization against the log-explorer profile customizations being applied. To guarantee we correctly initialize the state machine and restore the selection before Discover goes through its initialization steps, we need to wait for the customization service to exist in Discover so that also the customization callbacks are successfully invoked.

---------

Co-authored-by: Marco Antonio Ghiani <[email protected]>
Co-authored-by: kibanamachine <[email protected]>

Marco Antonio Ghiani and others added 30 commits June 19, 2023 12:25

feat(observability-logs): create plugin boilerplate

9ab4529

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

6bcf4f8

[CI] Auto-commit changed files from 'node scripts/lint_ts_projects --fix'

eeecf6f

[CI] Auto-commit changed files from 'node scripts/generate codeowners'

eea7560

[CI] Auto-commit changed files from 'node scripts/build_plugin_list_docs'

1a6fb0b

feat(observability-logs): try discover customization

d2a76bb

[CI] Auto-commit changed files from 'node scripts/build_plugin_list_docs'

559bc9c

feat(observability-logs): update profileId

c775f49

feat(observability-logs): add lazy loading and wip component

5d957b5

feat(observability-logs): add story for data stream selector

b68f33b

feat(observability-logs): create nested panels

826cb03

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

7ed6317

[CI] Auto-commit changed files from 'node scripts/lint_ts_projects --fix'

4d2d8cd

feat(observability-logs): apply customization to hide discover controls

6660674

feat(observability-logs): add integration types

c2bd77e

feat(observability-logs): update integrations tree

80b4dc2

refactor(observability-logs): remove temporary index icon from data view selector

cc23a2a

feat(observability-logs): create integrations state machine folder

d102bdb

[CI] Auto-commit changed files from 'node scripts/lint_ts_projects --fix'

60436ed

refactor(observability-log): allow ad-hoc data view creation with specs

bc227c2

chore(observability-log): fix jest config

ee42d87

chore(observability-log): add optimizer bundle limit

dd530db

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

ec969ef

feat(observability-log): draft integrations state machine

b39b901

feat(observability-log): adding integrations service

2220c29

[CI] Auto-commit changed files from 'node scripts/precommit_hook.js --ref HEAD~1..HEAD --fix'

dd77c92

[CI] Auto-commit changed files from 'node scripts/lint_ts_projects --fix'

b3cb323

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

790df31

feat(observability-log): use integration service

c47afcc

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

01fb593

test(discover-log-explorer): add basic test to assert customizations are applied

e8ec06e

jcger approved these changes Jun 27, 2023

tonyghiani mentioned this pull request Jun 27, 2023

[Log Explorer] Missing test suite for DatasetSelector in log-explorer profile #160627

Closed

weltenwort reviewed Jun 27, 2023

Marco Antonio Ghiani and others added 5 commits June 27, 2023 18:11

refactor(discover-log-explorer): switch to pure action for maybeRestoreSearchResult

edcb90e

refactor(discover-log-explorer): move spyRef into buildIntegrationsTree util

0ffdf0a

refactor(discover-log-explorer): remove import

389c097

refactor(discover-log-explorer): add explicit close action

7d605a7

Update x-pack/test/functional/apps/discover_log_explorer/customization.ts

Co-authored-by: Felix Stürmer <[email protected]>

8184fb9

Marco Antonio Ghiani and others added 2 commits June 27, 2023 23:29

refactor(discover-log-explorer): add copies changes ans move conditional

cbba119

Merge branch 'main' into 2655-implement-log-data-stream-selector-rebased

b214654

weltenwort approved these changes Jun 28, 2023

refactor(discover-log-explorer): add storybook alias to buildkite

033af2d

weltenwort mentioned this pull request Jun 28, 2023

[Log Explorer] Add specialized locators #158382

Closed

Merge branch 'main' into 2655-implement-log-data-stream-selector-rebased

2893978

tonyghiani merged commit 6a0d6de into elastic:main Jun 28, 2023

tonyghiani deleted the 2655-implement-log-data-stream-selector-rebased branch June 28, 2023 13:20

kibanamachine added v8.10.0 backport:skip This commit does not require backporting labels Jun 28, 2023

This was referenced Jun 29, 2023

[Log Explorer] Include title and icon in installed packages API #160935

Closed

[Log Explorer] Add filter controls customization point #158561

Closed

tonyghiani mentioned this pull request Jul 7, 2023

[Logs+] Restore Dataset selection from page URL #161144

Merged

weltenwort mentioned this pull request Aug 17, 2023

[Log Explorer] Convert the log explorer profile into a standalone app #164197

Closed

weltenwort mentioned this pull request Aug 28, 2023

[Explorer] Add Explorer app locator #164995

Closed
