feat: dataType.on('updated-docs') when docs update #396

gmaclennan · 2023-11-30T12:30:04Z

Emits an array of updated docs for the data type whenever the docs
update (rather than emitting one doc at a time, it waits for indexing to
complete, then emits all the docs that have been updated. If there are
several updates to a particular docId, then only the resolved head
version is returned).

Attaching this listener to a data type that has many values / is
frequently updated will have a performance impact, because large amounts
of data could be read from the database and there is no limit, e.g. if
this was attached to the observation dataType and a new device synced
with a project with 10,000 observations, an attached listener would be
called with 10,000 observations.

The main reason for this feature is for internal use for capabilities,
listening for changes and modifying permissions accordingly, initially in order to implement #268, which requires adding cores as device capabilities are updated.

This is implemented in a particular way with performance considerations in mind. My initial idea was to return all decoded indexed docs from the index writer, but this would have a high performance cost because:

the plan is to move the index writer to a separate worker thread, and sending all docs back over the worker message channel would be expensive.
we are not interested in subscribing to all doc updates (it wouldn't make sense to listen for all observations being updated because it would be too many, particularly after a large sync),
The docs that are indexed could contain multiple versions of a doc id (e.g. all the edits to that doc) and for an 'update' event we're only actually interested in the resolved "head" doc, so we have to re-request the doc from the index anyway (e.g. getByDocId(), which returns the "head").

So, the indexer now returns the docIds of the documents it indexed, then the DataStore de-dupes those and emits them for each schemaName (but only if a listener is attached) when indexing finishes, and the DataType class listens for updated doc ids (but only if a listener is attached) and looks up the docs by those docIds, and emits the updated-docs event.

Emits an array of updated docs for the data type whenever the docs update (rather than emitting one doc at a time, it waits for indexing to complete, then emits all the docs that have been updated. If there are several updates to a particular docId, then only the resolved `head` version is returned). Attaching this listener to a data type that has many values / is frequently updated will have a performance impact, because large amounts of data could be read from the database and there is no limit, e.g. if this was attached to the `observation` dataType and a new device synced with a project with 10,000 observations, an attached listener would be called with 10,000 observations. The main reason for this feature is for internal use for capabilities, listening for changes and modifying permissions accordingly.

achou11

lgtm. seems like you ran into some potentially gnarly TS issues - think you made the right choice to ignore them 😄

src/datastore/index.js

Co-authored-by: Andrew Chou <[email protected]>

gmaclennan · 2023-11-30T23:02:35Z

lgtm. seems like you ran into some potentially gnarly TS issues - think you made the right choice to ignore them 😄

Yeah, I wasted quite a lot of time trying to make TS happy, then just had a realization "why am I bothering". In the internals of functions like these dealing with complex data types, I don't think jumping through hoops to keep TS happy really gains us anything, especially we're hitting limitations of TS. The issue I keep getting frustrated by is trying to narrow a type with arrayOfValidTypes.includes(maybeValidType), but dependent types is often an issue (TS does not support them).

gmaclennan self-assigned this Nov 30, 2023

gmaclennan requested a review from achou11 November 30, 2023 12:30

gmaclennan mentioned this pull request Nov 30, 2023

Avoid leaking core keys and pre-have messages to devices with the project key, but without project access #268

Open

6 tasks

gmaclennan added 3 commits November 30, 2023 21:45

fix: make sure we clear pending emits

70d7ba0

fix datastore test

976183f

attempt to fix flakey test

98a6aa7

achou11 mentioned this pull request Nov 30, 2023

Feat: add fields to invites #393

Merged

achou11 approved these changes Nov 30, 2023

View reviewed changes

src/datastore/index.js Outdated Show resolved Hide resolved

Update src/datastore/index.js

60d5783

Co-authored-by: Andrew Chou <[email protected]>

gmaclennan merged commit 2ef893e into main Nov 30, 2023
4 of 7 checks passed

gmaclennan deleted the feat/data-type-updated-docs-event branch November 30, 2023 23:02

optic-release-automation bot mentioned this pull request Dec 4, 2023

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.3 #404

Merged

optic-release-automation bot mentioned this pull request Dec 12, 2023

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.4 #412

Merged

optic-release-automation bot mentioned this pull request Mar 14, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.5 #519

Merged

This was referenced Apr 11, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.6 #558

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.7 #573

Merged

optic-release-automation bot mentioned this pull request Apr 30, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.8 #592

Merged

optic-release-automation bot mentioned this pull request May 9, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.9 #632

Merged

This was referenced Jun 20, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.10 #709

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.11 #711

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.12 #714

Merged

This was referenced Jul 25, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.13 #726

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.14 #740

Merged

optic-release-automation bot mentioned this pull request Aug 13, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.15 #748

Merged

This was referenced Aug 21, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.16 #758

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.17 #787

Merged

This was referenced Aug 29, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.18 #796

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.19 #798

Merged

This was referenced Aug 29, 2024

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.20 #802

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.21 #804

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.22 #808

Merged

[OPTIC-RELEASE-AUTOMATION] release/v9.0.0-alpha.23 #810

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: dataType.on('updated-docs') when docs update #396

feat: dataType.on('updated-docs') when docs update #396

gmaclennan commented Nov 30, 2023 •

edited

Loading

achou11 left a comment

gmaclennan commented Nov 30, 2023

feat: dataType.on('updated-docs') when docs update #396

feat: dataType.on('updated-docs') when docs update #396

Conversation

gmaclennan commented Nov 30, 2023 • edited Loading

achou11 left a comment

Choose a reason for hiding this comment

gmaclennan commented Nov 30, 2023

gmaclennan commented Nov 30, 2023 •

edited

Loading