Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

estuary-cdk idle backoff tweak & readme #1330

Merged
merged 4 commits into from
Mar 8, 2024
Merged

estuary-cdk idle backoff tweak & readme #1330

merged 4 commits into from
Mar 8, 2024

Conversation

jgraettinger
Copy link
Member

@jgraettinger jgraettinger commented Mar 6, 2024

See commits

Workflow steps:

(How does one use this feature, and how has it changed)

Documentation links affected:

(list any documentation links that you created, or existing ones that you've identified as needing updates, along with a brief description)

Notes for reviewers:

(anything that might help someone review this PR)


This change is Reviewable

Some resources always have _some_ level of activity every time they're
checked, which would ordinarily prevent the resource from ever being
considered "caught up", preventing the connector from exiting.

If the LogCursor is a date-time, then additionally consider the task to
be idle if the LogCursor is less-than `interval` old (where interval is
configured on the ResourceConfig), and sleep until the cursor is
`interval` old before issuing a successive fetch.
@jgraettinger jgraettinger requested a review from jshearer March 6, 2024 20:23
Much like FetchChangesFn, FetchPageFn is now an AsyncGenerator which
yields documents, checkpoint-able PageCursors, or completes an
iteration of a resource.

This refactoring is consistent with recent updates to FetchChangesFn,
and is motivated by use cases where the fetched "page" could be quite
large and itself composed of multiple concurrent data fetches,
implemented as constituent AsyncGenerator instances.

By using an AsyncGenerator, such an implementation can immediately
yield from across those concurrent fetches as data arrives such that
data is spilled to disk and memory pressure kept low. Then, when the
scatter / gather data fetch completes, the entire "page" is
checkpointed.
@jgraettinger
Copy link
Member Author

@jshearer bump for this PR and the other please

Copy link
Contributor

@jshearer jshearer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM 👍

@jshearer jshearer merged commit ee34ea7 into main Mar 8, 2024
49 of 53 checks passed
@jgraettinger jgraettinger deleted the johnny/more-cdk branch March 8, 2024 16:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants