Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In Historical Retrieval from BQ join between source & entities is performed inside BQ #1110

Merged

Conversation

pyalex
Copy link
Collaborator

@pyalex pyalex commented Oct 29, 2020

Signed-off-by: Oleksii Moskalenko [email protected]

What this PR does / why we need it:

This PR prevents load of full (batch source) table from BQ to join it with entities in Spark.
Instead entities are being uploaded to BQ and original BQ sources are being replaced by views (with join) before passing batch source to historical retrieval job.

This optimization is only applied when user pass entities to method get_historical_features as pandas dataframe or as BigQuerySource.

Which issue(s) this PR fixes:

Fixes #

Does this PR introduce a user-facing change?:


@pyalex pyalex force-pushed the optimize-bq-historical-retrieval branch 2 times, most recently from 70d5a6a to a60a955 Compare October 29, 2020 09:50
@pyalex pyalex changed the title [WIP] In Historical Retrieval from BQ join with entity is performed inside BQ In Historical Retrieval from BQ join between source & entities is performed inside BQ Oct 29, 2020
pyalex added 14 commits October 29, 2020 18:28
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
@pyalex pyalex force-pushed the optimize-bq-historical-retrieval branch from a2fbb19 to 0ccdf8f Compare October 29, 2020 10:43
Signed-off-by: Oleksii Moskalenko <[email protected]>
@pyalex
Copy link
Collaborator Author

pyalex commented Oct 29, 2020

/test test-end-to-end-gcp

1 similar comment
@pyalex
Copy link
Collaborator Author

pyalex commented Oct 29, 2020

/test test-end-to-end-gcp

@pyalex
Copy link
Collaborator Author

pyalex commented Oct 29, 2020

/test test-end-to-end

Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
@pyalex
Copy link
Collaborator Author

pyalex commented Oct 30, 2020

/test test-end-to-end-gcp
/test test-end-to-end

Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
@khorshuheng khorshuheng self-requested a review November 2, 2020 04:09
@feast-ci-bot
Copy link
Collaborator

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: khorshuheng, pyalex

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@khorshuheng
Copy link
Collaborator

/lgtm

@feast-ci-bot feast-ci-bot merged commit 4c1cc09 into feast-dev:master Nov 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants