Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Entity value_type inference for Feature Repo registration #1538

Merged
merged 9 commits into from
Jun 2, 2021

Conversation

mavysavydav
Copy link
Collaborator

@mavysavydav mavysavydav commented May 6, 2021

Signed-off-by: David Y Liu [email protected]

What this PR does / why we need it:
This change enables users to register Entities without needing to specify the value_type because we'll be able to infer it based on finding the entity column in the table based on the Entity name and using that type. Milestone 2 of this RFC.

Which issue(s) this PR fixes:

Fixes #1500

Does this PR introduce a user-facing change?:

This change enables users to register Entities without needing to specify the value_type because we'll be able to infer it based on finding the entity column in the table based on the Entity name and using that type.  More details can be found in this doc as milestone 2 -- https://docs.google.com/document/d/1MkWvexE4e5nYWcQLELFnJ5o9OlJDKC2rn_USHMDT9dg/edit#heading=h.pqhio4s5uw2s. 

@feast-ci-bot
Copy link
Collaborator

Hi @mavysavydav. Thanks for your PR.

I'm waiting for a feast-dev member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@mavysavydav mavysavydav force-pushed the entityValueTypeInference branch from 5681099 to 1c52ba2 Compare May 6, 2021 17:19
@jklegar jklegar self-assigned this May 6, 2021
@mavysavydav mavysavydav force-pushed the entityValueTypeInference branch from f604d9d to 1af0ae8 Compare May 7, 2021 07:14
@jklegar
Copy link
Collaborator

jklegar commented May 10, 2021

/ok-to-test

@jklegar
Copy link
Collaborator

jklegar commented May 10, 2021

/kind feature

Copy link
Collaborator

@jklegar jklegar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @mavysavydav this looks great! The one thing I see is that there's also a way to apply a specific set of entities/feature views on the FeatureStore class here:


Note it's an "incremental apply" rather than a "total apply" so only the new/to-be-updated entities/feature views are passed in.
Would you be able to add entity inference in there as well?

sdk/python/feast/repo_operations.py Outdated Show resolved Hide resolved
@mavysavydav
Copy link
Collaborator Author

Sounds good, will take a look

@codecov-commenter
Copy link

codecov-commenter commented May 25, 2021

Codecov Report

Merging #1538 (6dc6104) into master (dfb029d) will increase coverage by 0.06%.
The diff coverage is 95.83%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #1538      +/-   ##
==========================================
+ Coverage   83.59%   83.65%   +0.06%     
==========================================
  Files          65       67       +2     
  Lines        5753     5812      +59     
==========================================
+ Hits         4809     4862      +53     
- Misses        944      950       +6     
Flag Coverage Δ
integrationtests 83.56% <95.83%> (+0.06%) ⬆️
unittests 77.85% <87.50%> (+0.49%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
sdk/python/feast/entity.py 96.36% <ø> (ø)
sdk/python/tests/utils/data_source_utils.py 100.00% <ø> (ø)
sdk/python/feast/repo_operations.py 32.17% <50.00%> (+0.17%) ⬆️
sdk/python/feast/feature_store.py 91.85% <88.88%> (-0.04%) ⬇️
sdk/python/feast/inference.py 95.45% <95.45%> (ø)
sdk/python/tests/conftest.py 100.00% <100.00%> (ø)
sdk/python/tests/test_feature_store.py 100.00% <100.00%> (ø)
sdk/python/tests/test_inference.py 100.00% <100.00%> (ø)
sdk/python/feast/telemetry.py 61.53% <0.00%> (-1.09%) ⬇️
sdk/python/feast/data_source.py 78.77% <0.00%> (-0.07%) ⬇️
... and 13 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update dfb029d...6dc6104. Read the comment docs.

@mavysavydav mavysavydav force-pushed the entityValueTypeInference branch from d25e11c to 3bb9524 Compare May 25, 2021 21:48
@mavysavydav
Copy link
Collaborator Author

Ready for another round of review. Sorry for the mess in this PR. Accidentally messed up my branch when syncing w master

@woop
Copy link
Member

woop commented May 25, 2021

Ready for another round of review. Sorry for the mess in this PR. Accidentally messed up my branch when syncing w master

Thanks @mavysavydav

@mavysavydav
Copy link
Collaborator Author

btw I made the assumption that for featurestore.apply, we'd only want to infer the entities based on the feature_views passed in as part of that apply call. The reason I thought this was reasonable is that the alternative suggests a feature_view has been applied with an entity that hasn't been defined yet. Though in the case of redefinitions of entities, inferring from previously applied FeatureViews seems like ok behavior but that won't be done with the current logic. Happy to reconsider if ppl disagree.

@mavysavydav mavysavydav force-pushed the entityValueTypeInference branch from fc7fd72 to 9e99e90 Compare June 2, 2021 06:16
@woop
Copy link
Member

woop commented Jun 2, 2021

/lgtm

@feast-ci-bot
Copy link
Collaborator

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mavysavydav, woop

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@feast-ci-bot feast-ci-bot merged commit 09fe2a6 into feast-dev:master Jun 2, 2021
woop pushed a commit that referenced this pull request Jun 7, 2021
* Added inferencing of Entity values types & corresponding test

Signed-off-by: David Y Liu <[email protected]>

* Updated example_feature_repo_with_inference.py

Signed-off-by: David Y Liu <[email protected]>

* Fixed lint issue introduced by previous commit

Signed-off-by: David Y Liu <[email protected]>

* Optimized infer_entity_value_type_from_feature_views

Signed-off-by: David Y Liu <[email protected]>

* Added entity data type inferencing to feature_store.py and organized code

Signed-off-by: David Y Liu <[email protected]>

* Made error messages more descriptive and improved its structuring

Signed-off-by: David Y Liu <[email protected]>

* Removed s comments by using conftest file

Signed-off-by: David Y Liu <[email protected]>

* Renamed variables for clarity and added clarifying comments

Signed-off-by: David Y Liu <[email protected]>

* changed file name from inference_helpers.py to inference.py

Signed-off-by: David Y Liu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add schema inferencing to feature views
5 participants