Add Feature Validation to Feast #172

kingkai620 · 2019-04-02T06:16:30Z

Having features been ingested by feast, i'm always need to check the missing value, generate the distribution and coverage statistics of the features. And then, it's necessary to detect features drift by looking at a series of feature, and to detect something like training-serving skew. I think the features can only be continued to ingested by training and serving if all these checks pass.

So，does feast have the plan to support these features? if not , what would be the standard approach to make these validation using feast?

tims · 2019-04-03T05:21:14Z

Hi,

Thanks for bringing this up. We definitely want to add more useful validation of features. Currently all we do is check the type matches. We'd like to be able to check they are within ranges, or regex's for strings etc. If you could elaborate on the type of validation you would require that would help us get started.

A note, if a feature does not pass validation, it gets thrown into an errors pile, so there might be some use cases, where you only want to be warned about it and still have the data accepted? I think in some use cases for example, people might want to keep accepting inputs if training serving skew occurs, but they would like to be able to monitor it or be notified.

Generating statistics about features is another thing we'd like to add that I think can be treated as a separate issue. If you could help us build a list of the sorts of statistics you'd like about a feature and how you would define them that would also help us get started. And which of these statistics would be needed for the validation you'd like to do.

We can use this issue as a discussion place and spin off other issues later.

woop · 2019-09-01T07:17:14Z

This is a hotly requested feature for Feast. We are planning to pick this up in 0.4.0.

woop · 2020-01-25T07:01:02Z

An update on this issue. We have drafted the first RFC for feature statistics and validation. Please have a look and comment if you can!

woop · 2020-06-21T01:56:10Z

This is addressed in Feast 0.6 (code on master). It's now possible to produce batch statistics for validation. #612

woop changed the title ~~How does feast handle feature validation?~~ Add feature validation to Feast Sep 1, 2019

woop added the kind/feature New feature or request label Sep 1, 2019

davidheryanto mentioned this issue Jan 20, 2020

Update protos with Tensorflow data validation schema #438

Merged

woop added area/job-management priority/p0 Highest priority labels Jan 26, 2020

woop assigned davidheryanto and zhilingc Jan 26, 2020

davidheryanto mentioned this issue Jan 29, 2020

Update ingestion to write feature metrics for validation #449

Closed

woop changed the title ~~Add feature validation to Feast~~ Add Feature Validation to Feast Jan 29, 2020

davidheryanto mentioned this issue Jan 30, 2020

Update Python SDK so FeatureSet can import Schema from Tensorflow metadata #450

Merged

davidheryanto mentioned this issue Feb 23, 2020

Extend WriteMetricsTransform in Ingestion to write feature value stats to StatsD #486

Merged

woop mentioned this issue Mar 9, 2020

Feast 0.5 release #527

Closed

woop added this to the v0.5.0 milestone Mar 10, 2020

woop modified the milestones: v0.5.0, v0.6.0 Apr 29, 2020

woop closed this as completed Jun 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Feature Validation to Feast #172

Add Feature Validation to Feast #172

kingkai620 commented Apr 2, 2019

tims commented Apr 3, 2019 •

edited

Loading

woop commented Sep 1, 2019

woop commented Jan 25, 2020

woop commented Jun 21, 2020

Add Feature Validation to Feast #172

Add Feature Validation to Feast #172

Comments

kingkai620 commented Apr 2, 2019

tims commented Apr 3, 2019 • edited Loading

woop commented Sep 1, 2019

woop commented Jan 25, 2020

woop commented Jun 21, 2020

tims commented Apr 3, 2019 •

edited

Loading