Get sparse feature sizes #8750

jupyterjazz · 2021-05-26T14:04:00Z

In order to dynamically change sparse layers, we should be able to split a combined feature input into features that come from separate featurizers. The reason behind this is that new features will be appended to the corresponding splits and not to the combined representation. That's why we need to know the size of each sparse feature.

feature_sizes collection will have a following structure: Dict[Text, Dict[Text, List[int]]]
It will be a dictionary of attributes that have sparse features(e.g. text, response… we are excluding label at this point) to the dictionary of different kinds of features(e.g. sequence/sentence) to the list of feature sizes of different featurizers.

The collection should be created in the DIETClassifier class during the _create_model_data call and it should be assigned to the DIETClassifier class as an attribute.

feature_sizes should be persisted after training and loaded during fine-tuning. We will need to compare persisted feature_sizes to the current feature_sizes to find out if there are any DenseForSparse layers to be updated.

Things to be done:

get feature_sizes of current data
persist feature_sizes
load persisted feature_sizes

The text was updated successfully, but these errors were encountered:

tttthomasssss · 2021-06-08T08:28:52Z

@samsucik assigned as reviewer.

jupyterjazz self-assigned this May 26, 2021

This was referenced May 26, 2021

Adjusting DenseForSparse layer size #8751

Closed

Calculate/Persist/Load sparse feature sizes #8795

Merged

tttthomasssss assigned samsucik Jun 8, 2021

jupyterjazz closed this as completed Jul 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get sparse feature sizes #8750

Get sparse feature sizes #8750

jupyterjazz commented May 26, 2021 •

edited

Loading

tttthomasssss commented Jun 8, 2021

Get sparse feature sizes #8750

Get sparse feature sizes #8750

Comments

jupyterjazz commented May 26, 2021 • edited Loading

tttthomasssss commented Jun 8, 2021

jupyterjazz commented May 26, 2021 •

edited

Loading