fixes error of get_feature_names removal #235

David-Moody · 2022-12-15T20:52:37Z

Error when using scikit-learn >= 1.2.0

pyLDAvis.sklearn.prepare raises an error because the object passed as the vectorizer argument no longer has a get_feature_names() method.

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names'

Taking the documentation of sklearn.feature_extraction.text.CountVectorizer as an example, this method is marked as deprecated in the 1.0 docs and is gone from the 1.2 docs. The same is true for TfidfVectorizer, the other vectorizer that can be used.

The recommendation in those docs is to use get_feature_names_out() as a replacement.

Instead of returning a list of feature names, the replacement returns an ndarray of them. Since both types are iterable, this makes no difference for this use case, which only requires an array-like.

This fix would also be backwards compatible to at least scikit-learn 1.0.
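For anyone who also needs to support scikit-learn releases older than 1.0, a version-tolerant lookup could be sketched roughly as below. This is an illustration only, not the change made in this PR: `feature_names` is a hypothetical helper, and the two small classes are stand-ins so the snippet runs without scikit-learn installed. With a real fitted CountVectorizer or TfidfVectorizer the helper would be called the same way.

```python
from typing import Any, List


def feature_names(vectorizer: Any) -> List[str]:
    """Return the vocabulary terms of a fitted vectorizer as a list.

    Hypothetical helper, not part of pyLDAvis or scikit-learn.
    """
    # Prefer the newer method, which is the only one left in scikit-learn >= 1.2.
    getter = getattr(vectorizer, "get_feature_names_out", None)
    if getter is None:
        # Assumption: an older release that only has the deprecated method.
        getter = vectorizer.get_feature_names
    # list() accepts any iterable, so an ndarray and a list are handled alike.
    return list(getter())


class NewStyleVectorizer:
    """Stand-in exposing only get_feature_names_out (illustration only)."""

    def get_feature_names_out(self):
        return ("apple", "banana")


class OldStyleVectorizer:
    """Stand-in exposing only get_feature_names (illustration only)."""

    def get_feature_names(self):
        return ["apple", "banana"]


print(feature_names(NewStyleVectorizer()))
print(feature_names(OldStyleVectorizer()))
```

Looking the method up with getattr keeps the call site free of version checks, at the cost of hiding which scikit-learn API is actually in use.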

Tested in a fresh conda environment with Python==3.10.8; the fix gives the expected behaviour.

zhfuch · 2023-01-09T03:23:49Z

Thank you very much!

fixes error of get_feature_names removal

b2df4f0

msusol approved these changes Feb 11, 2023

msusol merged commit 6d24827 into bmabey:master Feb 11, 2023

This was referenced Feb 11, 2023

tsne won't work with sklearn.prepare #233

Closed

'TfidfVectorizer' object has no attribute 'get_feature_names' #238

Closed

Sklearn Count Vector #228

Closed

msusol mentioned this pull request Feb 27, 2023

AttributeError: 'CountVectorizer' object has no attribute 'get_feature_names' #243

Closed

msusol mentioned this pull request Mar 9, 2023

AttributeError: ... object has no attribute 'get_feature_names' #245

Closed
