newspaper.nlp | ignored stopword: using #503

Closed
AndyTheFactory opened this issue Oct 24, 2023 · 2 comments

@AndyTheFactory
Owner

Issue by ilkut
Sun Jan 17 13:26:39 2021
Originally opened as codelucas/newspaper#870


I noticed that nlp loads the stopwords, and that stopwords-en.txt includes 'using'.

However, I still see n3k returning 'using' as a keyword for the pages below.

@AndyTheFactory
Owner Author

Comment by johnbumgarner
Mon Feb 1 17:36:34 2021


Where in the code base does it load that second list of English stopwords?

@AndyTheFactory AndyTheFactory added the bug Something isn't working label Oct 30, 2023
@AndyTheFactory AndyTheFactory added this to the Release 0.9.1 milestone Oct 30, 2023
@AndyTheFactory
Owner Author

The module is nlp.load_stopwords.
Oddly, there were two stopword lists for English; I'm not sure why. The one that contained "using" was not loaded.

Fixed in 0.9.1.
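
For anyone hitting something similar, here is a minimal sketch of the failure mode described above. It is not the actual newspaper code: the helper names (`load_stopwords`, `filter_keywords`, `demo`) and the two file names are hypothetical, and the files are created in a temporary directory just for illustration. It only shows that a word present in a list that never gets loaded will pass through keyword filtering, and that taking the union of both lists removes it.

```python
from pathlib import Path
import tempfile


def load_stopwords(paths):
    """Return the union of stopwords from every file in `paths`.

    Hypothetical helper: assumes one stopword per line, UTF-8 encoded.
    """
    words = set()
    for path in paths:
        for line in Path(path).read_text(encoding="utf-8").splitlines():
            word = line.strip().lower()
            if word:
                words.add(word)
    return words


def filter_keywords(candidates, stopwords):
    """Drop candidate keywords that appear in the stopword set."""
    return [w for w in candidates if w.lower() not in stopwords]


def demo():
    # Two illustrative lists; only the second one contains "using".
    with tempfile.TemporaryDirectory() as tmp:
        first = Path(tmp) / "stopwords-a.txt"
        second = Path(tmp) / "stopwords-b.txt"
        first.write_text("the\nand\n", encoding="utf-8")
        second.write_text("the\nusing\n", encoding="utf-8")

        candidates = ["using", "python", "the", "parser"]
        # Loading only the first list lets "using" through.
        only_first = filter_keywords(candidates, load_stopwords([first]))
        # Loading both lists filters it out.
        merged = filter_keywords(candidates, load_stopwords([first, second]))
        return only_first, merged
```

Calling `demo()` returns the keyword list for each case, so the two results can be compared directly.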
