HTML tag stopwords #78
Comments
Better approach: try to recognize HTML/XML input and load it with something like BeautifulSoup to get the text-only content.
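A minimal sketch of that idea, not the project's actual code. The names `looks_like_markup` and `text_only` and the tag-detection regex are assumptions made for illustration. It uses BeautifulSoup's `get_text()` when the `bs4` package is installed and falls back to the standard library's `html.parser` otherwise:

```python
import re
from html.parser import HTMLParser

try:
    # Third-party package: pip install beautifulsoup4
    from bs4 import BeautifulSoup
except ImportError:
    BeautifulSoup = None

# Rough heuristic (an assumption): treat input as markup if it contains
# something shaped like an opening or closing tag, e.g. <p> or </div>.
TAG_RE = re.compile(r"</?[a-zA-Z][^<>]*>")


def looks_like_markup(text):
    """Return True if the text appears to contain HTML/XML tags."""
    return bool(TAG_RE.search(text))


class _TextOnly(HTMLParser):
    """Standard-library fallback: collect only the text between tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def text_only(raw):
    """Strip markup from HTML/XML input; return plain text unchanged."""
    if not looks_like_markup(raw):
        return raw
    if BeautifulSoup is not None:
        text = BeautifulSoup(raw, "html.parser").get_text(separator=" ")
    else:
        parser = _TextOnly()
        parser.feed(raw)
        parser.close()
        text = " ".join(parser.chunks)
    # Collapse the runs of whitespace left behind by removed tags.
    return " ".join(text.split())


print(text_only("<p>Hello <b>world</b></p>"))
```

Running the text through `text_only` before tokenizing means tag names never reach the word counts, so the stopwords file would not need to list them.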
Ah ha! Python libraries that do what needs to be done // still learning
I'm attaching the screenshot of what happened when I initially gave the --a. On Thu, Oct 17, 2013 at 8:47 PM, Mia [email protected] wrote:
@amrys I'm not seeing a screenshot. You might have to use the GitHub web interface (not sure if you can add attachments via email).
Roger that. I will try to remember my GitHub login after I take care of a. On Wed, Oct 23, 2013 at 5:11 PM, Rebecca Sutton Koeser <
The site accepts HTML but only searches for tags. Either the tags need to be in the stopwords file, or there needs to be some way of recognizing these tags and ignoring them.
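For illustration, the stopwords option could look something like the sketch below. The tag list, the tokenizer, and the function names are all hypothetical, not taken from the project:

```python
import re

# Hypothetical, incomplete set of tag and attribute names to ignore.
HTML_TAG_STOPWORDS = {
    "html", "head", "body", "div", "span", "p", "br",
    "a", "href", "img", "src",
}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def remove_stopwords(tokens, stopwords):
    """Drop every token that appears in the stopword set."""
    return [t for t in tokens if t not in stopwords]


tokens = tokenize("<div><p>Whale songs</p></div>")
print(remove_stopwords(tokens, HTML_TAG_STOPWORDS))
```

One caveat with this approach: tag names such as `a` or `body` are also ordinary words, so they would be dropped from real text as well.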