Skip to content

Experimenting with Natural Language Processing

Notifications You must be signed in to change notification settings

murphyb2/nlp-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SpaCy NLP Demo

This application runs fully in the terminal and will analyze all text (.txt) files located in the ./docs folder and output the 25 most frequent words in ALL of the documents read.

This project is build using SpaCy.

Setup

Initialize Python Environment

Install the python packages as required in the Pipfile

pipenv shell
pipenv install

Install SpaCy PreTrained Model

python -m spacy download en_core_web_sm

Run the Script

python spNLP.py

Analysis Details

After processing all the text through the SpaCy NLP pipeline, the program uses SpaCy's PhraseMatcher to find all the sentences containing the most frequent words. It matches on the word's lemma so it will find matches country -> countries.

About

Experimenting with Natural Language Processing

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages