Skip to content

kordjamshidi/illinois-cogcomp-nlp

 
 

Repository files navigation

illinois-cogcomp-nlp

Build Status

This project collects a number of core libraries for Natural Language Processing (NLP) developed by the University of Illinois' Cognitive Computation Group.

CogComp's main NLP libraries

  • illinois-core-utilities Provides a set of NLP-friendly data structures and a number of NLP-related utilities that support writing NLP applications, running experiments, etc.
  • illinois-corpusreaders Provides classes to read documents from corpora into illinois-core-utilities data structures.
  • illinois-curator Supports use of Illinois' NLP Curator, a tool to run NLP applications as services.
  • illinois-edison A library for feature extraction from illinois-core-utilities data structures.
  • illinois-lemmatizer An application that uses WordNet and simple rules to find the root forms of words in plain text.
  • illinois-tokenizer An application that identifies sentence and word boundaries in plain text.
  • illinois-pos An application that identifies the part of speech (e.g. verb + tense, noun + number) of each word in plain text.
  • illinois-ner An application that identifies named entities in plain text according to two different sets of categories.

Using each library programmatically

To include one of the modules in your Maven project, add the following snippet with the #modulename# and #version entries replaced with the relevant module name and the version listed in this project's pom.xml file. Note that you also add to need the <repository element for the CogComp maven repository in the <repositories> element.

    <dependencies>
         ...
        <dependency>
            <groupId>edu.illinois.cs.cogcomp</groupId>
            <artifactId>#modulename#</artifactId>
            <version>#version#</version>
        </dependency>
        ...
    </dependencies>
    ...
    <repositories>
        <repository>
            <id>CogcompSoftware</id>
            <name>CogcompSoftware</name>
            <url>http://cogcomp.cs.illinois.edu/m2repo/</url>
        </repository>
    </repositories>

About

CogComp's main NLP libraries

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 99.6%
  • Other 0.4%