omidb/nlputils

nlputils

simple tools for NLP

Anything could change in future releases.

This project collects simple tools for everyday NLP tasks.

Using:

 resolvers += Resolver.sonatypeRepo("snapshots")

 libraryDependencies += "com.github.omidb" %%% "nlputils" % "0.1.0-SNAPSHOT"

It contains a reader for OntoNotes.

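 // Read the .onf file as a sequence of lines (adjust the path to your OntoNotes 5.0 copy)
 val file = scala.io.Source.fromFile("E:/data/ontonotes-release-5.0/data/files/data/english/annotations/mz/sinorama/10/ectb_1001.onf").getLines().toIndexedSeq
 
 // Parse the lines into an OntoNotes document
 val ontdoc = OntoNotesReader.read(file)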

The resulting document contains coreference chains, named entities, predicate arguments, parse trees and word senses.

You can also parse any S-expression:

 val tree = "(TOP (S (VP (V kill)) (NP him)))"

 val dg: DGraph[IndexedSeq[String], String] = SExpression.parse(tree)

You can read about the DGraph library and its useful properties here: https://github.com/omidb/dgraph
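
As a rough illustration of what S-expression parsing involves, here is a minimal standalone sketch. It does not use nlputils or DGraph and is not how `SExpression.parse` is implemented; the names `SExpr`, `Atom`, `Node` and `SimpleSExpr` are hypothetical and exist only for this example. It tokenizes the string, builds a plain tree, and collects the words under the labels.

```scala
// Standalone sketch, NOT the nlputils implementation.
// All type and object names below are hypothetical.
sealed trait SExpr
final case class Atom(value: String) extends SExpr
final case class Node(children: List[SExpr]) extends SExpr

object SimpleSExpr {
  // Split the input into "(", ")" and atom tokens.
  def tokenize(s: String): List[String] =
    s.replace("(", " ( ").replace(")", " ) ")
      .trim.split("\\s+").filter(_.nonEmpty).toList

  // Parse one expression from the front of the token list and
  // return it together with the tokens that were not consumed.
  private def parseExpr(tokens: List[String]): (SExpr, List[String]) =
    tokens match {
      case "(" :: rest  => parseList(rest, Vector.empty)
      case ")" :: _     => throw new IllegalArgumentException("unexpected ')'")
      case atom :: rest => (Atom(atom), rest)
      case Nil          => throw new IllegalArgumentException("unexpected end of input")
    }

  // Collect children until the matching ")" is reached.
  private def parseList(tokens: List[String], acc: Vector[SExpr]): (SExpr, List[String]) =
    tokens match {
      case ")" :: rest => (Node(acc.toList), rest)
      case Nil         => throw new IllegalArgumentException("missing ')'")
      case _ =>
        val (child, rest) = parseExpr(tokens)
        parseList(rest, acc :+ child)
    }

  // Parse a whole string that holds exactly one S-expression.
  def parse(s: String): SExpr = {
    val (expr, rest) = parseExpr(tokenize(s))
    require(rest.isEmpty, s"trailing tokens: ${rest.mkString(" ")}")
    expr
  }

  // Treat the first element of every list as its label and
  // return the remaining atoms (the words) in left-to-right order.
  def words(e: SExpr): List[String] = e match {
    case Atom(w)             => List(w)
    case Node(_ :: children) => children.flatMap(words)
    case Node(Nil)           => Nil
  }
}
```

With the tree from the example above, `SimpleSExpr.words(SimpleSExpr.parse("(TOP (S (VP (V kill)) (NP him)))"))` yields `List("kill", "him")`. The library itself returns a `DGraph` rather than this simple tree, so treat the sketch only as a way to see the structure of the input.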
