nlputils

Simple tools for NLP

The API is not stable yet: anything may change in future releases.

This project collects simple tools for everyday NLP tasks.

Usage (add to your sbt build):

 resolvers += Resolver.sonatypeRepo("snapshots")

 libraryDependencies += "com.github.omidb" %%% "nlputils" % "0.1.0-SNAPSHOT"

It includes a reader for OntoNotes files:

 val file = scala.io.Source.fromFile("E:/data/ontonotes-release-5.0/data/files/data/english/annotations/mz/sinorama/10/ectb_1001.onf").getLines().toIndexedSeq
 
 val ontdoc = OntoNotesReader.read(file)

The resulting document (`ontdoc`) contains coreference chains, named entities, predicate-argument structures, parse trees, and word senses.
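The snippet above calls `Source.fromFile` without closing the source. As a minimal sketch, assuming Scala 2.13+ (for `scala.util.Using`) and UTF-8 encoded input, the lines can be loaded with a small helper that always closes the file. `OnfLines` and `readLines` are hypothetical names used for illustration and are not part of nlputils.

```scala
import scala.io.Source
import scala.util.Using

object OnfLines {
  // Hypothetical helper (not part of nlputils): read every line of a file
  // into memory and close the underlying handle, even if reading fails.
  // The UTF-8 encoding is an assumption; adjust it to match your data.
  def readLines(path: String): IndexedSeq[String] =
    Using.resource(Source.fromFile(path, "UTF-8")) { source =>
      // toIndexedSeq materializes all lines before the source is closed.
      source.getLines().toIndexedSeq
    }
}
```

The returned `IndexedSeq[String]` has the same type as `file` in the example above, so it can be passed to `OntoNotesReader.read` in the same way.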

You can also parse any S-expression:

 val tree = "(TOP (S (VP (V kill)) (NP him)))"

 val dg: DGraph[IndexedSeq[String], String] = SExpression.parse(tree)

For details on the DGraph library and its useful properties, see https://github.com/omidb/dgraph
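For readers unfamiliar with S-expressions, here is a minimal standalone sketch of what parsing a bracketed tree string such as `(TOP (S (VP (V kill)) (NP him)))` involves. It is not the implementation behind `SExpression.parse` and does not produce a `DGraph`; the names `SExpr`, `Atom`, `SList` and `SExprSketch` are hypothetical and exist only for this illustration.

```scala
// A tiny S-expression tree: either a bare token or a parenthesised list.
sealed trait SExpr
final case class Atom(value: String) extends SExpr
final case class SList(children: List[SExpr]) extends SExpr

object SExprSketch {
  // Split the input into "(", ")" and whitespace-delimited tokens.
  def tokenize(input: String): List[String] =
    input.replace("(", " ( ").replace(")", " ) ")
      .trim.split("\\s+").filter(_.nonEmpty).toList

  // Parse one expression from the front of the token list and
  // return it together with the tokens that remain.
  def parseExpr(tokens: List[String]): (SExpr, List[String]) = tokens match {
    case "(" :: rest =>
      val (children, remaining) = parseList(rest, Nil)
      (SList(children), remaining)
    case ")" :: _ =>
      throw new IllegalArgumentException("unexpected ')'")
    case tok :: rest =>
      (Atom(tok), rest)
    case Nil =>
      throw new IllegalArgumentException("unexpected end of input")
  }

  // Parse expressions until the matching ")" is consumed.
  private def parseList(tokens: List[String],
                        acc: List[SExpr]): (List[SExpr], List[String]) =
    tokens match {
      case ")" :: rest => (acc.reverse, rest)
      case Nil         => throw new IllegalArgumentException("missing ')'")
      case _ =>
        val (expr, rest) = parseExpr(tokens)
        parseList(rest, expr :: acc)
    }

  // Parse a complete string; anything left after the first expression is an error.
  def parse(input: String): SExpr = {
    val (expr, rest) = parseExpr(tokenize(input))
    require(rest.isEmpty, s"trailing tokens: ${rest.mkString(" ")}")
    expr
  }

  // Turn a tree back into its bracketed string form.
  def render(expr: SExpr): String = expr match {
    case Atom(v)         => v
    case SList(children) => children.map(render).mkString("(", " ", ")")
  }
}
```

In this sketch, `SExprSketch.parse("(NP him)")` yields `SList(List(Atom("NP"), Atom("him")))`, and `render` applied to a parsed tree reproduces the bracketed string with single spaces between tokens.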