Concept Network Construction by Integrating Knowledge Base with Scientific and Technical Corpora

remenberl/concept_newtork

concept_newtork

Paper: Concept Network Construction by Integrating Knowledge Base with Scientific and Technical Corpora

This project depends on several existing tools, including Tagme and Jate. Before running our code, apply for access to the Tagme API and compile Jate. Several Python packages are also required, e.g., mwparserfromhell, networkx, and MySQLdb.
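As a quick sanity check before starting, the short sketch below reports which of the Python packages named above cannot be imported in the current interpreter. The helper name `missing_packages` is hypothetical and not part of this repository. The list only contains the three packages mentioned in this README, so the scripts may need others.

```python
import importlib.util

# Packages named in this README; the scripts may import others as well.
REQUIRED = ["mwparserfromhell", "networkx", "MySQLdb"]


def missing_packages(names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```

Note that this only checks the Python side. Tagme API access and the compiled Jate build still have to be verified separately.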

For the datasets, please download them from http://arnetminer.org/citation and http://dumps.wikimedia.org/enwiki/

Additionally, some MySQL tables are needed to store Wiki surface forms and Wikification results. Interested readers can refer to their formats in the folder "mysql_tables".

Once these preparations are done, construct the network by running the Python code sequentially from step0 to step9 (step5 is written in C++ for efficiency).

PS: please run Jate immediately after finishing step2 to generate the CValue term extraction results.
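The run order described above (step0 to step9, Jate right after step2, step5 as a compiled C++ program) could be sketched as a small driver script. Everything concrete here is an assumption made for illustration: the `step{}.py` file-name pattern, the `./step5` binary path, and the Jate command are placeholders that need to be replaced with the actual names in this repository and in your Jate build. The sketch also launches the scripts with the current interpreter, which may not match the Python version the scripts were written for.

```python
import subprocess
import sys


def build_plan(jate_cmd, step5_cmd, script_pattern="step{}.py"):
    """Return the ordered commands for steps 0-9, with Jate inserted after step 2.

    jate_cmd and step5_cmd are argument lists supplied by the caller;
    script_pattern is a hypothetical naming scheme for the Python steps.
    """
    plan = []
    for step in range(10):
        if step == 5:
            # step5 is a compiled C++ program, so it is run directly.
            plan.append(list(step5_cmd))
        else:
            plan.append([sys.executable, script_pattern.format(step)])
        if step == 2:
            # Jate must run right after step 2 to produce the CValue results.
            plan.append(list(jate_cmd))
    return plan


def run_plan(plan):
    """Run each command in order and stop at the first failure."""
    for cmd in plan:
        print("running:", " ".join(cmd))
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Placeholder commands; substitute the real Jate invocation and step5 binary.
    run_plan(build_plan(jate_cmd=["echo", "run Jate here"], step5_cmd=["./step5"]))
```

Because `check=True` stops the run at the first failing command, a later step never starts on incomplete output from an earlier one.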
