PINYON

PINYON implements a community detection algorithm that takes into account the context and meaning of the entities mentioned in a post. This allows PINYON to accurately identify semantically related posts across different contexts.

Using PINYON

Pre-processing

To use PINYON, we first need to pre-process the corpus of social media posts (tweets). The TweetsCOV19 (May 2020) dataset can be downloaded using this link.

After downloading the tweets dataset, we need to execute the three scripts in the tweets_process directory.

Once all the scripts have finished executing, we need to obtain the original text of the tweets. This can be done using Hydrator.

Embeddings download

The embeddings for each KG need to be downloaded and placed in the embedding/data/ directory:

DBpedia

Wikidata

UMLS
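Before running the approach, it can help to confirm that the downloaded embeddings ended up in the expected place. The following is a minimal sketch, not part of PINYON itself: the helper name `list_embedding_files` is hypothetical, and since the embedding file names are not stated here, it only lists whatever files are present in embedding/data/.

```python
from pathlib import Path


def list_embedding_files(data_dir="embedding/data"):
    """Return the sorted names of the files found in data_dir.

    Hypothetical helper for illustration; raises FileNotFoundError
    if the directory does not exist.
    """
    path = Path(data_dir)
    if not path.is_dir():
        raise FileNotFoundError(f"Embedding directory not found: {path}")
    # Only regular files are reported; sub-directories are ignored.
    return sorted(p.name for p in path.iterdir() if p.is_file())
```

Calling `list_embedding_files()` from the repository root should return one or more entries per KG once the downloads are in place.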

The PINYON SCD Approach

Now that we have all the necessary data, we can run the PINYON approach against the three KGs (UMLS, Wikidata, and DBpedia). For example, to run the approach against UMLS, please use the following:

python3 run_umls.py
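The repository also contains run_wiki.py and run_db.py. Assuming these are the Wikidata and DBpedia counterparts of run_umls.py (inferred from the file names only), the three runs could be chained with a small wrapper such as the sketch below. The names `KG_SCRIPTS`, `build_command`, and `run_all` are hypothetical and not part of PINYON.

```python
import subprocess
import sys

# Script names come from the repository's top-level files; the mapping of
# run_wiki.py to Wikidata and run_db.py to DBpedia is an assumption.
KG_SCRIPTS = {
    "umls": "run_umls.py",
    "wikidata": "run_wiki.py",
    "dbpedia": "run_db.py",
}


def build_command(kg):
    """Return the command that runs the script for the given KG."""
    return [sys.executable, KG_SCRIPTS[kg]]


def run_all(kgs=("umls", "wikidata", "dbpedia")):
    """Run the selected scripts in order, stopping at the first failure."""
    for kg in kgs:
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(build_command(kg), check=True)
```

Calling `run_all()` from the repository root runs the three scripts one after another; `run_all(["umls"])` is equivalent to the single command above.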
