Bayesian models

LDA using collapsed gibbs sampling is implemented.

To use the LDA as a module, please use the TopicModels julia package.

Steps to use TopicModels package:

git clone #clone the repo
cd handson_julia  #cd to parent directory of TopicModels folder
Pkg.activate("TopicModels") #activate the package in julia REPL
corpus = TopicModels.documentset_readData("news.txt") #import the document collection
wordPrior = TopicModels.Dirichlet(corpus.vocab_count, 0.01) #dirichlet word prior
M = 3                             #number of topics
alpha = [0.01 for i in 1:M]      
topicPrior = TopicModels.Dirichlet(alpha); #dirichlet topic prior
lda = TopicModels.LDA(topicPrior, wordPrior) #build LDA struct using word prior and topic prior
samples = TopicModels.lda_sample(corpus.documents, lda) #run LDA with collapsed gibbs sampling
words, proportions = TopicModels.lda_topicN(1, 10, corpus, lda) #top 10 words and topic proportions of Topic 1

LDA on dummy news dataset as well as on NIPS paper dataset is applied. Implementation and results can be viewed in LDA_with_package.ipynb notebook.

To view the complete implementation in jupyter notebook, please have a look at try_LDA.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
20news		20news
TopicModels		TopicModels
dummy		dummy
nips		nips
.gitignore		.gitignore
20news_HLTM.ipynb		20news_HLTM.ipynb
README.md		README.md
Stable_marriage_problem.ipynb		Stable_marriage_problem.ipynb
keyphrase_and_bert.ipynb		keyphrase_and_bert.ipynb
nips_HLTM.ipynb		nips_HLTM.ipynb
stopwords.txt		stopwords.txt
try_LDA.ipynb		try_LDA.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bayesian models

About

Releases

Packages

Languages

haseeb33/julia_bayes

Folders and files

Latest commit

History

Repository files navigation

Bayesian models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages