Skip to content

SaumyaPaigwar/K-Means-Clustering-on-Documents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

K-Means-Clustering-on-Documents

Document clustering algorithm based on TF-IDF

Data set used is : Pick up the Reuters R52 dataset from https://www.cs.umb.edu/~smimarog/textmining/datasets/.

First run createDocument_in_folders.java to creats documents in AllDocumnets folder.

Then run kMeans.java to perform clustering on AllDocumnets.

About

Document clustering algorithm based on TF-IDF

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages