This application identifies topics in Bicol-related news articles through the use of generated clustered models.
- Beautiful Soup 4
- Scikit-learn
- matplotlib
- pandas
- NumPy
- requests
This algorithm is used to process textual data to numerical data as the Scikit-Learn library only accepts numerical data as a model.
This algorithm is responsible for clustering the data generated by the TFIDF algorithm. The main reason on why we used clustering instead of a classifier is that we do not know what exactly is in the data.
(To be added later)
- Tkinter graphical user interface
- Collect news data from Rappler and Inquirer
- Edit Cluster Labels
- View Cluster Contents
- Import Cluster Model
- View Graph (Scatterplot and Line Graph)
(More details to be added soon)