Text Classification using Reuters Data
-
reutersdata-scratch-classicML.ipynb : simple data processing from raw source and classical machine learning (SVM, Logistic Regression)
-
reutersdata_keras_processing.ipynb : keras way of processing reuters data (data util functions are integrated within keras)
-
reutersdata-keras-modeling_final.ipynb : data loaded from keras and trained with DL models (MLP, transformer)
Acknowledgement and References:
Adapted from various resources, tools, and descriptions as follows -
data: https://archive.ics.uci.edu/ml/machine-learning-databases/reuters21578-mld/ or http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html
https://martin-thoma.com/nlp-reuters/
https://github.com/MartinThoma/algorithms/tree/master/ML/nlp
https://github.com/SteffenBauer/KerasTools/tree/master/Reuters_Analysis
https://www.kaggle.com/drscarlat/reuters-document-classification-with-keras-tf