stl10-pca-gmm-experiment

STL10 dimensionlity reduction using PCA and labeling using GMM

PCA and GMM from scratch

PCA and GMM were implemented from scratch by only using python and numpy. STL10 dataset was used for the whole experiment. A subset of the STL10 unlabeled split were used to find the projection matrix fitted for the dataset. Various dims were use to test how well the PCA will perform for the unseen STL10 test split.
After analyzing the effectiveness of various dims, an optimal dim was determined. The STL10 unlabeled dataset was encoded using the optimal dim. These codes were then used in the GMM. The GMM was used to try to cluster the codes. The clusters where then labeled using the labeled STL10 test split. The performance of the GMM for clustering was then evaluated. The notebook can be accessed here.

Sklearn's PCA and GMM Implementation

The sanity of the experiment and the algorithm's implementation from scratch above was verified using the well tested PCA and GMM modules of sklearn. Overall, both the implementation from scratch and sklearn's modules yield the same trend in the results. The notebook can be accessed here.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.gitignore		.gitignore
README.md		README.md
gmm_clustering.png		gmm_clustering.png
pca_gmm_modules.ipynb		pca_gmm_modules.ipynb
pca_illustration.png		pca_illustration.png
requirements.txt		requirements.txt
stl10_pca_gmm_experiment.ipynb		stl10_pca_gmm_experiment.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

stl10-pca-gmm-experiment

PCA and GMM from scratch

Sklearn's PCA and GMM Implementation

About

Releases

Packages

Languages

peeeyow/stl10-pca-gmm-experiment

Folders and files

Latest commit

History

Repository files navigation

stl10-pca-gmm-experiment

PCA and GMM from scratch

Sklearn's PCA and GMM Implementation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages