K-means clustering aims to partition m observations into K clusters in which each observation belongs to the cluster with the nearest mean, which serves as a prototype of the cluster.
The result of a cluster analysis is shown below as the coloring of the squares into three clusters.
Given a training set of observations x(1), x(2), ..., x(m), where each observation is a d-dimensional real vector, k-means clustering aims to partition the m observations into K (≤ m) clusters S = {S1, S2, ..., SK} so as to minimize the within-cluster sum of squares (i.e. variance).
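In standard notation, this objective is:

$$\underset{S}{\arg\min} \sum_{i=1}^{K} \sum_{x \in S_i} \lVert x - \mu_i \rVert^2$$

where μi is the mean of the points in Si.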
Below is an example of the random initialization of 4 cluster centroids and the subsequent convergence of the clusters:
Another illustration of k-means convergence:
The following notation is used below:

- c(i) - index of the cluster (1, 2, ..., K) to which example x(i) is currently assigned.
- μc(i) - cluster centroid of the cluster to which example x(i) has been assigned.
For example, if example x(i) is assigned to cluster 5, then c(i) = 5 and μc(i) = μ5.
In this case, the optimization objective looks like the following:
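$$J(c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K) = \frac{1}{m} \sum_{i=1}^{m} \lVert x^{(i)} - \mu_{c^{(i)}} \rVert^2$$

This is the standard k-means cost (distortion) function: the average squared distance between each example and the centroid of its assigned cluster. The algorithm minimizes J over both the assignments c(1), ..., c(m) and the centroids μ1, ..., μK.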
Randomly initialize K cluster centroids (randomly pick K training examples and set the K cluster centroids to those examples), as in the sketch below.
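A minimal sketch of this initialization step in Octave (the project's actual implementation lives in init_centroids.m listed below; this version only assumes X is an m × d matrix of training examples):

```octave
% Pick K distinct training examples at random and use them as the
% initial cluster centroids (assumed signature; see init_centroids.m).
function centroids = init_centroids(X, K)
  % Shuffle the row indices of the training set.
  random_indices = randperm(size(X, 1));
  % Take the first K shuffled examples as the initial centroids (K x d).
  centroids = X(random_indices(1:K), :);
end
```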
The demo consists of the following files:

- demo.m - main demo file that you should run from the Octave console.
- set1.mat - training data set #1.
- set2.mat - training data set #2.
- compute_centroids.m - computes the new centroid (mean) of each cluster.
- find_closest_centroids.m - splits the training examples into clusters based on their distances to the centroids.
- init_centroids.m - randomly initializes the centroids by picking random training examples.
- k_means_train.m - function that runs the K-Means algorithm.
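A rough usage sketch from the Octave console, assuming k_means_train(X, K, max_iterations) returns the learned centroids and the per-example cluster indices (the exact signatures may differ; demo.m is the real entry point):

```octave
% Load training data; set1.mat is assumed to contain the training
% examples in an (m x d) matrix X.
load('set1.mat');

K = 3;                 % number of clusters to look for (assumption)
max_iterations = 100;  % iteration cap for the training loop (assumption)

% Alternates find_closest_centroids and compute_centroids until the
% assignments stop changing or the iteration cap is reached.
[centroids, closest_centroids_ids] = k_means_train(X, K, max_iterations);

disp(centroids);       % final cluster centroids, one per row (K x d)
```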