K-means++ in Pandas

An implementation of the k-means++ clustering algorithm using Pandas.
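For readers new to the algorithm, the seeding step that sets k-means++ apart from plain k-means can be sketched in a few lines of dependency-free Python. This is an illustration only, not the code in this package: the names `squared_distance` and `kmeans_pp_seeds` are hypothetical, and points are assumed to be tuples of numbers. The first center is picked uniformly at random; each later center is picked with probability proportional to its squared distance from the nearest center chosen so far.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_pp_seeds(points, k, rng=None):
    """Pick k initial centers from points using k-means++ seeding.

    Hypothetical helper for illustration; not part of this package's API.
    """
    rng = rng or random.Random()
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")

    # First center: uniform random choice.
    centers = [rng.choice(points)]
    # d2[i] is the squared distance from points[i] to its nearest center.
    d2 = [squared_distance(p, centers[0]) for p in points]

    while len(centers) < k:
        if sum(d2) == 0:
            # Every point coincides with a chosen center; fall back to uniform.
            nxt = rng.choice(points)
        else:
            # Sample proportionally to squared distance (D^2 weighting).
            nxt = rng.choices(points, weights=d2, k=1)[0]
        centers.append(nxt)
        # Only the new center can shrink a point's nearest-center distance.
        d2 = [min(d, squared_distance(p, nxt)) for d, p in zip(d2, points)]

    return centers
```

The returned centers would then be handed to the usual k-means iterations (assign each point to its nearest center, recompute centers, repeat). Passing a seeded `random.Random` instance makes the choice reproducible.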

IMPORTANT NOTE

This package should not be used in production. Its implementation of k-means++ is much slower than scikit-learn's, so use scikit-learn instead.

The only reason I wrote any of this was to teach myself Pandas.

Prerequisites

Installation

If you have pip, run:

pip install k-means-plus-plus

Otherwise,

Usage

Here are the constructor arguments:

Once you've constructed a KMeansPlusPlus object, call its cluster method and everything else should happen automagically. Take a look at the examples folder.

TODO: