PyDataBlog
diff --git a/.github/workflows/CompatHelper.yml
+1-1 b/.github/workflows/CompatHelper.yml
diff --git a/.github/workflows/benchmarks.yml
+1-1 b/.github/workflows/benchmarks.yml
diff --git a/.travis.yml
+2-1 b/.travis.yml
diff --git a/Project.toml
+2-1 b/Project.toml
diff --git a/README.md
+10-54 b/README.md
diff --git a/benchmark/bench01_distance.jl
-6 b/benchmark/bench01_distance.jl
diff --git a/benchmark/bench02_kmeans.jl
+28 b/benchmark/bench02_kmeans.jl
diff --git a/docs/src/benchmark_image.png
532 KB b/docs/src/benchmark_image.png
diff --git a/docs/src/index.md
+168-1 b/docs/src/index.md
diff --git a/docs/src/iris_example.jpg
165 KB b/docs/src/iris_example.jpg
@@ -13,7 +13,7 @@ jobs:
     steps:
       - uses: julia-actions/setup-julia@latest
         with:
-          version: 1.3
+          version: 1.4
       - name: Pkg.add("CompatHelper")
         run: julia -e 'using Pkg; Pkg.add("CompatHelper")'
       - name: CompatHelper.main()
 
@@ -10,7 +10,7 @@ jobs:
       - uses: actions/checkout@v2
       - uses: julia-actions/setup-julia@latest
         with:
-          version: 1.3
+          version: 1.4
       - name: Install dependencies
         run: julia -e 'using Pkg; pkg"add PkgBenchmark Distances StatsBase BenchmarkTools [email protected]"'
       - name: Run benchmarks
 
@@ -5,6 +5,7 @@ os:
   - osx
 julia:
   - 1.3
+  - 1.4
   - nightly
 after_success:
   - julia -e 'using Pkg; Pkg.add("Coverage"); using Coverage; Coveralls.submit(process_folder())'
@@ -14,7 +15,7 @@ jobs:
   fast_finish: true
   include:
     - stage: Documentation
-      julia: 1.3
+      julia: 1.4
       script: julia --project=docs -e '
           using Pkg;
           Pkg.develop(PackageSpec(path=pwd()));
 
@@ -13,6 +13,7 @@ julia = "1.3"
 [extras]
 Random = "9a3f8284-a2c9-5f02-9a11-845980a1fd5c"
 Test = "8dfed614-e22c-5e08-85e1-65c5234f0b40"
+Suppressor = "fd094767-a336-5f1f-9728-57cf17d0bbfb"
 
 [targets]
-test = ["Test", "Random"]
+test = ["Test", "Random", "Suppressor"]
@@ -10,39 +10,32 @@ ________________________________________________________________________________
 _________________________________________________________________________________________________________
 
 ## Table Of Content
-
-1. [Motivation](#Motivatiion)
+1. [Documentation](#Documentation)
 2. [Installation](#Installation)
 3. [Features](#Features)
-4. [Benchmarks](#Benchmarks)
-5. [Pending Features](#Pending-Features)
-6. [How To Use](#How-To-Use)
-7. [Release History](#Release-History)
-8. [How To Contribute](#How-To-Contribute)
-9. [Credits](#Credits)
-10. [License](#License)
+4. [License](#License)
 
 _________________________________________________________________________________________________________
 
-### Motivation
-It's a funny story actually led to the development of this package.
-What started off as a personal toy project trying to re-construct the K-Means algorithm in  native Julia blew up after into a heated discussion on the Julia Discourse forums after I asked for Julia optimizaition tips. Long story short, Julia community is an amazing one! Andrey Oskin offered his help and together, we decided to push the speed limits of Julia with a parallel implementation of the most famous clustering algorithm. The initial results were mind blowing so we have decided to tidy up the implementation and share with the world. 
+### Documentation
+- Stable Documentation: [![Stable](https://img.shields.io/badge/docs-stable-blue.svg)](https://PyDataBlog.github.io/ParallelKMeans.jl/stable)
+
+- Experimental Documentation: [![Dev](https://img.shields.io/badge/docs-dev-blue.svg)](https://PyDataBlog.github.io/ParallelKMeans.jl/dev)
 
-Say hello to our baby, `ParallelKMeans`!
 _________________________________________________________________________________________________________
 
 ### Installation
 You can grab the latest stable version of this package by simply running in Julia.
 Don't forget to Julia's package manager with `]`
 
 ```julia
-pkg> add TextAnalysis
+pkg> add ParallelKMeans
 ```
 
 For the few (and selected) brave ones, one can simply grab the current experimental features by simply adding the experimental branch to your development environment after invoking the package manager with `]`:
 
 ```julia
-dev git@github.com:PyDataBlog/ParallelKMeans.jl.git
+pkg> dev git@github.com:PyDataBlog/ParallelKMeans.jl.git
 ```
 
 Don't forget to checkout the experimental branch and you are good to go with bleeding edge features and breaks!
@@ -54,46 +47,9 @@ ________________________________________________________________________________
 ### Features
 
 - Lightening fast implementation of Kmeans clustering algorithm even on a single thread in native Julia.
-- Support for multi-theading implementation of Kmeans clustering algorithm.
+- Support for multi-threaded implementation of the K-Means clustering algorithm.
 - Kmeans++ initialization for faster and better convergence.
-- Modified version of Elkan's Triangle inequality to speed up K-Means algorithm.
-
-_________________________________________________________________________________________________________
-
-### Benchmarks
-
-_________________________________________________________________________________________________________
-
-### Pending Features
-- [X] Implementation of Triangle inequality based on [Elkan C. (2003) "Using the Triangle Inequality to Accelerate
-K-Means"](https://www.aaai.org/Papers/ICML/2003/ICML03-022.pdf)
-- [ ] Support for DataFrame inputs.
-- [ ] Refactoring and finalizaiton of API desgin.
-- [ ] GPU support.
-- [ ] Even faster Kmeans implementation based on current literature.
-- [ ] Optimization of code base.
-
-_________________________________________________________________________________________________________
-
-### How To Use
-
-```Julia
-
-```
-
-_________________________________________________________________________________________________________
-
-### Release History
-
-- 0.1.0 Initial release
-
-_________________________________________________________________________________________________________
-
-### How To Contribue
-
-_________________________________________________________________________________________________________
-
-### Credits
+- Implementation of all the variants of the K-Means algorithm.
 
 _________________________________________________________________________________________________________
 
 
@@ -17,12 +17,6 @@ centroids = rand(10, 2)
 d = Vector{Float64}(undef, 100_000)
 suite["100kx10"] = @benchmarkable ParallelKMeans.colwise!($d, $X, $centroids)
 
-# for reference
-metric = SqEuclidean()
-#suite["100kx10_distances"] = @benchmarkable Distances.colwise!($d, $metric, $X, $centroids)
-dist = Distances.pairwise(metric, X, centroids, dims = 2)
-min = minimum(dist, dims=2)
-suite["100kx10_distances"] = @benchmarkable $d = min
 end # module
 
 BenchDistance.suite
@@ -0,0 +1,28 @@
+module BenchKMeans
+using Random
+using ParallelKMeans
+using BenchmarkTools
+
+suite = BenchmarkGroup()
+
+Random.seed!(2020)
+X = rand(10, 100_000)
+
+centroids3 = ParallelKMeans.smart_init(X, 3, 1, init="kmeans++").centroids
+centroids10 = ParallelKMeans.smart_init(X, 10, 1, init="kmeans++").centroids
+
+suite["10x100_000x3x1     Lloyd"] = @benchmarkable kmeans($X, 3, init = $centroids3, n_threads = 1, verbose = false, tol = 1e-6, max_iters = 1000)
+suite["10x100_000x3x1   Hamerly"] = @benchmarkable kmeans(Hamerly(), $X, 3, init = $centroids3, n_threads = 1, verbose = false, tol = 1e-6, max_iters = 1000)
+
+suite["10x100_000x3x2     Lloyd"] = @benchmarkable kmeans($X, 3, init = $centroids3, n_threads = 2, verbose = false, tol = 1e-6, max_iters = 1000)
+suite["10x100_000x3x2   Hamerly"] = @benchmarkable kmeans(Hamerly(), $X, 3, init = $centroids3, n_threads = 2, verbose = false, tol = 1e-6, max_iters = 1000)
+
+suite["10x100_000x10x1    Lloyd"] = @benchmarkable kmeans($X, 10, init = $centroids10, n_threads = 1, verbose = false, tol = 1e-6, max_iters = 1000)
+suite["10x100_000x10x1  Hamerly"] = @benchmarkable kmeans(Hamerly(), $X, 10, init = $centroids10, n_threads = 1, verbose = false, tol = 1e-6, max_iters = 1000)
+
+suite["10x100_000x10x2    Lloyd"] = @benchmarkable kmeans($X, 10, init = $centroids10, n_threads = 2, verbose = false, tol = 1e-6, max_iters = 1000)
+suite["10x100_000x10x2  Hamerly"] = @benchmarkable kmeans(Hamerly(), $X, 10, init = $centroids10, n_threads = 2, verbose = false, tol = 1e-6, max_iters = 1000)
+
+end # module
+
+BenchKMeans.suite
@@ -1,17 +1,184 @@
-# ParallelKMeans.jl Documentation
+# ParallelKMeans.jl Package
 
 ```@contents
+Depth = 4
 ```
 
+## Motivation
+A funny story actually led to the development of this package.
+What started off as a personal toy project to reconstruct the K-Means algorithm in native Julia blew up after a heated discussion on the Julia Discourse forum when I asked for Julia optimization tips. Long story short, the Julia community is an amazing one! Andrey offered his help and together we decided to push the speed limits of Julia with a parallel implementation of the most famous clustering algorithm. The initial results were mind-blowing, so we decided to tidy up the implementation and share it with the world as a maintained Julia package.
+
+Say hello to `ParallelKMeans`!
+
+This package aims to utilize the speed of Julia and parallelization (both CPU & GPU) to offer an extremely fast implementation of the K-Means clustering algorithm and its variations via a friendly interface for practitioners.
+
+In short, we hope this package will eventually mature as the "one stop" shop for everything KMeans on both CPUs and GPUs.
+
+## K-Means Algorithm Implementation Notes
+Since Julia is a column-major language, the package expects the input (design matrix) in the following format:
+
+- Design matrix `X` of size n×m, where the i-th column of `X` (`X[:, i]`) is a single data point in n-dimensional space.
+- Thus, the rows of the design matrix represent the features, and the columns represent the training examples in this feature space.
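+
+As a minimal sketch of this layout, the snippet below transposes a table stored with one observation per row into the features × observations matrix described above. The values and the names `table` and `X` are illustrative assumptions, not part of the package.
+
+```julia
+# Hypothetical data: 3 observations (rows) × 2 features (columns)
+table = [5.1 3.5;
+         4.9 3.0;
+         6.2 3.4]
+
+# Transpose into the expected layout: 2 features (rows) × 3 observations (columns)
+X = permutedims(table)
+
+# Each column of X is now one data point in 2-dimensional space
+first_point = X[:, 1]
+```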
+
+One of the pitfalls of the K-Means algorithm is that it can converge to a local minimum.
+This implementation inherits this problem, as every implementation does.
+As a result, it is useful in practice to restart it several times and keep the best result.
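+
+One way to apply this advice is to run the fit several times and keep the run with the lowest total cost. The helper below is a hedged sketch: `best_of_restarts` and `fake_fit` are hypothetical names, not package functions, and the helper only assumes that each fit returns an object with a `totalcost` field.
+
+```julia
+# Run `fit` (a zero-argument function) `n_restarts` times and keep the cheapest result.
+function best_of_restarts(fit, n_restarts)
+    n_restarts >= 1 || throw(ArgumentError("n_restarts must be at least 1"))
+    best = fit()
+    for _ in 2:n_restarts
+        candidate = fit()
+        if candidate.totalcost < best.totalcost
+            best = candidate
+        end
+    end
+    return best
+end
+
+# Stand-in for a clustering call so the sketch runs on its own:
+# it returns results with costs 9.0, 4.0 and 7.0 on successive calls.
+costs = [9.0, 4.0, 7.0]
+calls = Ref(0)
+function fake_fit()
+    calls[] += 1
+    return (totalcost = costs[calls[]],)
+end
+
+best = best_of_restarts(fake_fit, 3)
+```
+
+With the package loaded, the same helper could be called as `best_of_restarts(() -> kmeans(X, 3), 10)`, assuming each call draws a fresh random initialization.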
+
 ## Installation
+You can grab the latest stable version of this package from the Julia registries by running:
 
+*NB:* Don't forget to invoke Julia's package manager with `]`
+
+```julia
+pkg> add ParallelKMeans
+```
+
+The brave few can grab the current experimental features by adding the experimental branch to their development environment after invoking the package manager with `]`:
+
+```julia
+pkg> dev git@github.com:PyDataBlog/ParallelKMeans.jl.git
+```
+
+Don't forget to check out the experimental branch, and you are good to go with bleeding-edge features and breakage!
+```bash
+git checkout experimental
+```
 
 ## Features
+- Lightning-fast implementation of the K-Means clustering algorithm, even on a single thread, in native Julia.
+- Support for multi-threaded implementation of the K-Means clustering algorithm.
+- 'Kmeans++' initialization for faster and better convergence.
+- Modified version of Elkan's triangle inequality to speed up the K-Means algorithm.
+
+
+## Pending Features
+- [X] Implementation of the [Hamerly algorithm](https://www.researchgate.net/publication/220906984_Making_k-means_Even_Faster).
+- [ ] Full implementation of the triangle inequality based on [Elkan (2003), "Using the Triangle Inequality to Accelerate K-Means"](https://www.aaai.org/Papers/ICML/2003/ICML03-022.pdf).
+- [ ] Implementation of [Geometric methods to accelerate k-means algorithm](http://cs.baylor.edu/~hamerly/papers/sdm2016_rysavy_hamerly.pdf).
+- [ ] Support for DataFrame inputs.
+- [ ] Refactoring and finalization of API design.
+- [ ] GPU support.
+- [ ] Even faster Kmeans implementation based on current literature.
+- [ ] Optimization of code base.
+- [ ] Improved Documentation
+- [ ] More benchmark tests
 
 
 ## How To Use
+Taking advantage of Julia's brilliant multiple dispatch system, the package offers users a very easy-to-use API.
+
+```julia
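+using ParallelKMeans
+
+# Example input: 4 features × 1,000 observations (one data point per column)
+X = rand(4, 1_000)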
+
+# Uses all available CPU cores by default
+multi_results = kmeans(X, 3; max_iters=300)
+
+# Use only 1 core of CPU
+results = kmeans(X, 3; n_threads=1, max_iters=300)
+```
+
+The main design goal is to offer all available variations of the K-Means algorithm to end users as composable elements. By default, Lloyd's implementation is used, but users can specify different variations of the K-Means clustering algorithm via this interface:
+
+```julia
+some_results = kmeans([algo], input_matrix, k; kwargs)
+
+# example
+r = kmeans(Lloyd(), X, 3)  # same result as the default 
+```
+
+```julia
+# r contains all the learned artifacts, which can be accessed as:
+r.centers               # cluster centers (d x k)
+r.assignments           # label assignments (n)
+r.totalcost             # total cost (i.e. objective)
+r.iterations            # number of elapsed iterations
+r.converged             # whether the procedure converged
+```
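+
+The optional algorithm argument fits naturally with multiple dispatch. The following is a simplified sketch of that pattern, not the package's internals: the types `LloydSketch` and `HamerlySketch` and the functions `algorithm_name` and `cluster` are hypothetical stand-ins.
+
+```julia
+abstract type AbstractSketchAlgorithm end
+struct LloydSketch <: AbstractSketchAlgorithm end
+struct HamerlySketch <: AbstractSketchAlgorithm end
+
+algorithm_name(::LloydSketch) = "Lloyd"
+algorithm_name(::HamerlySketch) = "Hamerly"
+
+# Method taking an explicit algorithm: dispatch selects the variant.
+function cluster(alg::AbstractSketchAlgorithm, X::AbstractMatrix, k::Integer)
+    return "$(algorithm_name(alg)) on $(size(X, 2)) points with k = $k"
+end
+
+# Method without an algorithm: fall back to the default variant.
+cluster(X::AbstractMatrix, k::Integer) = cluster(LloydSketch(), X, k)
+
+X = rand(4, 100)
+default_run = cluster(X, 3)
+explicit_run = cluster(LloydSketch(), X, 3)
+hamerly_run = cluster(HamerlySketch(), X, 3)
+```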
+
+### Supported KMeans Algorithm Variations
+- [Lloyd()](https://cs.nyu.edu/~roweis/csc2515-2006/readings/lloyd57.pdf) 
+- [Hamerly()](https://www.researchgate.net/publication/220906984_Making_k-means_Even_Faster) 
+- [Geometric()](http://cs.baylor.edu/~hamerly/papers/sdm2016_rysavy_hamerly.pdf) - (Coming soon)
+- [Elkan()](https://www.aaai.org/Papers/ICML/2003/ICML03-022.pdf) - (Coming soon) 
+- [MiniBatch()](https://www.eecs.tufts.edu/~dsculley/papers/fastkmeans.pdf) - (Coming soon)
+
+
+### Practical Usage Examples
+Some of the common usage examples of this package are as follows:
+
+#### Clustering With A Desired Number Of Groups
+
+```julia 
+using ParallelKMeans, RDatasets, Plots
+
+# load the data
+iris = dataset("datasets", "iris")
+
+# features to use for clustering (4 features × 150 observations)
+features = collect(Matrix(iris[:, 1:4])')
+
+# various artifacts can be accessed from the result, i.e. assigned labels, cost value, etc.
+result = kmeans(features, 3)
+
+# plot with the point color mapped to the assigned cluster index
+scatter(iris.PetalLength, iris.PetalWidth, marker_z=result.assignments,
+        color=:lightrainbow, legend=false)
+
+```
+
+![K-Means cluster assignments on the iris dataset](iris_example.jpg)
+
+#### Elbow Method For The Selection Of The Optimal Number Of Clusters
+```julia
+using ParallelKMeans
+
+# Single-threaded implementation of Lloyd's algorithm
+b = [ParallelKMeans.kmeans(X, i; n_threads=1, tol=1e-6, max_iters=300, verbose=false).totalcost for i = 2:10]
+
+# Multi-threaded implementation of Lloyd's algorithm (the default)
+c = [ParallelKMeans.kmeans(X, i; tol=1e-6, max_iters=300, verbose=false).totalcost for i = 2:10]
+
+```
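+
+Once the costs are computed, the elbow can be read off a plot or estimated numerically. The sketch below uses one simple heuristic, the largest second difference of the cost curve. `elbow_index` is a hypothetical helper, not a package function, and the cost values are made up for illustration.
+
+```julia
+# Return the index of the cost value with the largest second difference,
+# i.e. the point where the decrease in cost slows down the most.
+function elbow_index(costs)
+    length(costs) >= 3 || throw(ArgumentError("need at least three cost values"))
+    curvature = [costs[i - 1] - 2 * costs[i] + costs[i + 1] for i in 2:length(costs) - 1]
+    return argmax(curvature) + 1   # shift back to an index into `costs`
+end
+
+ks = 2:10
+# Made-up total costs for k = 2, 3, ..., 10
+example_costs = [100.0, 40.0, 35.0, 32.0, 30.0, 29.0, 28.5, 28.2, 28.0]
+
+best_k = ks[elbow_index(example_costs)]
+```
+
+Other elbow heuristics exist and may pick a different value on noisy curves, so treat the result as a starting point rather than a definitive answer.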
+
+
+## Benchmarks
+Currently, this package is benchmarked against similar implementations in Python, Julia, and R. All reproducible benchmarks can be found in the [ParallelKMeans/extras](https://github.com/PyDataBlog/ParallelKMeans.jl/tree/master/extras) directory. More tests in various languages are planned beyond the initial release version (`0.1.0`).
+
+*Note*: All benchmark tests are made on the same computer to help eliminate any bias. 
+
+
+Currently, the benchmark speed tests are based on the search for the optimal number of clusters using the [Elbow Method](https://en.wikipedia.org/wiki/Elbow_method_(clustering)), since this is a practical use case for most practitioners employing the K-Means algorithm.
+
+
+### Benchmark Results
+
+![benchmark_image.png](benchmark_image.png)
+
+
+_________________________________________________________________________________________________________
+
+| 1 million (ms) | 100k (ms) | 10k (ms) | 1k (ms) | package                 | language |
+|:--------------:|:---------:|:--------:|:-------:|:-----------------------:|:--------:|
+| 600184.00      | 31959.00  | 832.25   | 18.19   | Clustering.jl           | Julia    |
+| 35733.00       | 4473.00   | 255.71   | 8.94    | Lloyd                   | Julia    |
+| 12617.00       | 1655.00   | 122.53   | 7.98    | Hamerly                 | Julia    |
+| 1430000.00     | 146000.00 | 5770.00  | 344.00  | Sklearn Kmeans          | Python   |
+| 30100.00       | 3750.00   | 613.00   | 201.00  | Sklearn MiniBatchKmeans | Python   |
+| 218200.00      | 15510.00  | 733.70   | 19.47   | Knor                    | R        |
+
+_________________________________________________________________________________________________________
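+
+To read the table as relative speedups, divide one runtime by another. The snippet below does this for the `1 million (ms)` column, using only figures copied from the table; the variable and function names are illustrative.
+
+```julia
+# Runtimes in milliseconds from the `1 million (ms)` column of the table above
+runtimes_ms = Dict(
+    "Clustering.jl"  => 600184.00,
+    "Lloyd"          => 35733.00,
+    "Hamerly"        => 12617.00,
+    "Sklearn Kmeans" => 1430000.00,
+)
+
+# How many times faster `fast` is than `slow`
+speedup(slow, fast) = runtimes_ms[slow] / runtimes_ms[fast]
+
+hamerly_vs_clustering = speedup("Clustering.jl", "Hamerly")   # about 47.6
+lloyd_vs_clustering   = speedup("Clustering.jl", "Lloyd")     # about 16.8
+hamerly_vs_sklearn    = speedup("Sklearn Kmeans", "Hamerly")  # about 113.3
+```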
+
+
+## Release History 
+- 0.1.0 Initial release
+
+
+## Contributing
+Ultimately, we see this package as potentially the one-stop shop for everything related to the K-Means algorithm and its accelerated variants. We are open to new implementations and ideas from anyone interested in this project.
 
+Detailed contribution guidelines will be added in upcoming releases.
 
+<!--- Insert Contribution Guidelines Below --->
 
 ```@index
 ```