erdogant/clusteval

clusteval

clusteval is a Python package for evaluating detected clusters. It returns the cluster labels with the most favorable clustering tendency, number of clusters, and clustering quality. Three evaluation strategies are implemented (silhouette, dbindex, and derivative), and four clustering methods can be used: agglomerative, kmeans, dbscan, and hdbscan.
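To make the idea of scoring candidate clusterings concrete, here is a minimal, standard-library-only sketch of silhouette-based selection. It is a conceptual illustration and not clusteval's implementation: the function name `silhouette_score`, the toy points, and the hand-written candidate labelings are all assumptions made for this example.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance)."""
    # Group point indices by cluster label.
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            # Convention: a point in a singleton cluster scores 0.
            continue
        # a: mean distance to the other members of the same cluster.
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        total += (b - a) / denom if denom > 0 else 0.0
    return total / len(points)


# Toy data (hypothetical): two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]

# Candidate labelings keyed by number of clusters (written by hand here;
# in practice a clustering method would produce them).
candidates = {
    2: [0, 0, 0, 1, 1, 1],
    3: [0, 0, 1, 2, 2, 2],
}

scores = {k: silhouette_score(points, labels) for k, labels in candidates.items()}
best_k = max(scores, key=scores.get)
best_labels = candidates[best_k]
print(best_k, best_labels)
```

The labeling with the highest mean silhouette is kept, which mirrors the general approach of evaluating several cluster counts and returning the labels of the best-scoring one.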

⭐️ Star this repo if you like it ⭐️

📘 Blogs

  1. A step-by-step guide for clustering images
  2. Detection of Duplicate Images Using Image Hash Functions
  3. From Data to Clusters: When is Your Clustering Good Enough?
  4. From Clusters To Insights; The Next Step

📚 Documentation

Full documentation is available at erdogant.github.io/clusteval, including examples and API references.


⚙️ Installation

It is advisable to use a virtual environment:

conda create -n env_clusteval python=3.12
conda activate env_clusteval

Install via PyPI:

pip install clusteval

To upgrade to the latest version:

pip install --upgrade clusteval

Import the library:

from clusteval import clusteval

💡 Examples

A structured overview is available in the documentation.


Example topics:

  1. Silhouette Score
  2. Optimal Clusters
  3. Dendrogram
  4. Davies-Bouldin Index
  5. Derivative Method
  6. DBSCAN
  7. HDBSCAN A
  8. HDBSCAN B

📖 Citation

Please cite clusteval in your publications if it has been helpful in your research. Citation information is available at the top right of the GitHub page.


🔍 Related Tools & Blogs


☕ Support

If you find this project useful, consider supporting me: