This project implements an unsupervised approach to segment cell images into distinct regions: Background, Cytoplasm, and Nuclei. The study explores the application of multiple clustering algorithms for the segmentation of cell images. Five clustering techniques—KMeans, MiniBatchKMeans, BisectingKMeans, Birch, and GaussianMixture—are compared to evaluate their performance in terms of Jaccard scores across a small dataset of images. Additionally, an ensemble approach (referred to as "Clusterer") aggregates the results of individual algorithms through majority voting to achieve robust and consistent segmentation. The findings highlight the ensemble's reliability and stability, along with key insights into the strengths and weaknesses of individual clustering methods.
The dataset consists of segmented cell images, each evaluated using Jaccard scores to quantify clustering performance.
A majority voting mechanism combines the outputs of all individual clustering algorithms to produce an aggregated result for each image. This ensemble method aims to improve performance stability and address inconsistencies in individual algorithm predictions.
Results are summarized and visualized to identify trends, assess consistency, and evaluate the benefits of the ensemble approach.
See the following tables for detailed results and descriptive statistics:
This table provides Jaccard scores for each clustering algorithm on individual images:
Image Name | KMeans | MiniBatchKMeans | BisectingKMeans | Birch | GaussianMixture | Clusterer |
---|---|---|---|---|---|---|
12a.jpg | 0.931569 | 0.925057 | 0.866394 | 0.912475 | 0.913492 | 0.916149 |
12b.jpg | 0.913992 | 0.913760 | 0.899200 | 0.906678 | 0.904579 | 0.914282 |
12c.jpg | 0.907294 | 0.907246 | 0.904665 | 0.877766 | 0.887136 | 0.907246 |
12d.jpg | 0.905713 | 0.908069 | 0.907775 | 0.899540 | 0.893810 | 0.914475 |
12e.jpg | 0.916694 | 0.696940 | 0.636804 | 0.906669 | 0.915642 | 0.906820 |
This table summarizes the descriptive statistics for Jaccard scores across all images:
Metric | KMeans | MiniBatchKMeans | BisectingKMeans | Birch | GaussianMixture | Clusterer |
---|---|---|---|---|---|---|
Count | 5.000000 | 5.000000 | 5.000000 | 5.000000 | 5.000000 | 5.000000 |
Mean | 0.915052 | 0.870214 | 0.842968 | 0.900626 | 0.902932 | 0.911794 |
Std | 0.010296 | 0.097124 | 0.116427 | 0.013577 | 0.012334 | 0.004409 |
Min | 0.905713 | 0.696940 | 0.636804 | 0.877766 | 0.887136 | 0.906820 |
25% | 0.907294 | 0.907246 | 0.866394 | 0.899540 | 0.893810 | 0.907246 |
50% | 0.913992 | 0.908069 | 0.899200 | 0.906669 | 0.904579 | 0.914282 |
75% | 0.916694 | 0.913760 | 0.904665 | 0.906678 | 0.913492 | 0.914475 |
Max | 0.931569 | 0.925057 | 0.907775 | 0.912475 | 0.915642 | 0.916149 |