The Silhouette Score is a metric of performance for clustering that represents the quality of clusters made as a result.
More...
|
static double | MeanDistanceFromCluster (const arma::colvec &distances, const arma::Row< size_t > &labels, const size_t &label, const bool &sameCluster=false) |
| Find mean distance of element from a given cluster. More...
|
|
template |
static double | Overall (const DataType &X, const arma::Row< size_t > &labels, const Metric &metric) |
| Find the overall silhouette score. More...
|
|
template |
static arma::rowvec | SamplesScore (const DataType &distances, const arma::Row< size_t > &labels) |
| Find the individual silhouette scores for precomputted dissimilarites. More...
|
|
template |
static arma::rowvec | SamplesScore (const DataType &X, const arma::Row< size_t > &labels, const Metric &metric) |
| Find silhouette score of all individual elements. More...
|
|
The Silhouette Score is a metric of performance for clustering that represents the quality of clusters made as a result.
It provides an indication of goodness of fit and therefore a measure of how well unseen samples are likely to be predicted by the model, considering the inter-cluster and intra-cluster dissimilarities. Silhoutte Score is dependent on the metric used to calculate the dissimilarities. The best possible score is
. Smaller values of Silhouette Score indicate poor clustering. Negative values would occur when a wrong label was put on the element. Values near zero indicate overlapping clusters. For an element i
is within cluster average dissimilarity and
is minimum of average dissimilarity from other clusters. the Silhouette Score
of a Sample is calculated by
The Overall Silhouette Score is the mean of individual silhoutte scores.
Definition at line 40 of file silhouette_score.hpp.