X Means
This task creates clusters based on the specified feature or features. Unlike KMeans, which requires the exact number of clusters up front, this algorithm only needs an upper limit on the number of clusters. It determines the optimal number of clusters using the Bayesian Information Criterion (BIC).
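To illustrate the idea of picking a cluster count with BIC, here is a minimal sketch in plain Python. It is not this task's implementation, and it simplifies X-means considerably: the published algorithm grows the model by splitting existing clusters and scoring each split, whereas this sketch simply runs k-means for every k up to the limit and keeps the k with the highest BIC. The function names (`kmeans`, `bic`, `choose_clusters`), the farthest-point initialisation, and the spherical-Gaussian form of BIC with a shared variance are all assumptions made for the example.

```python
import math

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iterations=100):
    """Plain k-means with a deterministic farthest-point start (illustrative choice)."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next centroid: the point farthest from its nearest existing centroid.
        centroids.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centroids)))
    labels = None
    for _ in range(iterations):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centroids

def bic(points, labels, centroids):
    """BIC (higher is better) for hard-assigned spherical Gaussians with one shared variance."""
    r, m, k = len(points), len(points[0]), len(centroids)
    sse = sum(sq_dist(p, centroids[lab]) for p, lab in zip(points, labels))
    variance = max(sse / (m * (r - k)), 1e-12)  # guard against zero variance
    sizes = [labels.count(j) for j in range(k)]
    log_lik = sum(n * math.log(n / r) for n in sizes if n > 0)
    log_lik -= r * m / 2 * math.log(2 * math.pi * variance)
    log_lik -= sse / (2 * variance)
    n_params = (k - 1) + m * k + 1  # mixing weights, centroid coordinates, variance
    return log_lik - n_params / 2 * math.log(r)

def choose_clusters(points, max_clusters):
    """Try k = 1..max_clusters and return the k (and labels) with the highest BIC."""
    best = None
    for k in range(1, min(max_clusters, len(points) - 1) + 1):
        labels, centroids = kmeans(points, k)
        score = bic(points, labels, centroids)
        if best is None or score > best[0]:
            best = (score, k, labels)
    return best[1], best[2]

# Toy data: three tight 3x3 grids of points around well-separated centres.
offsets = (-0.5, 0.0, 0.5)
points = [(cx + dx, cy + dy)
          for cx, cy in [(0, 0), (10, 0), (0, 10)]
          for dx in offsets for dy in offsets]

k, labels = choose_clusters(points, max_clusters=6)
print(k, labels)
```

On this toy data the upper limit is 6, but the BIC penalty for extra parameters outweighs the small gain from splitting a tight group, so fewer clusters than the limit are kept. That mirrors how Max Clusters acts as a ceiling rather than a target.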
CONFIGURATION
OPTION | DESCRIPTION |
---|---|
Column Selector | Select one or more columns to use in the analysis. |
Max Clusters | The maximum number of clusters to create. |
INPUT
Any dataset.
OUTPUT
Cluster information is appended to the dex data, and a visual of the cluster distribution is generated.