Biometrics Northwest LLC

Performing Data Analysis and Modeling

Home

Services

About Us

Projects

Contact Us

Disclaimer

Unsupervised clustering using nearest neighbor likelihood

Nearest neighbor likelihood values are computed using a critical distance threshold. These nearest neighbor likelihood values are then used to identify level sets that are used to find data points that are far from points that have previously been clustered. In this context 'far' means outside the critical distance threshold. The first cluster is assumed to contain the most likely data point, which is used to start the algorithm, and approximate cluster centers are assigned as the most likely data point in each cluster. Singleton clusters and isolated clusters containing a small number of points are frequently found. These may be removed or assigned to the closest cluster.

Description	Data	Clusters
A one-dimensional example using a mixture of two normal distributions. The plots show the nearest neighbor likelihood values on the y-axis for the data and the identified clusters.
A two-dimensional example using a mixture of five normal distributions.
A two dimensional example using a data set consisting of concentric rings. These data were first converted to magnitude values by computing their distances from the origin and then clustered in one-dimension.
A two-dimensional example with structured data consisting of four spiral arms.
A three dimensional example using a mixture of five normal distributions.

For information send email to: info@biometricsnw.com

Last Update: October 20, 2024