Non-Hierarchical Cluster Analysis

Non-Hierarchical Cluster Analysis

Non-hierarchical cluster analysis (often known as K-means Clustering Method) forms a grouping of a set of units, into a pre-determined number of groups, using an iterative algorithm that optimizes a chosen criterion. Starting from an initial classification, units are transferred from one group to another or swapped with units from other groups, until no further improvement can be made to the criterion value. There is no guarantee that the solution thus obtained will be globally optimal - by starting from a different initial classification it is sometimes possible to obtain a better classification. However, starting from a good initial classification much increases the chances of producing an optimal or near-optimal solution.

(source: http://www.asreml.com/products/genstat/mva/NonHierarchicalClusterAnalysis.htm)

The algorithm is called k-means, where k is the number of clusters you want; since a case is assigned to the cluster for which its distance to the cluster mean is the smallest. The action in the algorithm centers on finding the k-means. You start out with an initial set of means and classify cases based on their distances to the centers. Next, you compute the cluster means again, using the cases that are assigned to the cluster; then, you reclassify all cases based on the new set of means. You keep repeating this step until cluster means don’t change much between successive steps. Finally, you calculate the means of the clusters once again and assign the cases to their permanent clusters.

(source: http://www.norusis.com/pdf/SPC_v13.pdf)

Steps in Non-Hierarchical Cluster Analyisis

In this method, the desired number of clusters is specified in advance and the ’best’ solution is chosen. The steps in such a method are as follows:

1. Choose initial cluster centers (essentially this is a set of observations that are far apart — each subject forms a cluster of one and its center is the value of the variables

Non-Hierarchical Cluster Analysis

You May Also Find These Documents Helpful

Nt1330 Unit 3 Assignment 1 Reference Set

Nt1330 Unit 3 Assignment 1 Reference Set

What Does Avogadro's Number Mean

What Does Avogadro's Number Mean

HW 2

HW 2

4 03

4 03

Bsc303 Chapter 1 Study Guide

Bsc303 Chapter 1 Study Guide

LYT2 Task2

LYT2 Task2

Welfare Drug Testing Essay Example

Welfare Drug Testing Essay Example

Probability and Statistics Research Project

Probability and Statistics Research Project

Market Segmentation

Market Segmentation

Category Partition Method

Category Partition Method

Power Law

Power Law

Statistics for Bi - Hypothesis Testing

Statistics for Bi - Hypothesis Testing

To Test the Effects of Antacids on Pepsin's Ability to Digest Protein

To Test the Effects of Antacids on Pepsin's Ability to Digest Protein

Business to Business Marketing

Business to Business Marketing

Data Mining

Data Mining

Related Topics