Data Mining Techniques for Segmentation - Data Mining Techniques in CRM: Inside Customer Segmentation

Database Reference

In-Depth Information

Model Summary

Algorithm

K-Means

5

Input Features

Clusters

6

Cluster Quality

Poor

Fair

Good

-1.0

-0.5

0.0

0.5

1.0

Silhouette measure of cohesion and separation

Figure 3.14 The overall silhouette measure of the clustering solution.

provides a useful visual examination of the clusters. The following figures show the

Cluster Viewer results for the mobile phone case study introduced in ''How Many

Components Are to Be Extracted?''.

Six clusters were derived after applying a K-means clustering algorithm to the

principal component scores. The cluster quality of the solution (Figure 3.14) is

good with an overall silhouette coefficient slightly greater than 0.5.

The next output of the Cluster Viewer is a cluster-by-inputs grid (Figure 3.15)

that summarizes the cluster centroids. The rows contain all the clustering fea-

tures/input fields, the five principal components, and the columns contain the

revealed clusters. The first row of the table illustrates the size of each cluster. The

table cells denote the cluster centers: the mean values of the component scores

over all members of each cluster.

At first sight the differentiating factors of each cluster are evident. Cluster 1

includes a relatively small number of ''superactive'' customers with increased usage

of all services. Cluster 2 customers are predominantly voice users. On the contrary,

cluster 3 mainly includes SMS users. Cluster 4 customers show increased roaming

usage while cluster 5 customers show increased usage of MMS and Internet

services. The customers in cluster 6 present the lowest usage of all services.

Cluster interpretation can be aided by additional Cluster Viewer graphs such

as the ones presented below. The graph in Figure 3.16 for instance compares

clusters 2 and 3 for the first two components, SMS and voice usage.

Search WWH ::

Custom Search

Home