Advanced Analytical Theory and Methods: Classification - Data Science and Big Data Analytics

Database Reference

In-Depth Information

Figure 7.2 Example of a decision stump

To illustrate how a decision tree works, consider the case of a bank that wants

to market its term deposit products (such as Certificates of Deposit) to the

appropriate customers. Given the demographics of clients and their reactions to

previous campaign phone calls, the bank's goal is to predict which clients would

subscribe to a term deposit. The dataset used here is based on the original dataset

collected from a Portuguese bank on directed marketing campaigns as stated in

the work by Moro et al. [6]. Figure 7.3 shows a subset of the modified bank

marketing dataset. This dataset includes 2,000 instances randomly drawn from

the original dataset, and each instance corresponds to a customer. To make the

example simple, the subset only keeps the following categorical variables: (1) job ,

(2) marital status, (3) education level, (4) if the credit is in default , (5) if

there is a housing loan, (6) if the customer currently has a personal loan , (7)

contact type, (8) result of the previous marketing campaign contact ( poutcome ),

and finally (9) if the client actually subscribed to the term deposit. Attributes (1)

through (8) are input variables, and (9) is considered the outcome. The outcome

subscribed is either yes (meaning the customer will subscribe to the term

deposit) or no (meaning the customer won't subscribe). All the variables listed

earlier are categorical.

Search WWH ::

Custom Search

Home