Database Reference
In-Depth Information
9.19
Comment on the dashboard design guidelines.
9.20
Define a dashboard using an example of an application domain that
you are familiar with.
9.7 Exercises
9.1
Consider the following training data about students:
StudID Age Country FamilyIncome Distance Finish
s1
1
local
low
1
1
s2
0
local
medium
0
0
s3
0
local
high
1
0
s4
0
local
medium
1
0
s5
4
foreigner
medium
2
1
s6
3
foreigner
medium
1
1
s7
3
foreigner
low
1
2
s8
2
foreigner
low
1
3
s9
1
local
high
2
3
s10
0
local
high
1
2
where the classes are as follows:
Age
indicates the age at which the student started the studies.
Possible values are as follows: 0 (between 17 and 21), 1 (between
22 and 26), 3 ( between 27 and 32), and 4 (older than 32).
Country
can have two values:
local
and
foreigner
.
FamilyIncome
can be
low
,
medium
,and
high
.
Distance
indicates the distance that the student has to travel to go
to university. It can take values 0 (less than 1 mile), 1 (between 1
and 3 miles), and 2 (more than 3 miles).
Finish
indicates whether the student finished her studies in the years
planned for the corresponding career. It can take the values 0 (the
student finished her studies on time), 1 (the student finished at most
with 1-year delay), 2 (the student finished with 2 or more years of
delay), and 3 (the student abandoned her studies).
(a) Manually run the ID3 algorithm to build a decision tree over the
class
Finish
. Use the Gini index to partition the nodes.
(b) Use the K-means algorithm to generate three clusters of students.
9.2
Consider the Foodmart data warehouse of Fig.
6.4
:
(a) Build a decision tree model that predicts whether a new customer
is likely to order an item of a (sub)category
X
.Usethe
Customer
,
Product
,
Product class
,and
Sales
tables in the Foodmart data
Search WWH ::
Custom Search