Database Reference
In-Depth Information
Training
set
selected
Data set
Instance
selection
algorithm
DM
algorithm
Model
Fig. 5.2. IS-TSS strategy.
Figure 5.2 shows a general framework for the application of an IS algorithm for
TSS. Starting from the data set D , the IS algorithm finds a suitable training set
selected, S .
In this work, we use EAs for IS-TSS following a stratified approach [27], [33],
which is outlined in Figure 5.3. The data set, D , is divided into two non-
overlapping sets with the 50% of the elements, T1 and T2 ( D = T1 T2 and T1
T2 = ), which are classically called strata . Then an IS algorithm may be applied
on them independently, obtaining two sets with the selected instances, S1 and S2 .
The final training set will be the union of these sets ( S = S1 S2 ).
This technique seems adequate for applying EAs as IS algorithms to DM
problems with large data sets (see Section 5.5.1).
Data set D
S 1
Training
set selected
T 1
S = S 1 S 2
Instance
selection
algorithm
T 2
S 2
Fig. 5.3. Stratified approach for IS-TSS.
 
Search WWH ::




Custom Search