Databases Reference
In-Depth Information
Finally, we outline few research directions in the area of feature
selection.
criteria 113
(1) Define
standardized
evaluation
to
enable
systematic
comparison of existing and novel approaches.
(2) Scale up intelligent focusing approaches by combining technologies from
machine learning research and the database community.
(3) Develop more intelligent focusing solutions that provide data reduction
techniques beyond pure statistical sampling 46 and make use of the
specific characteristics of concrete contexts in data mining.
(4) A more rigorous investigation is required to formulate and implement
unsupervised feature selection as a multi-objective problem.
Much work still remains to be done. Instance and feature selection
corresponds to scaling down data and reduce the feature space. When we
understand better instance and feature selection, it is natural to investigate
if this work can be combined with other lines of research in overcoming the
problem of huge amounts of data.
References
1. J. Kittler. Feature set search algorithms, Pattern Recognition and Signal
Processing . 835-855, (1978).
2. A.
Jain
and
B.
Chandrasekaran.
Dimensionality
and
sample
size
considerations, Pattern Recognition in Practice . 2 , 835-855, (1982).
3. R. Setiono and H. Liu. Neurolinear: From neural networks to oblique decision
rules, Neurocomputing . 17 , 1-24, (1997).
4. K. C. Tan, Q. Yu, and T. H. Lee. A distributed co-evolutionary classifier for
knowledge discovery in data mining, IEEE Transactions on Systems, Man
and Cybernetics: Part C . 35 (2), 131-142, (2005).
5. D. Huang and T. Chow. Effective feature selection scheme using mutual
information, Neurocomputing . 63 , 325-343, (2005).
6. I. Guyon and A. Elisseeff. An introduction to variable and feature selection,
Journal of Machine Learning Research . 3 , 1157-1182, (2003).
7. O. Carlborg and C. Haley. Epistasis: too often neglected in complexity
studies?, Nat. Rev. Genet. 5 (8), 618-625, (2004).
8. H. Vafaie and K. Jong. Genetic algorithms as a tool for feature selection
in machine learning. In Proc. of the 4th International Conference on Tools
with Artificial Intelligence , (1992).
9. C. Su and Y. Hsiao. An evaluation of the robustness of mts for imbalanced
data, IEEE Trans. Knowledge and Data Engineering . 19 (10), 1321-1332
(October, 2007).
Search WWH ::




Custom Search