Database Reference
In-Depth Information
6.3.3 Criticism of the Previous Approaches
As reported in Wong et al.'s work, the EP formulation seems to have a clear
advantage over the GA one. To account for this, we conjecture that the per-
formance gain is largely due to the different choice of genetic operators in
producing the offspring, which, in effect, influences the exploration of search
space. On the one hand, the success of EP readily suggests that sheer muta-
tions, which correspond to adding or removing an edge or the combination
of the two, are good enough for generating new search points. On the other
hand, the crossover operation, which plays an important role in GAs, seems
to be ineffective. The reason is not di cult to understand because the one-
point crossover, in our case, effectively recombines two graphs arbitrarily. In
most cases, this could result in an invalid structure. In this regard, such re-
combination would seem insignificant and offspring do not properly inherit,
which is supposedly the merit of the crossover operator. Although the inten-
tion to exchange information among the population is good, the traditional
one-point crossover fails to achieve the purpose.
Despite the EP approach performs better, we note that it often requires
a long execution time. For instance, to learn a network with 37 nodes from
a given data set of 10,000 cases, MDLEP needs about an hour to find the
solution, 5 which is intolerable for practical use. At closer inspection, we find
that a major cause of its long execution time is that there are much more
worse offspring (comparing an offspring with its parent) produced than bet-
ter offspring on average. From our experience, if the population size is 50,
we would have, on average, fewer than five better offspring produced in each
generation. This implies that most of the mutation operations generate in-
ferior network structures. Hence, we conjecture that MDLEP is not e cient
enough in finding good solutions.
6.4 Proposed Algorithm
It is straightforward to combine the two approaches for various purposes.
In the literature, there are several attempts that show different realizations
of the idea. 6 We propose a new hybrid framework that targets improving
the learning e ciency. In essence, information from dependency analysis is
exploited in the search-and-scoring process so that the searching will be more
e cient.
5 We use 5000 generations as the termination criterion.
6 For instance, the CB algorithm [6.34] employs a search-and-score approach (i.e.,
K2), which takes as input a node ordering given by the network constructed
by a dependency analysis approach. In another scenario, the BENEDICT algo-
rithm [6.35] uses a metric definition that reflects the discrepancy between the
independence assertions implied by the network and the data.
Search WWH ::




Custom Search