$$\underset{x \in [0,1]^N}{\text{maximize}} \quad U(x) = g_N(x) - K\, t_N(x) - \sum_{i=1}^{N} \rho_i\, t_{i-1}(x) \tag{4}$$

$$\text{subject to} \quad 0 \le x \le 1 \quad \text{and} \quad M r \le r^{\max}$$
2.3 Operating Point Selection
corresponding literature. The solutions proposed involve iterative optimization techniques such as Sequential Quadratic Programming (SQP), which is based on gradient descent and models the nonlinear optimization problem as an approximate quadratic programming subproblem at each iteration, ultimately converging to a locally optimal solution.
A significant body of work discusses how to adapt stream mining under resource constraints. These approaches are based on load-shedding algorithms, which determine when, where, what, and how much data to discard given the observed data characteristics and the desired quality of service. However, the assumption behind naive load-shedding does not hold for more complex data classification tasks, where results depend on the dynamic statistical properties of the data, the semantic relationships between the concepts of interest, the underlying features extracted, the selected classification algorithms, etc.
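As a minimal illustration of the load-shedding idea described above (not the scheme any particular cited work uses), the sketch below drops a random fraction of incoming items whenever the observed arrival rate exceeds a processing capacity; `CAPACITY`, `shed_fraction`, and `filter_stream` are hypothetical names introduced here.

```python
# Minimal load-shedding sketch: discard a fraction of incoming items
# when the observed arrival rate exceeds the processing capacity.
import random

random.seed(1)
CAPACITY = 100.0  # items/sec the system can process (assumed value)

def shed_fraction(arrival_rate):
    """Fraction of data to discard so the retained load fits capacity."""
    return max(0.0, 1.0 - CAPACITY / arrival_rate)

def filter_stream(items, arrival_rate):
    """Randomly drop each item with the computed shedding probability."""
    p_drop = shed_fraction(arrival_rate)
    return [x for x in items if random.random() >= p_drop]

kept = filter_stream(range(1000), arrival_rate=250.0)
print(len(kept), "of 1000 items kept; drop probability =",
      round(shed_fraction(250.0), 2))
```

Note that this decides only *how much* data to process; it is blind to the semantic and statistical factors listed above, which is exactly the limitation the text points out.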
Instead of deciding on what fraction of the data to process, as in load-shedding based approaches, we propose to determine how the available data should be processed given the underlying resource allocation. Hence, we allow individual classifiers in the ensemble to select different operating points in order to maximize the global end-to-end performance while meeting system resource constraints. This simultaneous configuration of multiple classifiers makes our problem significantly more complex than traditional NOPs, all the more due to the non-concavity of the utility function.
In our setting, selecting the operating point for a fixed order
σ
f
can be done by
applying the SQP algorithm to the Lagrangian function of the optimization problem:
$$L(x, \nu_1, \nu_2) = U(x) - \nu_1^{T}(x - 1) + \nu_2^{T} x.$$
Because of the gradient-descent nature of the SQP algorithm, convergence to the global maximum cannot be guaranteed; the solution may only be locally optimal. However, the SQP algorithm can be initialized with multiple starting configurations in order to find a better local optimum (or even the global optimum). Since the number and size of the local optima depend on the shapes of the DET curves of the individual classifiers, a rigorous bound on the probability to find