A Framework for Data Warehousing and Mining in Sensor Stream Application Domains - Evolving Application Domains of Data Warehousing and Mining

Database Reference

In-Depth Information

Query Processing Stage

and association mining approach for stream data

application based on multiple sensor relationships.

We compared the estimation accuracy, running

time and memory space usage when applying

each method to our proposed framework.

At the query processing stage, which is on the

top level of the domain-driven framework, dif-

ferent users' query can be fulfilled at the users'

specified query criteria at the same time. The end

user will specify their query parameters on their

online queries, in this case, the support, confi-

dence threshold, and the sliding window size of

their online association queries, or specify they

would use the system recommend parameter val-

ues. The query processor then use these specified

parameters as query criteria to pass the request

to the data warehousing and mining components

and retrieve the query results for different end

users' requests.

Performance Study of

Estimation Accuracy

The evaluation of the estimation accuracy of the

missing values is done by using the average Root

Mean Square Error (RMSE):

estimations

(

)

RMSE

numStates

estimations

Performance Study

where X ai and X ei are the actual value and the

estimated value, respectively; #estimations is the

number of estimations performed in a simulation

run; and numStates is the number of states in which

the actual readings are distributed.

The performance of our proposed framework is

studied by means of simulation. Several different

data warehousing and data mining experiments

are conducted in order to evaluate the proposed

framework while using the Average Window

Size (AWS) approach, the linear interpolation

approach, the linear trend approach, the Window

based Association Rule Mining (WARM) ap-

proach (Halatchev, 2005), and the Closed itemsets

basedAssociation Rule Mining (CARM) approach

(Jiang, 2007).

All these methods are applied to our proposed

framework to answer the user's request for missing

sensor value estimation. The AWS approach uses

the average value of the missing sensor readings

in the current sliding window as the estimated

value. The linear interpolation approach uses

the linear interpolation of the missing sensor's

neighbor readings as the estimated value. The

linear trend uses the linear regression trend for the

missing sensor readings as the estimated value. The

WARM approach (Halatchev, 2005) is a revised

data warehousing technique for sensor network

database based on the relationship between two

sensors. And the CARM approach is a pattern

estimations

(

)

The expression

repre-

estimations

sents the standard error and is an estimate of the

standard deviation under the assumption that the

errors in the estimated values (i.e. X ai - X ei ) are

normally distributed. Thus, the RMSE allows the

construction of confidence intervals describing the

performance of different candidate missing value

estimators. The smaller the RMSE (the standard

deviation), the better the estimated results.

From Figure 5, we can see that CARM gives

the best result of the above approaches in the

proposed framework regarding the estimation

accuracy, followed by the WARM approach.

The linear interpolation, AWS, and linear trend

approaches perform no better than WARM and

CARM approaches. The main reason might be

that it only considers the relationship between the

neighbor nodes, while CARM and WARM find

Evolving Application Domains of Data Warehousing and Mining

Search WWH ::

Custom Search

Home