A Discovery Method of Anteroposterior Correlation for Big Data Era - Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing

Information Technology Reference

In-Depth Information

APIs, etc. on the web. The exponential growth in the amount of field data means

that innovative measures are needed for data management, analysis, and accessi-

bility. Therefore, almost users' needs are changing. Online databases have become

crucial to access and publish various data depending on the user's purpose, task, or

interest.

First, for leading the solutions, we have to marshal what Big Data is. We con-

sider that the Big Data includes two prominent types of direction for ICT research

purposes.

The one is the scale and the speed issue of data processing. A lot of researchers

have done this theme such as HPC, parallel distributed processing, and etc. Another

is a schemaless data processing issue. It is important to real-timely discover the

answers or clues for a user. A system has to create an appropriate schema from the

data themselves given by a user's query. Until now, data organized on the database

schema. Currently, there are only fragmental various data on the web. This is a

big paradigm shift. That is, it is necessary to create the schema and data structures

corresponding to user's required processing after a user inputs some queries. We

have to shift the system from closed assumption to opened assumption. In the case

of the closed assumption, the schema was designed in advance in consideration of

orthogonal or independence. In the case of the opened assumption, we cannot care

orthogonal and independence when the system dynamically creates schema.

On this background, it is important to research new data analytics method. Es-

pecially, it is necessary to discover the relation between each thing, event and phe-

nomenon. By this discovery, we can do risk aversion, can pursue a cause, or can

predict the phenomenon which may happen in the near future. On the other view-

point, we consider that the essence of Big data is not only massive data processing,

but also optimization of the real world by the knowledge acquired from aggregated

data. The current tendency of the research on Big data is how to aggregate massive

data and how to process these data quickly. The future tendency of researches will

become discovery methods of the optimized solution from the Big data. It is impor-

tant to realize risk aversion, the cause unfolding, and the phenomenon prediction in

the near future. In the first step, it is necessary to extract relationships between each

heterogeneous thing, events and phenomena.

We propose a new concept, anteroposterior correlation. The anteroposterior cor-

relation means the correlation based on the time anteroposterior relation. The antero-

posterior correlations are represented in relative comparison with each conditional

probability distribution. The one of the most important points is relatively compara-

ble. In the other words, this system discovers the relation with higher correlation by

comparing each value of a conditional probability as each weight. In this paper, we

represent a new discover method of anteroposterior correlation between each het-

erogeneous thing, events and phenomena. We discover a correlation in consideration

of the continuity of time. By our method, we effectively discover relationship be-

tween heterogeneous things, events and phenomena. The one of the features of our

method is a measurement correlation by using conditional probability. We calculate

the anteroposterior correlation relative by representing all in conditional probability.

Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing

Search WWH ::

Custom Search

Home