Predicting Road Accidents Based on Current and Historical Spatio-temporal Traffic Flow Data - Computational Logistics

Information Technology Reference

In-Depth Information

(K=3). When K=3, the mode of the outcomes of the three retrieved cases is consi-

dered. This was repeated for each similarity combination, S TD , S SP and S var . The num-

ber of correct retrievals and the success rate with respect to the different similarity

combinations, S TD , S SP and S var are shown in Table 1 for both situations, i.e., when the

target cases consist of known accident cases and when the target cases consist of

known non-accident cases, generated as explained above. The case base consists of

both accident and non-accident cases.

Table 1. Average number of correct retrievals and success rate of accident and non-accident

cases

S TD

S SP

S var

S All

K=1 K=3

K=1

K=3

K=1

K=3

K=1

K=3

Target cases = Accident cases

Number of correct retrievals

0

25

30

21

24

28

27

Success rate

0

64% 77% 54% 62% 72% 69%

Target cases = Non-accident cases

Number of correct retrievals

0

25

27

26

20

23

0

Success rate

78% 84% 81% 81% 63% 72%

Total success rate

0

0 71% 81% 68% 71% 68% 71%

As can be seen from the table, the best retrieval success rate is obtained when the

spatio-temporal, single-point similarity combination S SP is used with K=3. As expected,

computing the similarity based on spatio-temporal attribute differences has the strongest

prediction power for both accident and non-accident cases as target cases. Counter intui-

tively, it was found that the time-day category attribute is not capable of differentiating

between accidents and non-accidents and on its own is not sufficient to retrieve a case

with the same outcome as the test cases. It is possible that the time and day information

is already included in the flow data. For instance, during rush hour we assume that the

flow of vehicles is large and the speed is small. Also, it is difficult to define accurately

the time-day categories, which are currently arbitrarily defined. The success rate im-

proves in general when three cases are retrieved and the mode of the retrieved cases'

outcomes is used as the solution. By retrieving more than one case, we can reduce the

risk of retrieving an uncharacteristic case that happens to have a large similarity with the

target case. The retrieval success rates obtained with S SP , in particular, look promising

and can be considered to be a good starting point for further research.

5

Conclusion and Future Work

The relatively high success rate of 81% shows that the CBR system under develop-

ment is capable of differentiating between accident and non-accident cases based on

Computational Logistics

Search WWH ::

Custom Search

Home