Forecasting Supply Chain Demand Using Machine Learning Algorithms - Distributed Artificial Intelligence, Agent Technology, and Collaborative Applications

Information Technology Reference

In-Depth Information

manufacturers' data sets. Therefore, the automatic exponential smoothing, 20% exponential smoothing

and the 6 period window moving average all provide about the same performance.

The difference when including the Statistics Canada manufacturing data was much larger since the

patterns were stronger as a result of its aggregate nature and there were more data for the automatic

version to achieve better performance. The average error of the automatic exponential smoothing for

all three datasets is 0.6096 and the average for the fixed exponential smoothing of 20% is 0.6337 and

the difference has a significance of 0.00000002159. The moving average with a window of 6 periods

had an average error of 0.6288 and a significance of 0.0000005752 with the average of the automatic

exponential smoothing. We are impartial about which method is best when testing with only the two

manufacturer's data sets. However, when we included the Statistics Canada data set in the testing, there

was a significant difference in favor of the automatic exponential smoothing and consequently we would

identify this technique to be superior since there is added value at no loss.

In the case of the Statistics Canada dataset, the results were a little different; we found the MLR (Table

3 - Rank 3), SVM (Table 3 - Rank 4 and 5) and Theta (Table 3 - Rank 6) outperformed exponential

smoothing (Table 3 - Rank 7). However, because these approaches had such poor performance on the

chocolate and toner cartridge manufacturer datasets and because the performance gain by these over

the ES method was very small, we did not consider these results convincing. They may be the result of

the very large amount of data (12 years) and the aggregate nature of the data that was less noisy.

It is interesting to note that the trend approach (an informal way of planning by extrapolating that

a certain trend will continue in the future) was by far the worst forecasting approach since it always

ranked at the bottom of all three tables (Rank 21 and 22). Also, ARMA and most of the ML approaches

other than SVM showed a relatively poor performance.

Support Vector Machines using the Super Wide Model

The overall best performance was obtained using support vector machines in combination with the Super

Wide model. Since we have previously identified that the best traditional technique was automatic expo-

nential smoothing (it performed well on both manufacturers' data, as well as the aggregate manufacturing

data), we can calculate the forecast error reduction provided by the best ML approach. For the chocolate

manufacturer's dataset (Table 1 - Rank 2 and 5), we found a 6.70% ((0.8270 - 0.7717) / 0.8270) reduction

in the overall forecasting error and for the toner cartridge manufacturer dataset (Table 2 - Rank 1 and

5) we found a 3.11% ((0.6994 - 0.6777) / 0.6994) reduction in the overall forecasting error. In the case

of the Statistics Canada manufacturing dataset (Table 3 - Rank 2 and 7), we found a 10.00% ((0.5055

- 0.4547 / 0.5055) reduction in the forecasting error as compared to automatic exponential smoothing.

This was an average of 4.90% for our two manufacturers' dataset and an average of 6.61% for all three

as compared to automatic exponential smoothing. The performance of the Super Wide models has a

potential to improve further, if more products are included beyond the limit of 100 used in our study.

We will further examine in detail four major components of results; (1) cross- validation, (2) alternative

methods, (3) t-tests and (4) sensitivity analysis.

Cross-Validation

We tested two different support vector machine cross-validation-based parameter optimization proce-

dures: the windowed (time-oriented) approach and the standard approach. For the chocolate manufacturer

Search WWH ::

Custom Search

Home