How Engines Learn to Generate Recommendations: Adaptive Learning Algorithms - Realtime Data Mining

Database Reference

In-Depth Information

Second we observe that with increasing k min the prediction quality of p ss 0

improves, and for higher k min it even outperforms p ss 0 . This is because Assumption

5.2 requires a more complex treatment of the data, including a partition of the

transactions and their different handling for the two probability types. On the

contrary, Assumption 5.1 makes use of all data for unified learning, and hence its

algorithms achieve good prediction results even for small statistical volumes. The

higher the statistical mass, however, algorithms based on Assumption 5.2 increas-

ingly benefit from their structural advantage and finally outperform the simple ones.

Of course, this only applies if Assumption 5.2 is actually realistic! But Table 5.4

seems to confirm that and that's another good news. Finally, we emphasize that

Assumption 5.1 used in conjunction with simple update schemas like Algorithm

4.1, though quite simple, exhibits a good overall prediction rate that is really hard to

top. We will see this, for example, in Sect. 8.4.4 where we will continue this

discussion.

■

So first experience supports Assumption 5.2. The presented results are also

confirmed by similar tests on other data sets. Nevertheless, it is too early to speak

about a full improvement. Yet our methodology may be subject to another critical

objection: despite all random variations by the softmax policy, our recommenda-

tions are still the result of previous analyses and thus not fully statistically inde-

pendent. This raises the question whether the presented results are indeed based on

the effect of recommendations rather than their analytical selection.

Luckily the effect can be studied by comparison with the control group.

We remember that in the control group no recommendations of the RE algorithm

are displayed. In the transaction log files described in Sect. 4.4 (column

itemsAction ), the RDE also stores the products that it would recommend if it

would be allowed to do that. Since recommendation and control sessions are always

mixed in time, these recommendations represent that current one of the RE algo-

rithm. By treating these would-like recommendations in the same way as “real”

recommendations, we can repeat all tests and compare them for both recommen-

dation and control group.

Example 5.3 We again used data from a real-world web shop; this time it was a

fashion shop. We have analyzed data from two days with (in total) about 12,500

different products and 1.6 Mio. transactions. The procedure was exactly like that of

Example 5.2 but now separately for the recommendation and the control group.

Although the recommendations have been less explorative than that of Example 5.2,

we obtained similar results.

Figure 5.4 shows the quotient of conditional and unconditional probabilities for

both groups in the same setting as Fig. 5.3 .

Not surprising the control group coefficient rs C _ ctrl is about 1, whereas the one of

the recommendation group is higher, between 2 and 3. As in Fig. 5.3 it clearly

increases at k min ¼ 20 but then only slightly. The recommendation coefficient rs C

is about twice as high as that of Example 5.2 - this also corresponds to reality

(recommendations in the fashion shop are more accepted) and confirmed by click

statistics.

■

Realtime Data Mining

Search WWH ::

Custom Search

Home