Recommendations as a Game: Reinforcement Learning for Recommendation Engines - Realtime Data Mining

Database Reference

In-Depth Information

Chapter 4

Recommendations as a Game: Reinforcement

Learning for Recommendation Engines

Abstract We describe the application of reinforcement learning to recommendation

engines. At this, we introduce RE-specific empirical assumptions to reduce

the complexity of RL in order to make it applicable to real-live recommendation

problems. Especially, we provide a new approach for estimating transition probabil-

ities of multiple recommendations based on that of single recommendations.

The estimation of transition probabilities for single recommendations is left as an

open problem that is covered in Chap. 5 . Finally, we introduce a simple framework

for testing online recommendations.

An effective approach to using reinforcement learning for recommendation engines

is described below. In the simplest case, the product detail views form the states, the

recommended products the actions, and the rewards the clicks or purchases of

the products. The goal consists (depending on the chosen reward) in maximizing

the activity (clicks) or the success (sales).

Figure 4.1 illustrates the use of RL for product recommendations in a web shop

and shows the interaction between the recommendation engine and the user. Here,

the optimal proven recommendations are marked with “*.”

In the first and third steps, the recommendation engine is following the

proven recommendations (exploitation); in the second step, a new recommendation

is tried out (exploration). The user ignores the first recommendation, but accepts

the second and third. The feedback arrows symbolize the updating of the

recommendations.

Although this modeling may appear self-evident, it nevertheless represents a

highly complex task. Firstly, web shops generally offer very many products, as

a rule between a few thousand up to a few million (for instance, at a bookshop).

Many of these products have virtually no transaction history, i.e., they have scarcely

ever been bought; indeed, some have never even been clicked on. Furthermore,

Search WWH ::

Custom Search

Home