The Big Picture: Toward a Synthesis of RL and Adaptive Tensor Factorization - Realtime Data Mining

Database Reference

In-Depth Information

Chapter 10

The Big Picture: Toward a Synthesis of RL

and Adaptive Tensor Factorization

Abstract We explore the subject of uniting the control-theoretic with the

factorization-based approach to recommendation, arguing that tensor factorization

may be employed to vanquish combinatorial complexity impediments related to

more sophisticated MDP models that take a history of previous states rather than

one single state into account. Specifically, we introduce a tensor representation

of transition probabilities of Markov-k-processes and devise a Tucker-based

approximation architecture that relies crucially on the notion of an aggregation

basis described in Chap. 6 . As our method requires a partitioning of the set of state

transition histories, we are left with the challenge of how to determine a suitable

partitioning, for which we propose a genetic algorithm.

In this research-oriented chapter, we shall study a refinement of the concept of a

Markov decision process which enables a recommendation engine to incorporate

sequences of previously visited products rather than making decision exclusively

upon the current state. As foreshadowed in Chap. 8 , more sophisticated models of

this kind entail some complexity issues. Therefore, so as to vanquish the latter,

we shall introduce a tensor factorization-based approximation framework. The

reasoning provided in this chapter is thus a step toward a unification of classical

(factorization based) data mining on one hand and the novel control-theoretic

framework on the other hand. We should stress, however, that the approach

presented in the following is currently still in its infancy and a subject of ongoing

research. Hence, a major part of the subsequent elaborations are still based upon

speculation rather than scientific rigor.

Search WWH ::

Custom Search

Home