Database Reference
In-Depth Information
Theorem 8.1 states that the truncated SVD provides the best matrix factorization
with respect to quality of approximation. In general, however, this goes along with a
loss of stochasticity of the matrix of transition probabilities P
:¼ P k .
Hence, we require, on one hand, that the factorized transition probabilities
satisfy
p ss 0 1, 8s , s 0
0 e
S
:
ð 8
:
26 Þ
On the other hand, the truncated SVD also violates the row sum condition (3.2).
Thus, we furthermore require
X
s 0 p ss 0 ¼ 1 8s
S
:
ð 8
:
27 Þ
It is possible to impose the conditions ( 8.26 ) and ( 8.27 ) by subsequent normal-
ization of the factorization ( 8.14 ). Although this may look somewhat artificial at the
first glance, one must take into account that violation of stochasticity by a rank- k
approximation has no deeply rooted causes with respect to content. And the
truncated SVD is still the best approximation.
An alternative remedy consists in using specific nonnegative matrix factoriza-
tions, which we shall consider in the next section.
8.4.3 Nonnegative Matrix Factorizations
In a nonnegative matrix factorization (NMF), we consider the problem ( 8.1 ) for a
nonnegative matrix A and require that the factor matrices X, Y, as well, be nonneg-
ative. With regard to the Frobenius norm, we obtain the following problem
statement:
2
F , X , Y 0
min
k
A XY
k
:
ð 8
:
28 Þ
,
mk
kn
X ∈R
Y ∈R
As compared with the SVD-based approach, this gives rise to certain advantages
for nonnegative A . Most importantly, the matrices X,Y have a clear interpretation -
as opposed to the SVD. If we consider the columns of the left factor X as basis
vectors, i.e.,
X ¼ x 1
½
j
...
jx k
,
then the columns of A ,
A ¼ a 1
½
j
...
ja m
,
are approximately expressed in terms of the basis B X : ¼ { x 1 ,
...
, x k }:
a j x 1 y 1 j þ ...þ x k y kj ,1 j n
:
Search WWH ::




Custom Search