Information Technology Reference
In-Depth Information
Ta b l e 1 .
Details (folds, repetitions) of an exemplary experiment no. 1
no. of
expe-
riment repetition fold
R
emp
(
ω
I
)
is
ω
I
=
ω
I
?
R
emp
(
ω
I
)
1
1
1
0
.
397
false
0
.
444
1
1
2
0
.
418
true
0
.
369
1
1
3
0
.
400
false
0
.
468
C
=
0
.
417
1
2
1
0
.
359
true
0
.
369
1
2
2
0
.
374
true
0
.
339
0
.
370
0
.
348
C
=
0
.
352
1
2
3
true
.
.
.
.
.
.
1
10
1
0
.
403
true
0
.
384
1
10
2
0
.
395
true
0
.
399
1
10
3
0
.
394
true
0
.
399
C
=
0
.
394
We show the results in two tables 1 and 2. The first one gives an insight on details of
a
single
exemplary experiment: results of its particular folds and repetitions. The second
one shows collective results, where each row encapsulates 10 repetitions
12
.
To comment on the results we first remark that before each single experiment (1-12)
the whole data set was drawn once from
p
(
z
)
and remained fixed throughout repetitions.
However, in the repetitions due to the non-stratified cross-validation we parted the data
set (via permutations) into different training and testing subsets. That is why in the table
R
emp
(
ω
I
)
and
V
are constant per experiment, whereas the cross-validation varies within
some observed range. In the table 2 we also present the interval
[
V
−
ε
L
,
V
+
ε
U
]
which
is implied by the theorems.
Please note that for
all
experiments the observed range for
C
was contained inside
[
V
ε
U
]
— an empirical confirmation of theoretical results. Although the bounds
are true with probability at least 1
−
ε
L
,
V
+
−
α
(
η
,
n
)
, in this particular experiment they held with
frequency one.
In particular one can note in the table that the upper bounds
V
+
ε
U
are closer to
actual
C
outcomes, while lower bounds
V
+
ε
L
are more loose — a fact we already
indicated in theoretical sections. Only in the case of experiment no. 9 the lower bound
we obtained was trivial. In the results one can also observe the qualitative fact that both
intervals tighten with 1
/
√
I
approximately. Keep in mind that this result stops working
for the 'leave-one-out' cross-validation (or a close one) and we experimented on
n
=
3
and
n
=
5.
12
It was difficult to allow ourselves for more repetitions, say 100, due to large amount of results
and the time-consumption of each experiment. Yet, the observed ratio 1
.
0of
C
falling inside
bounds shows that 10 repetitions was sufficient.