Table 1. Description of the training and test datasets in free groups F_10, F_15 and F_20.
Dataset   size    %min   %non-min   (min, avg, max) word lengths
D         20000   49.1   50.9       (3, 558.2, 1306)
S_e       5000    48.9   51.1       (3, 559, 1292)
S_10      5000    49.1   50.9       (3, 1016.5, 13381)
S_R       5000    98.3   1.7        (3, 501.2, 999)
S_P       3850    0.0    100.0      (3, 194.7, 8719)
a) F_3
Dataset   size    %min   %non-min   (min, avg, max) word lengths
D         20000   48.5   51.5       (5, 581.3, 1388)
S_e       5000    49.2   50.8       (8, 583.7, 1382)
S_10      5000    48.0   52.0       (7, 1693.22, 28278)
S_R       5000    97.2   2.8        (6, 504.2, 999)
S_P       2900    0.0    100.0      (5, 656.9, 22430)
c) F_5
Dataset   size    %min   %non-min   (min, avg, max) word lengths
D         9660    48.9   51.1       (26, 617.4, 1461)
S_e       4811    49.2   50.8       (26, 619.7, 1443)
S_10      4837    49.5   50.5       (29, 2589.8, 65274)
S_R       4867    96.5   3.5        (18, 512.7, 999)
S_P       165     0.0    100.0      (12, 150.8, 1459)
a) F_10
Dataset   size    %min   %non-min   (min, avg, max) word lengths
D         9357    49.5   50.5       (41, 635.3, 1472)
S_e       4685    49.2   50.8       (40, 642.5, 1462)
S_10      4722    49.7   50.3       (46, 3056.6, 53422)
S_R       4755    95.3   4.7        (26, 523.8, 999)
S_P       870     0.0    100.0      (28, 1109.3, 4981)
b) F_15
Dataset   size    %min   %non-min   (min, avg, max) word lengths
D         9144    49.6   50.4       (47, 658.3, 1488)
S_e       4576    49.3   50.7       (48, 659.8, 1484)
S_10      4597    49.1   50.9       (64, 3351.4, 68316)
S_R       4643    94.0   6.0        (48, 534.9, 999)
S_P       182     0.0    100.0      (66, 945.1, 4762)
c) F_20
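The columns of the tables above (size, %min, %non-min, and the minimum, average and maximum word length) can be computed from any labelled set of words. The following is a minimal sketch, not the authors' code. It assumes, purely for illustration, that a word is stored as a list of non-zero integers (i for the generator x_i, -i for its inverse) paired with a Boolean label saying whether the word is minimal. The function name summarize and the toy data are hypothetical.

```python
def summarize(dataset):
    """Return the table columns for a labelled dataset.

    dataset: list of (word, is_minimal) pairs. The encoding of a word as a
    list of non-zero integers is an assumption made for this sketch only.
    """
    if not dataset:
        raise ValueError("dataset must be non-empty")

    size = len(dataset)
    lengths = [len(word) for word, _ in dataset]
    n_min = sum(1 for _, is_minimal in dataset if is_minimal)

    # Percentages are rounded to one decimal place, as in the tables.
    pct_min = round(100.0 * n_min / size, 1)
    return {
        "size": size,
        "pct_min": pct_min,
        "pct_non_min": round(100.0 - pct_min, 1),
        "lengths": (min(lengths), round(sum(lengths) / size, 1), max(lengths)),
    }


# Toy data with made-up labels, only to show the call.
toy = [
    ([1, -2, 3], True),
    ([1, 2, 2, -1, 3], False),
    ([2, 2, -1, 3], False),
    ([1, 1, 1, 2], True),
]
row = summarize(toy)
```

Applied to each of D, S_e, S_10, S_R and S_P in turn, a function of this kind would yield one table row per dataset.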
4.2 Accuracy Measure
To evaluate the performance of the classification system PR_MIN, we define an accuracy measure A.
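As a rough illustration only, the sketch below assumes that A is the fraction of words in a test set whose minimal/non-minimal label is predicted correctly. This is a common choice, and the definition actually used for PR_MIN may differ in detail. The function name accuracy and the sample labels are hypothetical.

```python
def accuracy(predicted, actual):
    """Fraction of test words whose predicted label matches the true label.

    Both arguments are equal-length sequences of Booleans
    (True = minimal, False = non-minimal). This definition of A is an
    assumption made for the sketch.
    """
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("test set must be non-empty")

    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return correct / len(actual)


# Made-up labels: three of the four predictions agree with the truth.
a_value = accuracy([True, False, True, True], [True, False, False, True])
```

Under this assumption A lies between 0 and 1. On the heavily unbalanced sets S_R and S_P a single overall fraction can be misleading, so per-class counts are worth reporting alongside it.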