reference implementation
A software implementation of a JSR specification that validates the
interface for practical implementation and usage. It must meet the
tests defined in the TCK. See also technology compatibility kit (TCK).

regression
A supervised data mining technique that predicts continuous targets.
One of the JDM mining functions.

residual(s)
In regression, the difference between the actual target value and the
predicted value. In time series, the residual is what remains after
accounting for trend, cyclic variations, and interventions.
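The regression sense of the term can be illustrated with a minimal sketch in plain Java. The class and method names (ResidualSketch, residual, residuals) and the sample values are hypothetical and are not part of the JDM API.

```java
import java.util.Arrays;

final class ResidualSketch {
    // Residual for one case: actual target value minus predicted value.
    static double residual(double actual, double predicted) {
        return actual - predicted;
    }

    // Residuals for a set of cases; both arrays must have the same length.
    static double[] residuals(double[] actual, double[] predicted) {
        if (actual.length != predicted.length) {
            throw new IllegalArgumentException("arrays must have equal length");
        }
        double[] result = new double[actual.length];
        for (int i = 0; i < actual.length; i++) {
            result[i] = residual(actual[i], predicted[i]);
        }
        return result;
    }

    public static void main(String[] args) {
        double[] actual = {10.0, 12.5, 9.0};    // hypothetical target values
        double[] predicted = {9.0, 13.0, 9.0};  // hypothetical model output
        // prints [1.0, -0.5, 0.0]
        System.out.println(Arrays.toString(residuals(actual, predicted)));
    }
}
```

A positive residual means the model predicted too low for that case; a negative residual means it predicted too high.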

return on investment
A measure used to make capital investment decisions. One possible
calculation is (increased revenue - costs)/investment.
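As a worked illustration, the calculation can be sketched in plain Java, taking it as (increased revenue - costs) / investment. The class name RoiSketch and all figures are hypothetical.

```java
final class RoiSketch {
    // One possible ROI calculation: (increased revenue - costs) / investment.
    static double roi(double increasedRevenue, double costs, double investment) {
        if (investment == 0.0) {
            throw new IllegalArgumentException("investment must be nonzero");
        }
        return (increasedRevenue - costs) / investment;
    }

    public static void main(String[] args) {
        // Hypothetical figures: 150,000 in added revenue, 50,000 in costs,
        // 200,000 invested.
        double value = roi(150_000.0, 50_000.0, 200_000.0);
        System.out.println(value); // prints 0.5
    }
}
```

Under these assumed figures the return is 0.5, that is, 50 percent of the amount invested.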

ROC
See Receiver Operating Characteristics.

ROI
See Return On Investment.

rule
An expression of the general form if X, then Y. An output of certain
models (e.g., association rules models or decision tree models). The X
may be a compound predicate.
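The if X, then Y form, with X as a compound predicate, can be sketched in plain Java using java.util.function.Predicate. This is only an illustration of the concept: the Customer and Rule classes, the thresholds, and the "likely buyer" label are hypothetical and do not represent how JDM models expose their rules.

```java
import java.util.Optional;
import java.util.function.Predicate;

final class RuleSketch {
    // Hypothetical case with two attributes.
    static final class Customer {
        final int age;
        final double income;

        Customer(int age, double income) {
            this.age = age;
            this.income = income;
        }
    }

    // A rule pairs an antecedent (X) with a consequent (Y).
    static final class Rule<T> {
        private final Predicate<T> antecedent;
        private final String consequent;

        Rule(Predicate<T> antecedent, String consequent) {
            this.antecedent = antecedent;
            this.consequent = consequent;
        }

        // Returns the consequent when the antecedent holds, otherwise empty.
        Optional<String> apply(T item) {
            return antecedent.test(item) ? Optional.of(consequent) : Optional.empty();
        }
    }

    // Compound predicate: age > 30 AND income >= 50000.
    static Rule<Customer> exampleRule() {
        Predicate<Customer> ageOver30 = c -> c.age > 30;
        Predicate<Customer> highIncome = c -> c.income >= 50_000.0;
        return new Rule<>(ageOver30.and(highIncome), "likely buyer");
    }

    public static void main(String[] args) {
        Rule<Customer> rule = exampleRule();
        System.out.println(rule.apply(new Customer(42, 60_000.0))); // prints Optional[likely buyer]
        System.out.println(rule.apply(new Customer(25, 60_000.0))); // prints Optional.empty
    }
}
```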

sample
(n) A representative set of cases taken from a larger data population.
(v) To extract a set of cases from a larger population, typically at
random to minimize bias in the dataset.
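The verb sense can be sketched as simple random sampling without replacement in plain Java. The class name SampleSketch, the seed parameter, and the example population are assumptions made for illustration; this is one common approach, not a JDM facility.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

final class SampleSketch {
    // Draws n cases at random, without replacement, from the population.
    // A fixed seed makes the draw reproducible.
    static <T> List<T> sample(List<T> population, int n, long seed) {
        if (n < 0 || n > population.size()) {
            throw new IllegalArgumentException("n must be between 0 and the population size");
        }
        List<T> shuffled = new ArrayList<>(population); // leave the input untouched
        Collections.shuffle(shuffled, new Random(seed));
        return new ArrayList<>(shuffled.subList(0, n));
    }

    public static void main(String[] args) {
        List<Integer> population = Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8, 9, 10);
        // Three cases chosen at random from the ten available.
        System.out.println(sample(population, 3, 42L));
    }
}
```

Shuffling a copy and taking the first n elements gives every case the same chance of selection, which is what limits selection bias.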

seasonality
In time series, this is a periodic effect due to the recurrence of
certain drivers of the time series, for example strong sales around
holidays. See also time series and cycle.

session
The duration of an open connection to the DME.

settings
The parameters used to control mining operations. See build settings,
apply settings, algorithm settings.

signature attribute
A type of attribute used to define one of the inputs to a model for
test and apply. See model signature.

single-record case
A representation of physical data that uses a single record to store
each case. Each column contains data to be mined that can correspond
to a logical attribute.

SOA
Service Oriented Architecture.

statistics
The science and practice of collecting, organizing, and analyzing
data. In JDM, statistics refers to the type of summary data made
available on individual attributes (univariate) and analysis of
multiple attributes (multivariate). Univariate statistics include values

