recoded) value. Upon performing the transformation on a column,
all matching original values are replaced with the recoded values.
reference implementation A software implementation of a JSR
specification that validates the interface for practical implementa-
tion and usage. It must meet the tests defined in the TCK. See also
technology compatibility kit (TCK) .
regression A supervised data mining technique that predicts
continuous targets. One of the JDM mining functions.
residual(s) In regression , the difference between the actual target
value and the predicted value. In time series , residual is what remains
after accounting for trend, cyclic variations, and interventions.
return on investment A measure used to make capital investment
decisions. One possible calculation involves (increased revenue
ROC See Receiver Operating Characteristics.
ROI See Return On Investment.
rule An expression of the general form if X , then Y . An output of
certain models (e.g., association rules models or decision tree mod-
els). The X may be a compound predicate.
sample ( n ) A representative set of cases taken from a larger data
population. ( v ) To extract a set of cases from a larger population,
typically at random to minimize bias in the dataset.
seasonality In time series, this is a periodic effect due to the recur-
rence of certain drivers of the time series, for example strong sales
around holidays. See also time series and cycle .
session The duration of an open connection to the DME.
settings The parameters used to control mining operations. See
build settings, apply settings, algorithm settings .
signature attribute A type of attribute used to define one of the
inputs to a model for test and apply. See model signature .
single-record case A representation of physical data that uses a
single record to store each case. Each column contains data to be
mined that can correspond to a logical attribute.
SOA Service Oriented Architecture.
statistics The science and practice of collecting, organizing, and
analyzing data. In JDM, statistics refers to the type of summary data
made available on individual attributes (univariate) and analysis of
multiple attributes (multivariate). Univariate statistics include values