keyboard. Delete the Filter Examples operator at this time as well. Note that the spline that was
connected to the res port is deleted along with it. This is not a problem: you can reconnect the exa port
from the Replace Missing Values operator to the res port, or you will find that the spline
reappears when you complete the steps under Handling Inconsistent Data.
HANDLING INCONSISTENT DATA
Inconsistent data is different from missing data. Inconsistent data occurs when a value does
exist, but that value is not valid or meaningful. Refer back to Figure 3-25; a close-up version
of that image is shown here as Figure 3-29.
Figure 3-29. Inconsistent data in the Twitter attribute.
What is that 99 doing there? It seems that the only two valid values for the Twitter attribute
should be 'Y' and 'N'. The 99 is inconsistent with those values and is therefore meaningless. As data
miners, we can decide whether to filter this observation out, as we did with the missing
Online_Shopping records, or to use an operator designed to replace certain
values with others.
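For readers who want to see the two choices outside of RapidMiner, here is a minimal sketch in Python using pandas. It is not part of the RapidMiner process described in this chapter. The sample rows are hypothetical, and it assumes the 99 is stored as text alongside 'Y' and 'N'; only the attribute names Twitter and Online_Shopping come from the data set discussed here.

```python
import pandas as pd

# Hypothetical sample rows for illustration; only the attribute
# names (Twitter, Online_Shopping) come from the chapter's data set.
df = pd.DataFrame({
    "Twitter": ["Y", "N", "99", "Y", "N"],
    "Online_Shopping": ["Y", "N", "Y", "N", "Y"],
})

# The only values we consider valid for the Twitter attribute.
valid_values = {"Y", "N"}
is_valid = df["Twitter"].isin(valid_values)

# Choice 1: filter the inconsistent observation out entirely.
filtered = df[is_valid]

# Choice 2: keep the observation, but replace the inconsistent
# value with a missing marker so it can be handled like other
# missing data later on.
replaced = df.copy()
replaced["Twitter"] = replaced["Twitter"].where(is_valid, other=pd.NA)

print(filtered)
print(replaced)
```

Filtering shrinks the data set by one observation, while replacing keeps every row and simply turns the inconsistent value into a missing one. Which choice is better depends on how much the rest of that observation is worth keeping.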
1) Return to design perspective if you are not already there. Ensure that you have deleted
your sampling and filter operators from your stream, so that your window looks like Figure
3-30.
Figure 3-30. Returning to a full data set in RapidMiner.
 