Databases Reference
In-Depth Information
It looks like there's
kind of
a linear relationship here, and it makes sense;
the more new friends you have, the more time you might spend on the
site. But how can you figure out how to describe that relationship?
Let's also point out that there is no perfectly
deterministic
relationship
between number of new friends and time spent on the site, but it makes
sense that there is an
association
between these two variables.
Start by writing something down
There are two things you want to capture in the model. The first is the
trend
and the second is the
variation
. We'll start first with the trend.
First, let's start by assuming there actually
is
a relationship and that it's
linear. It's the best you can do at this point.
There are many lines that look more or less like they might work, as
shown in
Figure 3-3
.
Figure 3-3. Which line is the best fit?