Database Reference
In-Depth Information
Extracting features from the bike sharing dataset
To illustrate the concepts in this chapter, we will be using the bike sharing dataset. This
dataset contains hourly records of the number of bicycle rentals in the capital bike sharing
system. It also contains variables related to date and time, weather, and seasonal and holi-
day information.
Note
The dataset is available at http://archive.ics.uci.edu/ml/datasets/Bike+Sharing+Dataset .
Click on the Data Folder link and then download the Bike-Sharing-Dataset.zip
file.
The bike sharing data was enriched with weather and seasonal data by Hadi Fanaee-T at
the University of Porto and used in the following paper:
Fanaee-T, Hadi and Gama Joao, Event labeling combining ensemble detectors and back-
ground knowledge, Progress in Artificial Intelligence , pp. 1-15, Springer Berlin Heidel-
berg, 2013.
The paper is available at http://link.springer.com/article/10.1007%2Fs13748-013-0040-3 .
Once you have downloaded the Bike-Sharing-Dataset.zip file, unzip it. This will
create a directory called Bike-Sharing-Dataset , which contains the day.csv ,
hour.csv , and the Readme.txt files.
The Readme.txt file contains information on the dataset, including the variable names
and descriptions. Take a look at the file, and you will see that we have the following vari-
ables available:
instant : This is the record ID
dteday : This is the raw date
season : This is different seasons such as spring, summer, winter, and fall
yr : This is the year (2011 or 2012)
mnth : This is the month of the year
hr : This is the hour of the day
holiday : This is whether the day was a holiday or not
weekday : This is the day of the week
Search WWH ::




Custom Search