++++Data Science
May 2026×Notebook lesson
Notebook converted from Jupyter for blog publishing.
04-Linear-Regression-Project-DataSet
Driptanil DattaSoftware Developer
NOTE: BEFORE WE CAN CONTINUE WITH THIS PROJECT WE NEED TO PERFORM FEATURE ENGINEERING. THE NEXT SECTION OF THE COURSE WILL FOCUS ON TEACHING YOU FEATURE ENGINEERING ON THIS DATASET.
JUMP FORWARD TO THE FEATURE ENGINEERING SECTION
The dataset we will be working with later on in the next section:
- Ames Iowa Data Set: http://jse.amstat.org/v19n3/decock.pdf (opens in a new tab)
import pandas as pddf = pd.read_csv("..\DATA\Ames_Housing_Data.csv")df.head()HTML
MORE
PID
MS SubClass
MS Zoning
Lot Frontage
Lot Areadf.info()STDOUT
MORE
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 2930 entries, 0 to 2929
Data columns (total 81 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----