Section 16 Data Overview
I am interested in applying multivariate analysis to understand the factors affecting property prices. To do this, I sourced a property data-set from the website Kaggle (2015). The original data-set contains sale prices and property details for 21613 residential property transactions between May 2014 and May 2015 in King County USA (includes Seattle).
In the Data Source sub-section I discuss the Original Data set and conduct an in-depth exploratory analysis using a range of visualisations. Detecting Outliers and Errors is discussed in the Outliers and Errors sub-section. Multivariate techniques such as a Principal Component Decomposition and a Multivariate Clustering Technique are applied.
References
Kaggle. 2015. “This Dataset Contains House Sale Prices for King County.” https://www.kaggle.com/harlfoxem/housesalesprediction.