Tag: Data Science Project
-
Clustering Mixed Data

Code and data used can be found here: Repository Objective To identify clusters of individuals that have suffered from heart failure. The data consists of both numerical and categorical data, to cluster such data we can use K-prototypes clustering. K-prototypes Clustering K-prototypes work similarly to K-means clustering but works for both numerical and categorical data.…
-
Will it Rain Tomorrow?

Code and data used can be found here: Repository In this episode we will be expanding on Logistic Regression in Python, implementing much more data pre-processing steps on a larger data set that contains both numerical and categorical data (words). Objective Construct a logistic regression model to predict if it will rain tomorrow in a…
-
Predicting Weather Data

Code and data used can be found here: Repository This episode combines knowledge from all previous episodes to build, evaluate and improve a ridge regression model that makes predictions for weather data in Hungary, Szeged. Objective Construct a regression model that makes reasonable predictions for Humidity given the follow data: Link Our model should take…