Linear Regression From Scratch

Linear Regression with statsmodels

Data Cleaning

As seen above, there is no multicollinearity among independent/dependent variables.

MODELLING

Evaluate the model

Linear Regression with scikit-learn

Visualizing Linear Regression

Linear regression is a common machine learning technique that predicts a real-valued output using a weighted linear combination of one or more input values.

The "learning" part of linear regression is to figure out a set of weights w1, w2, w3, ... w_n, b that leads to good predictions. This is done by looking at lots of examples one by one (or in batches) and adjusting the weights slightly each time to make better predictions, using an optimization technique called Gradient Descent.

Lets create some sample data with one feature "x" and one dependent variable "y". We'll assume that "y" is a linear function of "x", with some noise added to account for features we haven't considered here. Here's how we generate the data points, or samples:

Linear Regression

Linear Regression