What is Linear Regression?

In this topic we actually learn how do we make Machines learn from data and what particular Mathematics is used behind that and how to the particular algorithm is built.

Linear Regression is a part of Supervised Learning Technique which is first most basic part of Machine Learning. So, Linear Regression can be called as first most Machine Learning algorithm.

Linear Regression

Definition:

Linear Regression is a Supervised Learning Algorithm that learns from a set of training samples. It estimates the relationship between a dependent variable (target/label) and one or more independent variable (predictors).

Linear Regression

Regression Equation:

y = mx + c

Where,

Y is Output variable,

X is Input variable

m and c are the parameters

Types of Linear Regression:

There are three types of Linear Regression. They are:

Univariate Linear Regression
Multivariate Linear Regression
Polynomial Linear Regression

Linear Regression Type

So, clearly from the equation we can make out that the Regression equation depends on the values of m and c. Let’s see that in the below examples.

Linear Regression Equations

is the predicted value

When m=0 and c=40; Line is parallel to the X-axis.

When m=0.8 and c=0; Line passes across center.

When m=0.8 and c>0; Line is shifted above the x-axis for the value equal to c.

Error function:

e = – y

Where is Predicted value and

Y is Actual value.

Error depends on the values of m and c.

Linear Regression Error Functions

Our aim here is to build an algorithm which can minimize the error of the problem.

Cost function:

For that we use the cost function of Linear Regression. Which is,

Linear Regression Cost Functions

Here goal is to minimize the cost function J by changing m and c. This is called optimization and in order to perform this we need particular optimizers. The most important optimizer we use here in Linear Regression among all is Gradient Descent Algorithm.

Gradient Descent Algorithm

Where,

w is the parameter(m or c) and

lr is the Learning rate

What is Learning Rate?

Learning Rate is the value which the developer (person building the model) gives which decides how fast the algorithm learns from data.

Learning Rate should range between 0 to 1.

Simple Python code for Linear Regression:

# Importing Libraries
import numpy
import matplotlibas plt
Import pandas
# Importing data set
dataset=pandas.read_csv(‘salary_data.csv’) X=dataset.iloc[:,:-1].values Y=dataset.iloc[:,1].values
# Train Test Spilt
from sklearn.model_selectionimport train_test_split
xtrain,xtest,ytrain,ytest=
train_test_split(X,y,test_size=0.2,random_state=0)
# Linear Regression method
from sklearnimport linear_model
alg= linear_model.LinearRegression()
alg.fit(xtrain,ytrain)
# Predicting the test results
ypred=alg.predict(xtest)

General

What is Linear Regression?