To analyse given data using coeffificient of correlation and regression line.
Python
Correlation describes the strength of an association between two variables, and is completely symmetrical, the correlation between A and B is the same as the correlation between B and A. However, if the two variables are related it means that when one changes by a certain amount the other changes on an average by a certain amount.
If y represents the dependent variable and x the independent variable, this relationship is described as the regression of y on x. The relationship can be represented by a simple equation called the regression equation. The regression equation representing how much y changes with any given change of x can be used to construct a regression line on a scatter diagram, and in the simplest case this is assumed to be a straight line.
import math
import numpy as np
import matplotlib.pyplot as plt
x = [25,28,35,32,31,36,29,38,34,32]
y = [43,46,49,41,36,32,31,30,33,39]
sx=sy=sxy=sx2=sy2=0
for i in range(0,10):
sx +=x[i]
sy +=y[i]
sxy+=x[i]*y[i]
sx2+=x[i]**2
sy2+=y[i]**2
n=10
r = (n*sxy-sx*sy)/(math.sqrt(n*sx2-sx*2)*math.sqrt(n*sy2-sy*2))
print("The Correlation Coefficient is %.3f"%r)
byx=(n*sxy-sx*sy)/(n*sx2-sx**2)
xmean=sx/n
ymean=sy/n
print("the reg line Y on x : Y=%0.3f %0.3f(X-%0.3f)"%(ymean,byx,xmean))
plt.scatter(x,y)
def Reg(x):
return ymean+byx*(x-xmean)
x=np.linspace(20,40,51)
y1=Reg(x)
plt.plot(x,y1,'r')
plt.xlabel("x-data")
plt.ylabel("y-label")
plt.legend(['REGRESSION LINE', 'DATA POINTS'])
plt.show()
Thus the given data using coeffificient of correlation and regression line is implemented using python program.