scatter plot and karl Pearson's correlation cofficient
correlation analysis is used as a statistical tool to ascertain the association between variables.it may be noted that correlation analysis is one of the most widely used statistical techniques adopted by applied stasticans
Summary
correlation analysis is used as a statistical tool to ascertain the association between variables.it may be noted that correlation analysis is one of the most widely used statistical techniques adopted by applied stasticans
Things to Remember
- understand the importance as also the limitations of correlation analysys.
- know when pearson's coefficient of correlation can be used appropriately.
- recognize when a scatter diagram suggest a relationship between two variables.
MCQs
No MCQs found.
Subjective Questions
No subjective questions found.
Videos
No videos found.

scatter plot and karl Pearson's correlation cofficient
scatter plot
scatter plots are graphs with plots between two variable.one variable values are kept in X-axis and another variable's values are kept in Y-axis.the diagram formed , by plotting these pairs of values of X and Y, is known as scatter diagram.if the plotted dots show some trend of upward or downward, then the two considering variables are said to be correlated.if the dots are close together and follows some trend of either increasing or decreasing,then there is a strong relation between them.

13.png)
correaltion
correlation is a statistical device designed to measure the degree of association between two or more variables.
types of correlation
- positive correlation
- negative correlation
- linear correlation
- non-linear correlation
- partial correlation
- multiple correlationn
measures of correlation
- graphic method or scatter diagram method
- karl Pearson's correlation coefficient
- Sparkman's correlation coefficient
graphic method or scatter diagram method
it is the sipmlest method of studying correlation between two variables.in this method,one variable's values are kept in X-axis and another variable's values are kept in Y-axis.the diagram formed,by plotting these pairs of values of X and Y , is known as scatter diagram.if the plotted dots show some trend of upward or downward, then the two considering variables are said to be correlated.if the dots are close together and follows some trend of either increasing or decreasing, then there is a strong relation between them.

karl Pearson's correlation coefficient
karl Pearson's correlation cofficient measures a degree of association between two variables only to extent to which it is linear. let X and Y are two variables, karl Pearson's correlation coefficient between X and Y is generally denoted by rxy or r(X,Y) or simply r only. it is also called product moment correlation coefficient or simple correlation cofficient or simply a correlation.it is defined as follows:
r=\{\frac{COV(X,Y)}{\{sqrt{var(X) \}{sqrt{var(Y)}\}
where, COV(X,Y) is read as covarience between X and Y. This measures the simultraneous changes between two variables.
and COV(X,Y) = 1/n\{sum{(X - \{overline{X})\} }\} (Y -\{overline{Y}\})
=1/n\{sum{XY - \{overline{X}\} \{overline{Y}\}}\}
properties of karl Pearson's correlation coefficient
- correlation cofficient (r) lies between -1 to +1.
- correlation cofficient (r) is the geomatric mean between two regression coefficients i.e. r= + -\{sqrt{byx.bxy}\}
where, byx = regression coefficient of regression line of Y on X
bxy =regression coefficient of regression line of xon X
- correlation cofficient is independent of change of origin as well as scale.
- correlation cofficient is a relative stastical measures.
- two independent variables are uncorrelated but the converse may not be true i.e. uncorrelated variables may not be independent.
example:
calculate the coefficient of correlation for the following data:
X | 2 | 3 | 4 | 5 | 6 |
Y | 7 | 9 | 10 | 14 | 15 |
solution:
X | Y | x=X -\{\overline{X}\} (\{\overline{X}\}= 4) | x2 | y= Y -\{\overline{y}\} (\{\overline{Y}\}= 11 ) | y2 | xy |
2 | 7 | -2 | 4 | -4 | 16 | 8 |
3 | 9 | -1 | 1 | -2 | 4 | 2 |
4 | 10 | 0 | 0 | -1 | 1 | 0 |
5 | 14 | 1 | 1 | 3 | 9 | 3 |
6 | 15 | 2 | 4 | 4 | 16 | 8 |
\{\sum{X}\}=20 | \{\sum{Y}\}=55 | \{\sum{x}\}=0 | \{\sum{x2}\}=10 | \{\sum{y}\}=0 | \{\sum{y2}\}=46 | \{\sum{xy}\}=21 |
we have,\{\overline{X}\} =\{\frac{\{\sum{X}}{n}\} =\{\frac{20}{5}\} = 4
\{\overline{Y}\} =\{\frac{\{\sum{Y}}{n}\} =\{\frac{55}{5}\= 11
now correlation coefficient, r =\{\frac{\{sum{xy}\} }{ \{sqrt{x2}\} \{sqrt{y2}\} }\} = \{\frac{21}{\{sqrt{10}\} \{sqrt{46}\} }\} =0.98
therefore, r= 0.98. this shows that there is almost perfect positive correlation between X and Y.
(References)
Chaudary, A.K. (2061).Business statistics. kathmandu:Bhundipuran Prakshan
Dhakal Bashanta (2014).Business Statistics,Buddha academic publisher
Lesson
Simple Linear Correlation
Subject
Business Statistics
Grade
Bachelor of Business Administration
Recent Notes
No recent notes.
Related Notes
No related notes.