4. Correlation Coefficient, Properties, Types, Important Formulas for Correlation Coefficient

Correlation Coefficient, Properties of the Correlation Coefficient, Important Formulas for Correlation Coefficient

In this post, we are going to discuss correlation coefficient, types, properties, importance and important formulas for Correlation Coefficient and also cover correlation coefficient with theoretical perspective.

Correlation Coefficient

The correlation coefficient is a statistical measure that calculates or quantifies the relationship or strength and direction of the linear relationship between two variables. In other words, it indicates how closely the values of two variables move together. Correlation Coefficient always denoted by the symbol “r” and it’s answer ranges between -1 and 1. It’s answer has five possibilities:

Types of Correlation

1. Perfectly Positive

1: Perfectly Positive: A correlation coefficient of +1 indicates a perfect positive linear relationship which means as one variable increases, the other variable also increases proportionally or vice versa.

Perfectly Positive Correlation

2. Positive

2: Positive: A correlation coefficient ranges between 0 and +1 indicates a positive relationship which means as one variable increases, the other also increases but not proportionally or vice versa.

Positive Correlation

3. Perfectly Negative

3: Perfectly Negative: A correlation coefficient of -1 indicates a perfect negative linear relationship which means as one variable increases, the other variable decreases proportionally or vice versa.

Perfectly Negative Correlation

4. Negative

4: Negative: A correlation coefficient ranges between 0 and -1 indicates a negative relationship which means as one variable increases, the other decreases but not proportionally or vice versa.

Negative Correlation

5. Zero

5: Zero: A correlation coefficient close to 0 or exactly 0 indicates a weak or no linear relationship between the variables.

Properties of the Correlation Coefficient

1. The correlation coefficient does not change the measurement scale. That is, if the height is expressed in meters or feet, the correlation coefficient does not change.

2. The sign of the correlation coefficient is the same as the covariance.

3. The linear correlation coefficient is a real number between −1 and 1. −1 ≤ r ≤ 1

4. The Correlation Coefficient -1 means a strong perfectly negative relation.

5. The Correlation Coefficient +1 means a perfectly positive relation.

6. The Correlation Coefficient 0<r<1 means a positive relation.

7. The Correlation Coefficient -1<r<0 means a negative relation.

8. The Correlation Coefficient 0 means no relation.

9. The Correlation Coefficient is symmetric, which means the Correlation between X and Y equals to Correlation Coefficient between Y and X.

10. Non-linearity property of the correlation coefficient is that it tells best about linear relations. In non-linear relations, it is inaccurate in prediction.

11. Causation property of the Correlation Coefficient is that it only quantifies the relationships, it does not tell the cause of the relationship.

12. It is geometric mean of two regression coefficients

    \[ r = \sqrt{(bxy) \times (byx)}\  \]

Importance

Correlation coefficient is widely used in data analysis, research, business predictions such as investments and many other fields you name it. One important reality must be considered that it does not mean that one variable cause changes to other variable, there may be other hidden factors or variables responsible for it so we can say that it does not imply causation.

Important Formulas for Correlation Coefficient

Formula 1

    \[  r_{xy} = \frac{\sum(X - \overline{X})(Y - \overline{Y})}{n \times S_{x}{\times S}_{y}}\  \]

Where

    \[ S_{x} \]

is standard deviation of X variable and

    \[ S_{y} \]

is standard deviation of Y variable.

    \[ S_{x} = \sqrt{\frac{\sum\left( X - \overline{X} \right)^{2}}{n}}\ \]

&

    \[\ S_{y} = \sqrt{\frac{\sum\left( Y - \overline{Y} \right)^{2}}{n}} \]

Formula 2

    \[ r_{xy} = \frac{\sum(X - \overline{X})(Y - \overline{Y})}{\sqrt{\sum\left( X - \overline{X} \right)^{2} \times \sum\left( Y - \overline{Y} \right)^{2}}} \]

Formula 3

    \[ r_{xy} = \frac{\sum XY - \frac{(\sum X)(\sum Y)}{n}}{\sqrt{\left\lbrack \sum X^{2} - \frac{(\sum X)^{2}}{n} \right\rbrack\left\lbrack \sum Y^{2} - \frac{(\sum Y)^{2}}{n} \right\rbrack}} \]

Formula 4

    \[ r_{xy} = \frac{n\sum XY - (\sum X)(\sum Y)}{\sqrt{\left( n\sum X^{2} - (\sum X)^{2} \right)\left( n\sum Y^{2} - (\sum Y)^{2} \right)}} \]

Formula 5

    \[ r_{xy} = \frac{\sum XY - n\overline{X}\overline{Y}}{\sqrt{\left( \sum X^{2} - {n\left( \overline{X} \right)}^{2} \right)\left( \sum Y^{2} - {n\left( \overline{Y} \right)}^{2} \right)}} \]

Formula 6

    \[ r_{xy} = \frac{\frac{\sum XY}{n} - \overline{X}\overline{Y}}{S_{x}{\times S}_{y}}\ Where\ \overline{X}\ \&\ \overline{Y}\ are\ mean\ of\ X\ \&\ Y\ values\  \]

    \[ Where\;S_x\;is\;Standard\;Deviation\;of\;X\;Variable \]

    \[ Where\;S_y\;is\;Standard\;Deviation\;of\;Y\;Variable \]

Their Formulas are Given Below:

    \[ S_{x} = \sqrt{\left\lbrack \frac{\sum X^{2}}{n} - \left( \overline{X} \right)^{2} \right\rbrack}\ \&\ S_{y} = \ \sqrt{\left\lbrack \frac{\sum Y^{2}}{n} - \left( \overline{Y} \right)^{2} \right\rbrack}\ \&\ \overline{X} = \frac{\sum X}{n},\ \overline{Y} = \frac{\sum Y}{n} \]

Formula 7

    \[ {r_{xy} = r}_{DxDy} =\]

    \[ \frac{\sum D_{x}D_{y} - \frac{(\sum D_{x})(\sum D_{y})}{n}}{\sqrt{\left\lbrack \sum{D_{x}}^{2} - \frac{\left( \sum D_{x} \right)^{2}}{n} \right\rbrack\left\lbrack \sum{D_{y}}^{2} - \frac{\left( \sum D_{y} \right)^{2}}{n} \right\rbrack}}\ \]

Formula 8

    \[ r_{xy} = \frac{n\sum D_{x}D_{y} - (\sum D_{x})(\sum D_{y})}{\sqrt{\left( n\sum{D_{x}}^{2} - \left( \sum D_{x} \right)^{2} \right)\left( n\sum{D_{y}}^{2} - \left( \sum D_{y} \right)^{2} \right)}}\ \]

    \[  Where\;D_x=X-A\;\&\;D_y=Y-B \]

Formula 9

    \[ {r_{xy} = r}_{uv} = \frac{\sum UV - \frac{(\sum U)(\sum V)}{n}}{\sqrt{\left\lbrack \sum U^{2} - \frac{(\sum U)^{2}}{n} \right\rbrack\left\lbrack \sum V^{2} - \frac{(\sum V)^{2}}{n} \right\rbrack}}\]

Formula 10

    \[ r_{xy} = \frac{n\sum UV - (\sum U)(\sum V)}{\sqrt{\left( n\sum U^{2} - (\sum U)^{2} \right)\left( n\sum V^{2} - (\sum V)^{2} \right)}}\ where\ U = \frac{(X - A)}{h}\ \&\ V = \frac{(Y - B)}{k} \]

Formula 11

    \[ r_{xy} = \pm \sqrt{\left( b_{yx} \right)\left( b_{xy} \right)}\ \]

    \[Where\;b_{yx}\;is\;Regression\;Coefficient\;of\;Y\;on\;X\;\&\]

    \[ Where\;b_{xy}\;is\;Regression\;Coefficient\;of\;X\;on\;Y \]

Introduction to Statistics Basic Important Concepts

Measures of Central Tendency, Arithmetic Mean, Median, Mode, Harmonic, Geometric Mean

Depreciation, Reasons of Depreciation, Methods of Depreciation, Straight Line/ Original Cost/Fixed Instalment Method, Diminishing/Declining/Reducing Balance Method.

Business Mathematics and Statistics, Solved Paper 2007, Punjab University, BCOM,ADCI

Business Mathematics and Statistics, Solved Paper 2008, Punjab University, BCOM,ADCI

Business Statistics and Mathematics, Solved Paper 2009, Punjab University, BCOM,ADCI

Business Statistics and Mathematics, Solved Paper 2010, Punjab University, BCOM,ADCI

Business Statistics and Mathematics Solved Paper 2011, Punjab University, BCOM, ADC I

1 thought on “4. Correlation Coefficient, Properties, Types, Important Formulas for Correlation Coefficient”

Leave a Comment

Your email address will not be published. Required fields are marked *