banner



How To Find The Sample Correlation Coefficient

Correlation Coefficient Formula

In statistics, correlation is a way of establishing the relationship/association between two variables. In other words, the correlation coefficient formula helps in calculating the correlation coefficient which measures the dependency of one variable on the other variable. Correlation is measured numerically using the correlation coefficient. The correlation coefficient lies between -1 and 1. A negative correlation coefficient indicates that the human relationship between two variables is inverse. A positive correlation coefficient indicates that the value of ane variable depends on the other variable directly. A cypher-correlation coefficient indicates that in that location is no correlation between both variables. There are many types of correlation coefficients, among them, the Pearson Correlation Coefficient (PCC) is the almost common ane. Permit us explore how to calculate the correlation coefficient formula for a given population or sample beneath.

What Is the Correlation Coefficient Formula?

The correlation coefficient is a statistical concept. Information technology establishes a relation between predicted and actual values obtained at the end of a statistical experiment. The correlation coefficient formula helps to calculate the relationship betwixt two variables and thus the upshot so obtained explains the carefulness between the predicted and actual values.

Correlation Coefficient Formula

Pearson Correlation Coefficient Formula:

1. Sample Correlation Coefficient

The formula for pearson correlation coefficient for population of sizeN(written equallyρX, Y ) is given as:

\(\rho_{10, Y}=\frac{\operatorname{cov}(X, Y)}{\sigma_{10} \sigma_{Y}}=\dfrac{\sum_{i=1}^{north}\left(X_{i}-\bar{X}\correct)\left(Y_{i}-\bar{Y}\correct)}{\sqrt{{\Sigma}_{\Sigma_{i}=i}^{north}\left(X_{i}-\bar{Ten}\right)^{two}} \sqrt{{\sum}_{\Sigma_{i}=1}^{n}\left(Y_{i}-\bar{Y}\right)^{two}}}\)

where covis the covariance and (cov(Ten,Y)= \(\frac{\sum_{i=ane}^{Northward}\left(X_{i}-\bar{Ten}\right)\left(Y_{i}-\bar{Y}\right)}{Northward}\), σX  is standard divergence of XandσY  is standard divergence of Y.

Given X and Y are two random variables.

2. Population Correlation Coefficient

The formula for pearson correlation coefficient for sample of sizedue north (written asrxy ) is given as:

\(r_{x, y}=\frac{\sum_{i=1}^{north}\left(x_{i}-\bar{x}\correct)\left(y_{i}-\bar{y}\correct)}{\sqrt{\Sigma}_{\Sigma_{i}=1}^{northward}\left(x_{i}-\bar{x}\correct)^{2} \sqrt{\sum}_{\Sigma_{i}=1}^{n}\left(y_{i}-\bar{y}\right)^{2}}\)

wherenis the sample size,xi&yiare the ith sample points and&ȳare the sample ways for the random variables X and Y respectively.

Given X and Y are ii random variables.

3. Linear Correlation Coefficient

It uses pearson's correlation coefficient to determine the linear relationship between two variables. Its value lies between -1 and 1. It is given every bit:

\(r=\frac{n(\Sigma x y)-(\Sigma ten)(\Sigma y)}{\sqrt{\left[n \Sigma ten^{2}-(\Sigma x)^{2}\right]\left[n \Sigma y^{2}-(\Sigma y)^{two}\correct]}}\)

wherenis the sample size,xi&yiare the ithursday sample points and&ȳare the sample means for the random variables x and y respectively.

The sign of r indicates the strength of the linear relationship between the variables.

  • If r is almost 1, and then the ii variables have a potent linear relationship.
  • If r is near 0, and then the two variables have no linear relation.
  • If r is near -1, then the two variables have a weak (negative) linear human relationship.

Suspension downwardly tough concepts through uncomplicated visuals.

Math will no longer be a tough subject, especially when you understand the concepts through visualizations with Cuemath.

Book a Costless Trial Class

Let us see the applications of the correlation coefficient formula in the post-obit section.

Examples using Correlation Coefficient Formula

Case 1.Given the following population data. Find the Pearson correlation coefficient between x and y for this data. (Accept 1√7 equally 0.378)

x 600 800 thou
y 1200 thou 2000

Solution:

To simplify the calculation, we divide both x and y past 100.

x/100 y/100 \(x_{i}-\bar{x}\) \(y_{i}-\bar{y}\) \(\left(\mathrm{ten}_{\mathrm{i}}-\overline{\mathrm{x}}\right)^{ii}\) \(\left(\mathrm{y}_{\mathrm{i}}-\overline{\mathrm{y}}\right)^{2}\) \((x_i - \bar{x})(y_i - \bar{y})\)
6 12 -two -ii four 4 4
viii 10 0 -4 0 16 0
10 20 2 vi 4 36 12
\(\bar{x}=8\) \(\bar{y}=14\) \(\Sigma\left(x_{i}-\bar{ten}\correct)^{2}=8\) \(\Sigma\left(y_{i}-\bar{y}\correct)^{2}=56\) \(\Sigma(x_i-\bar{x})(y_i - \bar{y}) = 16\)

Using the correlation coefficient formula,

Pearson correlation coefficient for population = \(\frac{\Sigma\left(x_{i}-\bar{x}\right)\left(y_{i}-\bar{y}\right)}{\sqrt{\Sigma\left(x_{i}-\bar{10}\right)^{2} \sum\left(y_{i}-\bar{y}\right)^{2}}}\) = \(\frac{sixteen}{\sqrt{8} \sqrt{56}}\) = \(\frac{ii}{\sqrt{vii}}\) = 0.756

Respond: Pearson correlation coefficient = 0.756

Example 2. A survey was conducted in your city. Given is the following sample data containing a person'southward historic period and their corresponding income. Detect out whether the increment in age has an effect on income using the correlation coefficient formula. (Use i√181 as 0.074 and 1√2091 as 0.07)

Age 25 30 36 43
Income 30000 44000 52000 70000

Solution:

To simplify the calculation, we divide y by 1000.

Age (xi) Income/one thousand (yi/yard) \(x_{i}-\bar{10}\) \(y_{i}-\bar{y}\) \(\left(\mathrm{10}_{\mathrm{i}}-\overline{\mathrm{x}}\right)^{2}\) \(\left(\mathrm{y}_{\mathrm{i}}-\overline{\mathrm{y}}\right)^{two}\) \((x_i - \bar{10})(y_i - \bar{y})\)
25 30 -8.5 -19 72.25 361 161.5
30 44 -3.5 -5 12.25 25 17.5
36 52 2.5 3 half dozen.25 9 7.5
43 70 nine.five 21 90.25 441 199.5
\(\bar{x}=33.5\) \(\bar{y}=49\) \(\Sigma\left(x_{i}-\bar{ten}\right)^{2}=181\) \(\Sigma\left(y_{i}-\bar{y}\right)^{2}=836\) \(\Sigma(x_i-\bar{x})(y_i - \bar{y}) = 386\)

Pearson correlation coefficient for sample = \(\frac{\Sigma\left(x_{i}-\bar{10}\right)\left(y_{i}-\bar{y}\right)}{\sqrt{\Sigma\left(x_{i}-\bar{x}\correct)^{two} \sum\left(y_{i}-\bar{y}\right)^{2}}}\) = \(\frac{386}{\sqrt{181} \sqrt{836}}\) = \(\frac{193}{\sqrt{181} \sqrt{209}}\) = 0.99

Answer: Yes, with the increase in age a person's income increases as well, since the Pearson correlation coefficient between age and income is very close to one.

Example 3: Calculate the Correlation coefficient of given data.

ten 41 42 43 44 45
y 3.2 3.3 3.4 3.5 three.six

Solution:

Hither northward = 5

Permit us find ∑x , ∑y, ∑xy, ∑10 ii, ∑yii

10 y xy x2 ytwo
41 3.2 131.2 1681 10.24
42 3.3 138.half-dozen 1764 10.89
43 iii.iv 146.2 1849 xi.56
44 3.5 154 1936 12.25
45 3.6 162 2025 12.96
∑x = 215 ∑y = 17 ∑xy = 732 ∑x2 = 9255 ∑yii = 57.9

 values:

∑x = 215

∑x2 = 9255

x̄  = 43

∑(x - x̄)2 = σσ10=10

Y values:

∑y = 17

∑yii = 57.ix

∑(y - ȳ)2 =σσy= 0.1

X and Y combined

N = 5

∑((x - x̄)(y - ȳ)) = one

∑xy = 732

R calculation:

r = ∑((x - x̄)(y - ȳ))/√((σσx)(σσy))

r = one/√((10)(0.ane)) = one

Since r = 1, this indicates pregnant relation between x and y.

FAQs on Correlation Coefficient Formula

What Is Correlation Coefficient Formula in Statistics?

The correlation coefficient formula determines the relationship between two variables in a dataset and thus checks for the exactness between the predicted and actual values.

How To Utilize Correlation Coefficient Formula?

Nosotros can use the coefficient correlation formula to summate the Pearson product-moment correlation,

  • Stride i: Determine the covariance of the ii given variables.
  • Stride ii: Summate the standard difference of each variable.
  • Step 3: Divide the covariance by the product of the standard deviations of ii variables.

What Is due north in the Correlation Coefficient Formula?

In the coefficient correlation formula, n refers to the sample size.

  • Sample Correlation Coefficient: \(\rho_{X, Y}=\frac{\operatorname{cov}(10, Y)}{\sigma_{X} \sigma_{Y}}=\frac{\Sigma_{i=1}^{Due north}\left(X_{i}-\bar{10}\correct)\left(Y_{i}-\bar{Y}\right)}{\sqrt{\Sigma_{i=1}^{N}\left(X_{i}-\bar{X}\right)^{2}}^{Northward} \sum_{\Sigma_{i}=1}^{N}\left(Y_{i}-\bar{Y}\right)^{2}}\)
  • Population Correlation Coefficient: \(r_{x, y}=\frac{\sum_{i=i}^{n}\left(x_{i}-\bar{x}\right)\left(y_{i}-\bar{y}\correct)}{\sqrt{\Sigma}_{\Sigma_{i}=1}^{n}\left(x_{i}-\bar{x}\correct)^{2} \sqrt{\sum}_{\Sigma_{i}=1}^{northward}\left(y_{i}-\bar{y}\right)^{2}}\)
  • Linear Correlation Coefficient: \(r=\frac{northward(\Sigma x y)-(\Sigma x)(\Sigma y)}{\sqrt{\left[north \Sigma x^{2}-(\Sigma x)^{2}\right]\left[n \Sigma y^{ii}-(\Sigma y)^{2}\right]}}\)

What Are the Applications of Correlation Coefficient Formula?

Given below are the most important applications of the coefficient correlation formula:

  • The coefficient correlation formula helps in the analysis of the given data past quantifying the caste to which two variables are related which further depicts a linear human relationship between two variables.
  • It is used for financial analysis as it determines the human relationship between data sets in business and thus, in a way support determination making.
  • It helps a lot in decision-making in various fields every bit information technology helps to understand the strength of the relationship between two different variables.

Source: https://www.cuemath.com/correlation-coefficient-formula/

Posted by: kellerflust1973.blogspot.com

0 Response to "How To Find The Sample Correlation Coefficient"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel