Correlation Examples in Statistics

In statistics, the CorrelationCorrelationCorrelation is a statistical measure between two variables that is defined as a change in one variable corresponding to a change in the other. It is calculated as (x(i)-mean(x))*(y(i)-mean(y)) / ((x(i)-mean(x))2 * (y(i)-mean(y))2.read more mainly analyzes the strength of the relationship between the variables under consideration. However, it also measures if there is any linear relationship, i.e., linear, between the given sets of data and how well they could be related. One such common measure used in the field of statistics for correlation is the Pearson Correlation CoefficientPearson Correlation CoefficientPearson correlation coefficient measures the strength between the different variables and their relationships. Therefore, whenever any statistical test is conducted between the two variables, it is good to analyze the correlation coefficient value to know how strong the relationship between the two variables is.read more. The following correlation example provides an outline of the most common correlations.

Example #1

Vivek and Rupal are siblings. Rupal is older than Vivek by three years. Sanjeev, their father, is a statistician, and he was interested in researching the linear relationshipLinear RelationshipA linear relationship describes the relation between two distinct variables - x and y - in the form of a straight line on a graph. When presenting a linear relationship through an equation, the value of y is derived through the value of x, reflecting their correlation.read more between height and weight. Hence, since their birth, he noted their height and weight at various ages and arrived at the following:

He tries to identify any correlation between age, height, and weight, and is there any differentiation between them?

Solution:

We will first plot a scatter chart. We get below the result for Rupal’s and Vivek’s age, height, and weight.

As age increases, height increases, and weight increases, so there appears to be a positive relationship. In other words, height and age are positive correlations. Further, Sanjeev observed that weight fluctuates and is not stable. It could either increase or decrease marginally, but he observed a positive relationship between height and weight; weight also tends to increase when height increases.

Thus, he observed two crucial relationships here: with age – height increases, and weight increases with height increases. Hence, all three-carry positive correlations.

Example #2

John is excited about summer vacation. However, his parents are worried since the teenager would be sitting home and playing games on his mobile and switching on the air conditioner the whole time. They noted the various temperatures and the units consumed last year and found interesting data. And they wanted to anticipate their upcoming may month bill and expect the temperature to be near 40*C. Still, they want to know if there is any correlation between temperature and electricity bills.

Let’s analyze this as well through a chart.

We have plotted electricity bills and temperature and noted their various points. There appears to be a correlation between the temperature and the electricity bill when the temperature is cold. The electricity bill is under control, making sense as the family would use less air conditioner. As and when temperature increases, air conditioners, and geysers would increase, which would hit them with a higher cost, evident from the above graph where the electricity bill rises heavily.

Thus, we can conclude that there is no linear relationship, but yes, there is a positive correlation. Hence, the family can again expect a bill amount for may in the range of 6400 to 7000.

Example #3

Tom has started a new catering business, where he first analyzes the cost of making a sandwich and what price he should sell them. After talking to various cooks currently selling the sandwich, he has gathered the information below. Tom was convinced that there is a positive linear relationship between the number of sandwiches and the total cost of making them. Analyze if this statement is true.

Tom convinces a positive linear relationship between the number of sandwiches and the total cost of making them. Analyze if this statement is true.

After plotting the points between the number of sandwiches prepared and the cost of making them, there is a positive relationship.

And it can be seen from the above table that yes, there is a positive linear relationship. If one runs correlation, it will come to +1. Hence, as Tom makes more sandwiches, the cost will increase, and it appears valid as one will require the more the sandwich, the more vegetables, and so would bread be required. Hence, this has a positive perfect linear relationship based on the data.

Example #4

Rakesh has been investing in ABC stock for quite a long time. He wants to know whether ABC stock is a good hedge for the market as he has also invested in an ETFETFAn exchange-traded fund (ETF) is a security that contains many types of securities such as bonds, stocks, commodities, and so on, and that trades on the exchange like a stock, with the price fluctuating many times throughout the day when the exchange-traded fund is bought and sold on the exchange.read more fund that tracks a market indexMarket IndexA market index tracks the performance of a diverse selection of securities that make up a significant part of the financial market. It serves as an indicator of the overall financial market condition by listing the historical and real-time trends in different market segments. read more. He has gathered the below data for the past 12 monthly returns on the stock ABC and index.

Using correlation, identify the relationship ABC stock has with the market and whether it hedges the portfolio.

Using the correlation coefficient formulaCorrelation Coefficient FormulaCorrelation Coefficient, sometimes known as cross-correlation coefficient, is a statistical measure used to evaluate the strength of a relationship between 2 variables. Its values range from -1.0 (negative correlation) to +1.0 (positive correlation). read more below treating ABC stock price changes as x and changes in markets index as y, we get a correlation of -0.90.

It is a close-to-perfect negative correlationNegative CorrelationA negative correlation is an effective relationship between two variables in which the values of the dependent and independent variables move in opposite directions. For example, when an independent variable increases, the dependent variable decreases, and vice versa.read more. In other words, a negative relationship.

Therefore, as the market rises, the stock price of ABC falls, and when the market falls, the stock price of ABC rises. Hence, it is a good hedge for the portfolio.

Conclusion

One can conclude that there could be a correlation between two variables but not necessarily a linear relationship. For example, there could be exponential correlation or log correlation. Hence, if one gets a result stating a positive or negative correlation, then it should be judged by plotting the variables on the graph and finding out if there is truly any relationship or a spur correlation.

This article has been a guide to Correlation Examples in Statistics. Here, we discussed the various examples to understand the correlation between two variables, which can be positive or negative. You can learn more about financing from the following articles: –

  • Coefficient of Determination FormulaCorrelation vs. CovarianceNegative Correlation Definition