Access free live classes and tests on the app
Download
+
Unacademy
  • Goals
    • AFCAT
    • AP EAMCET
    • Bank Exam
    • BPSC
    • CA Foundation
    • CAPF
    • CAT
    • CBSE Class 11
    • CBSE Class 12
    • CDS
    • CLAT
    • CSIR UGC
    • GATE
    • IIT JAM
    • JEE
    • Karnataka CET
    • Karnataka PSC
    • Kerala PSC
    • MHT CET
    • MPPSC
    • NDA
    • NEET PG
    • NEET UG
    • NTA UGC
    • Railway Exam
    • SSC
    • TS EAMCET
    • UPSC
    • WBPSC
    • CFA
Login Join for Free
avtar
  • ProfileProfile
  • Settings Settings
  • Refer your friendsRefer your friends
  • Sign outSign out
  • Terms & conditions
  • •
  • Privacy policy
  • About
  • •
  • Careers
  • •
  • Blog

© 2023 Sorting Hat Technologies Pvt Ltd

CBSE Class 11 » CBSE Class 11 Study Materials » Commerce » Data Science
CBSE

Data Science

Data science, database meaning, and ways of ranking data sets are explained in the article. It includes details on Spearman's Rank Correlation and elucidates Correlation Coefficients, Pearson’s Correlation, Population Correlation Coefficient, and Sample Correlation Coefficient. This article also contains answers to commonly asked questions concerning assigning ranks to data.

Table of Content
  •  

Data science is the field that utilizes formulas, algorithms, and highly researched techniques to reveal information from raw or unstructured data. Though essentially a statistical or mathematical term, data science has been developed into an independent stream. This field has various career opportunities in the future like data scientists, data analysts, etc. Therefore, understanding the concepts of data science is essential. 

Database meaning: A database collects unstructured or raw data and stores it for future uses. It comprises data and the system used to manage it. Organizations protect their database by using the latest security features and technologies. 

Data sets are ranked to determine their order. In statistics, ranking means data transformation in numerical cells is replaced by their ranks when the data is sorted in ascending or descending order. When the data is sorted in ascending order, rank one is assigned to the smallest value. On the other hand, when the data is sorted in descending order, rank 1 is the most significant value. 

Correlation Coefficients

To calculate the strength of the relationship between 2 variables, we use correlation coefficients. The two most popular types of correlation coefficients are Pearson’s Correlation Coefficient and Spearman’s Correlation Coefficient. Pearson’s Correlation Coefficient is probably the first correlation coefficient taught to students who have started learning statistics. 

It is so often used that whenever someone refers to the correlation coefficient, it usually means talking about the Pearson’s Correlation Coefficient. It is generally denoted by R. 

What is Pearson’s Correlation?

Pearsons’ Correlation is the most used method to calculate the relationship between data sets. It shows the linear relationship between variables. In simpler terms, it indicates whether or not you can draw a line on a graph to represent the variables. 

The formula used to determine Pearson’s Correlation Coefficient is given below:

r=n(Σxy)−(Σx)(Σy)[nΣx2−(Σx)2][nΣy2−(Σy)2]

The other commonly used correlation coefficients are the Sample Correlation Coefficient and Population Correlation Coefficient. The formulas for these coefficients are given below:

Population Correlation Coefficient 

The Population Correlation Coefficient pxy is given below:

pxy=xyxy

Sample Correlation Coefficient 

The Sample Correlation Coefficient rxy is given below:

rxy=sxysxsy

Spearman’s Rank Correlation: Ranks are Given

When there is a monotonic relation between two variables,i.e., an increase in one would increase the other and vice versa, it becomes hard to identify the direction and strength of the monotonic relation. For example, while calculating Pearson’s Correlation Coefficient, we get the direction and strength of the linear association between the variables you are interested in. However, Spearman’s Rank Correlation indicates the limit to which two variables fluctuate together.

Before evaluating Spearman’s Rank Correlation, we need to rank the observations. It is essential because we have to identify whether increasing the value of one variable has a similar effect on the other,i.e., whether it rises or notAl; a comparison between 2 variables is mat each level ade before assigning ranks. For instance, if the variables are ranked from 1 to n (n is the number of variables), with the rank 1 being assigned to the highest value. The order of the set will be from highest to lowest case the same value repeats twice, the arithmetic mean or average of the ranks is assigned to them. For example, X represents the selling price values of a product. 

X = (22, 34, 24, 30, 28, 24) 

The descending order of the set, i.e., from highest to lowest, will be as follows:

34, 30, 28, 24, 24, 22 and their corresponding ranks will be as follows:

34 -> 1, 30 -> 2, 28 -> 3, and so on. 

However, the fourth and fifth rank cannot be given the same values. Therefore, as per the Spearman’s Rank Correlation, (4+5)/2 rank will be given to each value. The remaining values will have the same rank as they had before. The final ranks are listed below:

34 -> 1, 30 -> 2. 28 -> 3, 24 -> 4.5, 24 -> 4.5, and 22 -> 6. 

Spearman Rank Correlation Formula 

Spearman Rank Correlation is represented by either p or r. The formula for Spearman Rank Correlation is given below:

rR=1-6∑di2n(n2-1)

Where, 

di is the difference in the ranks of the ith observation of each random variable and n is the number of observations in the set. The value of Spearman Rank Coefficient can vary between -1 to +1.

If the value of p is -1, it means that there is a perfect negative association between the ranks. 

If the value of p is 0, it means that there is no association between the ranks.

If the value of p is 1, it means that there is a perfect positive association between the ranks. 

faq

Frequently asked questions

Get answers to the most common queries related to the CBSE Class 11 Examination Preparation.

Why is ranking data important?

Ans :Ranking is important in situations when the order of the observations is prioritized over the magnitude of the ...Read full

Find the Spearman’s Correlation Coefficient for the following data points.

Ans. English (Marks out of 100) ...Read full

Which is better for ranking variables, Person’s Correlation Coefficient or Spearman’s Correlation Coefficient?

Ans : Several shortcomings or issues can be encountered while working with the Pearson’s Correlation Coefficient. ...Read full

Ans :Ranking is important in situations when the order of the observations is prioritized over the magnitude of the differences between them. When there is little or no information regarding the distribution of data points, non-parametric statistics come into play and are used to define ranked data. Ranking is usually done for statistical purposes like finding countries with the highest population, ranking sports teams and their players, etc. 

Ans.

English (Marks out of 100) Math (Marks out of 100)
55 45
43 47
35 74
56 39

 

First, let’s find values as per the following table:

 

English  Math Rank (English Subject) Rank (Math Subject) di  d2 (d*d)
55 45 2 3 1 1
43 47 3 2 1 1
35 74 4 1 3 9
56 39 1 4 3 9

 

Summation of d2 is 1 + 1 + 9 + 9 = 20

 

 

We have the Spearson’s Correlation Formula as below:

 

rR=1-6∑di2n(n2-1)

 

If we substitute these values we get the following answer:

 

r = 1 – 6 (20) / 4 (16 -1) = 1 – (120/60) = 1 – 2 = -1

 

As the Spearson’s Correlation Coefficient is -1, it indicates a strong negative relationship between the ranks that the students obtained in Maths and English. 

 

  1. Give the rank of the following data from lowest to highest.

 

12, 14, 16, 18, 20, 22, 12, 16 

 

For calculating the final ranks, let’s first find the ranks as per the below table

 

Observations  Ascending order (LowestHighesthest)  Initial Ranks 
12 12 1
14 12 2
16 14 3
18 16 4
20 16 5
22 18 6
12 20 7
16 22 8

 

N ranks rank 1 and 2 are gi to equal numbers. Therefore, the final number. will be (1+2)/2 = 1.5. , Also rank 4 and 5 are again between similar equal numbers. Therefore, the lastinal ranks will be (4+5)/2 = 4.5. lastinal ranks of the numbers are now given below:

 

Observations Final Ranks 
12 1.5
12 1.5
14 3
16 4.5
16 4.5
18 6
20 7
22 8

 

Ans : Several shortcomings or issues can be encountered while working with the Pearson’s Correlation Coefficient. It determines a relationship between two variables but does not identify the difference between independent and dependent variables. 

Suppose that the relationship between eating chocolates and diabetes has a high correlation of  you would get the same result even if you switch the variables around. While we can say that eating more chocolates increases diabetes, we cannot say that an increase in diabetes results in higher chocolates. It does not make any sense. Therefore, it is less valuable when it comes to assigning ranks. Here is where Spearman’s Rank Correlation becomes more relevant as it also tells us how monotonous the relationship between two variants is based on their strength and direction. 

 

Crack K-12 with Unacademy

Get subscription and access unlimited live and recorded courses from India’s best educators

  • Structured syllabus
  • Daily live classes
  • Ask doubts
  • Tests & practice
Learn more

Notifications

Get all the important information related to the CBSE Class 11 Exam including the process of application, important calendar dates, eligibility criteria, exam centers etc.

Data Correction
Exam Pattern for Class 11th
Registration Process
Syllabus
See all

Related articles

Learn more topics related to Commerce
Spearman’s Rank Correlation

Spearman’s Rank Correlation gives the monotonic relationship and Pearson's Coefficient gives the linear relationship amidst two variables.

Rank Correlation Using Repeated Ranks

Rank Correlation using Repeated Ranks, Spearman's correlation, its formulas, and its merits and demerits.

Oversubscription

In oversubscription, applied debentures are more than offered debentures. This can be understood better with an accounting analysis of oversubscription, as discussed in this article.

Nature of Financial Statements

Financial statements are a group of large reports that summarise an organisation's financial performance, financial condition, and cash flows. The financial statements are generated utilising facts about financial occurrences that are recorded in chronological order. Every one of these financial transactions, as well as the nature of financial statement analysis, must be first recorded.

See all
Access more than

5,130+ courses for CBSE Class 11

Get subscription

Trending Topics

  • Withdrawal Slip
  • Wildlife Conservation
  • Moving Coil Galvanometer
  • Ogive Curves
  • PPT Full Form
  • Reordering Of Sentences
  • Central Problems Of An Economy
  • Transcription In Eukaryotes
combat_iitjee

Important Links

  • NCERT Solutions
  • NCERT Books
  • Physics Formulas
  • Maths Formulas
  • Chemistry Formulas
testseries_iitjee
Download NEET 2022 question paper
.
Company Logo

Unacademy is India’s largest online learning platform. Download our apps to start learning


Starting your preparation?

Call us and we will answer all your questions about learning on Unacademy

Call +91 8585858585

Company
About usShikshodayaCareers
we're hiring
BlogsPrivacy PolicyTerms and Conditions
Help & support
User GuidelinesSite MapRefund PolicyTakedown PolicyGrievance Redressal
Products
Learner appLearner appEducator appEducator appParent appParent app
Popular goals
IIT JEEUPSCSSCCSIR UGC NETNEET UG
Trending exams
GATECATCANTA UGC NETBank Exams
Study material
UPSC Study MaterialNEET UG Study MaterialCA Foundation Study MaterialJEE Study MaterialSSC Study Material

© 2025 Sorting Hat Technologies Pvt Ltd

Unacademy
  • Goals
    • AFCAT
    • AP EAMCET
    • Bank Exam
    • BPSC
    • CA Foundation
    • CAPF
    • CAT
    • CBSE Class 11
    • CBSE Class 12
    • CDS
    • CLAT
    • CSIR UGC
    • GATE
    • IIT JAM
    • JEE
    • Karnataka CET
    • Karnataka PSC
    • Kerala PSC
    • MHT CET
    • MPPSC
    • NDA
    • NEET PG
    • NEET UG
    • NTA UGC
    • Railway Exam
    • SSC
    • TS EAMCET
    • UPSC
    • WBPSC
    • CFA

Share via

COPY