**Are you an EPFL student looking for a semester project?**

Work with us on data science and visualisation projects, and deploy your project as an app on top of GraphSearch.

Concept# Spearman's rank correlation coefficient

Summary

In statistics, Spearman's rank correlation coefficient or Spearman's ρ, named after Charles Spearman and often denoted by the Greek letter (rho) or as , is a nonparametric measure of rank correlation (statistical dependence between the rankings of two variables). It assesses how well the relationship between two variables can be described using a monotonic function.
The Spearman correlation between two variables is equal to the Pearson correlation between the rank values of those two variables; while Pearson's correlation assesses linear relationships, Spearman's correlation assesses monotonic relationships (whether linear or not). If there are no repeated data values, a perfect Spearman correlation of +1 or −1 occurs when each of the variables is a perfect monotone function of the other.
Intuitively, the Spearman correlation between two variables will be high when observations have a similar (or identical for a correlation of 1) rank (i.e. relative position label of the observations within the variable: 1st, 2nd, 3rd, etc.) between the two variables, and low when observations have a dissimilar (or fully opposed for a correlation of −1) rank between the two variables.
Spearman's coefficient is appropriate for both continuous and discrete ordinal variables. Both Spearman's and Kendall's can be formulated as special cases of a more general correlation coefficient.
The Spearman correlation coefficient is defined as the Pearson correlation coefficient between the rank variables.
For a sample of size n, the n raw scores are converted to ranks , and is computed as
where
denotes the usual Pearson correlation coefficient, but applied to the rank variables,
is the covariance of the rank variables,
and are the standard deviations of the rank variables.
Only if all n ranks are distinct integers, it can be computed using the popular formula
where
is the difference between the two ranks of each observation,
n is the number of observations.
Consider a bivariate sample with corresponding ranks .

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related concepts (12)

Related people (35)

Related courses (32)

Related publications (272)

Related units (10)

Kendall rank correlation coefficient

In statistics, the Kendall rank correlation coefficient, commonly referred to as Kendall's τ coefficient (after the Greek letter τ, tau), is a statistic used to measure the ordinal association between two measured quantities. A τ test is a non-parametric hypothesis test for statistical dependence based on the τ coefficient. It is a measure of rank correlation: the similarity of the orderings of the data when ranked by each of the quantities.

Nonparametric statistics

Nonparametric statistics is the type of statistics that is not restricted by assumptions concerning the nature of the population from which a sample is drawn. This is opposed to parametric statistics, for which a problem is restricted a priori by assumptions concerning the specific distribution of the population (such as the normal distribution) and parameters (such the mean or variance).

Rank correlation

In statistics, a rank correlation is any of several statistics that measure an ordinal association—the relationship between rankings of different ordinal variables or different rankings of the same variable, where a "ranking" is the assignment of the ordering labels "first", "second", "third", etc. to different observations of a particular variable. A rank correlation coefficient measures the degree of similarity between two rankings, and can be used to assess the significance of the relation between them.

, , , , , , , , ,

This course is an introduction to quantitative risk management that covers standard statistical methods, multivariate risk factor models, non-linear dependence structures (copula models), as well as p

This course is an introduction to the non-perturbative bootstrap approach to Conformal Field Theory and to the Gauge/Gravity duality, emphasizing the fruitful interplay between these two ideas.

Introduction à la théorie des transitions de phase

Marcos Rubinstein, Hamidreza Karami

This paper presents a comparison of machine learning (ML) methods used for three-dimensional localization of partial discharges (PD) in a power transformer tank. The study examines ML and deep learning (DL) methods, ranging from support vector machines (SV ...

2024Related lectures (148)

Background: Quantification of the T2 signal by means of T2 mapping in acute pancreatitis (AP) has the potential to quantify the parenchymal edema. Quantitative T2 mapping may overcome the limitations of previously reported scoring systems for reliable asse ...

Michael Herzog, Simona Adele Garobbio

Using batteries of visual tests, most studies have found that there are only weak correlations between the performance levels of the tests. Factor analysis has confirmed these results. This means that a participant excelling in one test may rank low in ano ...

2024Determinantal Point Processes and Extrapolation

Covers determinantal point processes, sine-process, and their extrapolation in different spaces.

Autocorrelation and PeriodicityENV-400: Air pollution and climate change

Explores autocorrelation, periodicity, and spurious correlations in time series data, emphasizing the importance of understanding underlying processes and cautioning against misinterpretation.

Renormalization Group: Ising ModelPHYS-316: Statistical physics II

Explores the renormalization group applied to the Ising model and the derivation of the central limit theorem.