In statistics, canonical-correlation analysis (CCA), also called canonical variates analysis, is a way of inferring information from cross-covariance matrices. If we have two vectors X = (X1, ..., Xn) and Y = (Y1, ..., Ym) of random variables, and there are correlations among the variables, then canonical-correlation analysis will find linear combinations of X and Y which have maximum correlation with each other. T. R. Knapp notes that "virtually all of the commonly encountered parametric tests of significance can be treated as special cases of canonical-correlation analysis, which is the general procedure for investigating the relationships between two sets of variables." The method was first introduced by Harold Hotelling in 1936, although in the context of angles between flats the mathematical concept was published by Jordan in 1875.
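For orientation, here is a brief usage sketch (assuming scikit-learn's cross_decomposition.CCA; the synthetic data and all variable names are illustrative, not from the text): the fitted model returns the linear combinations of X and Y whose correlation is maximal.

```python
# A hedged sketch using scikit-learn's CCA estimator; the synthetic X, Y are made up.
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))                                # first set of variables
Y = X @ rng.normal(size=(3, 2)) + rng.normal(size=(500, 2))  # second, correlated set

cca = CCA(n_components=2).fit(X, Y)
X_c, Y_c = cca.transform(X, Y)                  # canonical variables (scores)
first_corr = np.corrcoef(X_c[:, 0], Y_c[:, 0])[0, 1]
print(round(first_corr, 3))                     # first canonical correlation
```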
Given two column vectors $X = (x_1, \dots, x_n)^T$ and $Y = (y_1, \dots, y_m)^T$ of random variables with finite second moments, one may define the cross-covariance $\Sigma_{XY} = \operatorname{cov}(X, Y)$ to be the $n \times m$ matrix whose $(i, j)$ entry is the covariance $\operatorname{cov}(x_i, y_j)$. In practice, we would estimate the covariance matrix based on sampled data from $X$ and $Y$ (i.e. from a pair of data matrices).
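As a concrete illustration, a minimal NumPy sketch (the data matrices X, Y and the sample size are made up for the example) estimates the sample cross-covariance from centered, paired data matrices:

```python
# Estimate Sigma_XY from paired samples: each row of X (N x n) and Y (N x m)
# is one joint observation of the two random vectors.
import numpy as np

rng = np.random.default_rng(0)
N = 500
X = rng.normal(size=(N, 3))                                  # n = 3 variables
Y = X @ rng.normal(size=(3, 2)) + rng.normal(size=(N, 2))    # m = 2 variables, correlated with X

Xc = X - X.mean(axis=0)                  # center each variable
Yc = Y - Y.mean(axis=0)
Sigma_XY = Xc.T @ Yc / (N - 1)           # (n x m) sample cross-covariance
print(Sigma_XY.shape)                    # (3, 2)
```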
Canonical-correlation analysis seeks vectors $a$ ($a \in \mathbb{R}^n$) and $b$ ($b \in \mathbb{R}^m$) such that the random variables $a^T X$ and $b^T Y$ maximize the correlation $\rho = \operatorname{corr}(a^T X, b^T Y)$. The (scalar) random variables $U = a^T X$ and $V = b^T Y$ are the first pair of canonical variables. Then one seeks vectors maximizing the same correlation subject to the constraint that they are to be uncorrelated with the first pair of canonical variables; this gives the second pair of canonical variables. This procedure may be continued up to $\min\{m, n\}$ times.
Let $\Sigma_{XY} = \operatorname{cov}(X, Y)$ be the cross-covariance matrix for any pair of (vector-shaped) random variables $X$ and $Y$. The target function to maximize is
$$\rho = \frac{a^T \Sigma_{XY} b}{\sqrt{a^T \Sigma_{XX} a}\,\sqrt{b^T \Sigma_{YY} b}}.$$
The first step is to define a change of basis
$$c = \Sigma_{XX}^{1/2} a, \qquad d = \Sigma_{YY}^{1/2} b,$$
where $\Sigma_{XX}^{1/2}$ and $\Sigma_{YY}^{1/2}$ can be obtained from the eigendecomposition (or by diagonalization):
$$\Sigma_{XX}^{1/2} = V_X D_X^{1/2} V_X^T, \qquad \Sigma_{XX} = V_X D_X V_X^T,$$
and
$$\Sigma_{YY}^{1/2} = V_Y D_Y^{1/2} V_Y^T, \qquad \Sigma_{YY} = V_Y D_Y V_Y^T.$$
Thus
$$\rho = \frac{c^T \Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1/2} d}{\sqrt{c^T c}\,\sqrt{d^T d}}.$$
By the Cauchy–Schwarz inequality, we have
$$\left(c^T \Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1/2}\right) d \le \left(c^T \Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1/2} \Sigma_{YY}^{-1/2} \Sigma_{YX} \Sigma_{XX}^{-1/2} c\right)^{1/2} \left(d^T d\right)^{1/2},$$
$$\rho \le \frac{\left(c^T \Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1} \Sigma_{YX} \Sigma_{XX}^{-1/2} c\right)^{1/2}}{\left(c^T c\right)^{1/2}}.$$
There is equality if the vectors $d$ and $\Sigma_{YY}^{-1/2} \Sigma_{YX} \Sigma_{XX}^{-1/2} c$ are collinear. In addition, the maximum of correlation is attained if $c$ is the eigenvector with the maximum eigenvalue for the matrix $\Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1} \Sigma_{YX} \Sigma_{XX}^{-1/2}$ (see Rayleigh quotient).
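Putting the derivation together, the following NumPy sketch (the function name cca and the synthetic data are illustrative assumptions, not part of the source) whitens $X$ and $Y$ with $\Sigma_{XX}^{-1/2}$ and $\Sigma_{YY}^{-1/2}$ and takes the SVD of the whitened cross-covariance; the singular values are the canonical correlations, and the singular vectors map back to the weight vectors $a$ and $b$:

```python
import numpy as np

def cca(X, Y):
    """Canonical correlations and weight vectors for data matrices X (N x n) and Y (N x m)."""
    N = X.shape[0]
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    Sxx = Xc.T @ Xc / (N - 1)          # estimate of Sigma_XX
    Syy = Yc.T @ Yc / (N - 1)          # estimate of Sigma_YY
    Sxy = Xc.T @ Yc / (N - 1)          # estimate of Sigma_XY

    def inv_sqrt(S):
        # S^{-1/2} via the eigendecomposition of the symmetric matrix S
        w, V = np.linalg.eigh(S)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    Sxx_is, Syy_is = inv_sqrt(Sxx), inv_sqrt(Syy)
    M = Sxx_is @ Sxy @ Syy_is          # whitened cross-covariance
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    A = Sxx_is @ U                     # columns are the weight vectors a_i
    B = Syy_is @ Vt.T                  # columns are the weight vectors b_i
    return s, A, B                     # s[i] is the (i+1)-th canonical correlation

# Usage on synthetic data (same construction as the earlier sketch).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
Y = X @ rng.normal(size=(3, 2)) + rng.normal(size=(500, 2))
rho, A, B = cca(X, Y)
print(np.round(rho, 3))                # canonical correlations, largest first
```

The SVD route is equivalent to the eigendecomposition described above: the left singular vectors of the whitened cross-covariance $M$ are the eigenvectors of $M M^T = \Sigma_{XX}^{-1/2} \Sigma_{XY} \Sigma_{YY}^{-1} \Sigma_{YX} \Sigma_{XX}^{-1/2}$, and the squared singular values are its eigenvalues.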
Dimensionality reduction, or dimension reduction, is the transformation of data from a high-dimensional space into a low-dimensional space so that the low-dimensional representation retains some meaningful properties of the original data, ideally close to its intrinsic dimension. Working in high-dimensional spaces can be undesirable for many reasons: raw data are often sparse as a consequence of the curse of dimensionality, and analyzing the data is usually computationally intractable.
In linear algebra, an eigenvector (/ˈaɪɡənˌvɛktər/) or characteristic vector of a linear transformation is a nonzero vector that changes at most by a constant factor when that linear transformation is applied to it. The corresponding eigenvalue, often represented by $\lambda$, is the multiplying factor. Geometrically, a transformation matrix rotates, stretches, or shears the vectors it acts upon. The eigenvectors of a linear transformation matrix are the set of vectors that are only stretched, with no rotation or shear.
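To make the definition concrete, a small NumPy check (the matrix A is a hypothetical example): applying A to one of its eigenvectors only rescales it by the corresponding eigenvalue.

```python
import numpy as np

A = np.array([[2.0, 0.0],
              [0.0, 3.0]])            # stretches the x-axis by 2 and the y-axis by 3
w, V = np.linalg.eig(A)               # eigenvalues w and eigenvectors (columns of V)
v = V[:, 0]
print(np.allclose(A @ v, w[0] * v))   # True: A only rescales v, no rotation or shear
```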
Principal component analysis (PCA) is a popular technique for analyzing large datasets containing a high number of dimensions/features per observation, increasing the interpretability of data while preserving the maximum amount of information, and enabling the visualization of multidimensional data. Formally, PCA is a statistical technique for reducing the dimensionality of a dataset. This is accomplished by linearly transforming the data into a new coordinate system where (most of) the variation in the data can be described with fewer dimensions than the initial data.
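A minimal PCA sketch, assuming NumPy and synthetic data (the names X, scores, and k are illustrative): center the data, take the SVD, and keep the top-k right-singular vectors as the new coordinate system.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))                 # 200 observations, 5 features
Xc = X - X.mean(axis=0)                       # center each feature
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 2
scores = Xc @ Vt[:k].T                        # data expressed in the top-2 principal directions
explained = s[:k] ** 2 / np.sum(s ** 2)       # fraction of total variance per kept component
print(scores.shape, np.round(explained, 3))
```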