Concept

Dimensionality reduction

Dimensionality reduction, or dimension reduction, is the transformation of data from a high-dimensional space into a low-dimensional space so that the low-dimensional representation retains some meaningful properties of the original data, ideally with a dimension close to the data's intrinsic dimension. Working in high-dimensional spaces can be undesirable for many reasons: raw data are often sparse as a consequence of the curse of dimensionality, and analyzing the data is usually computationally intractable. Dimensionality reduction is common in fields that deal with large numbers of observations and/or large numbers of variables, such as signal processing, speech recognition, neuroinformatics, and bioinformatics.

Methods are commonly divided into linear and nonlinear approaches. Approaches can also be divided into feature selection and feature extraction. Dimensionality reduction can be used for noise reduction, data visualization, cluster analysis, or as an intermediate step to facilitate other analyses.

Feature selection

Feature selection approaches try to find a subset of the input variables (also called features or attributes). The three strategies are: the filter strategy (e.g. information gain), the wrapper strategy (e.g. search guided by accuracy), and the embedded strategy (features are added or removed while building the model, based on prediction errors). Data analysis such as regression or classification can often be done in the reduced space more accurately than in the original space.

Feature extraction

Feature projection (also called feature extraction) transforms the data from the high-dimensional space to a space of fewer dimensions. The data transformation may be linear, as in principal component analysis (PCA), but many nonlinear dimensionality reduction techniques also exist. For multidimensional data, tensor representation can be used in dimensionality reduction through multilinear subspace learning.
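
As an illustration of the linear case mentioned above, the following is a minimal sketch of PCA computed through a singular value decomposition of the centered data. It is not taken from the source page: the function name pca_project, the synthetic data, and the choice of NumPy are all assumptions made for the example.

```python
import numpy as np


def pca_project(X, n_components):
    """Project the rows of X onto the top principal components (hypothetical helper)."""
    mean = X.mean(axis=0)
    X_centered = X - mean
    # Rows of Vt are the principal directions, ordered by decreasing singular value.
    U, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    components = Vt[:n_components]
    # Sample variance captured along each retained direction.
    variances = (S[:n_components] ** 2) / (X.shape[0] - 1)
    return X_centered @ components.T, components, variances, mean


rng = np.random.default_rng(0)
# Synthetic data (an assumption): 200 points in 5-D lying close to a 2-D plane.
latent = rng.normal(size=(200, 2))
mixing = rng.normal(size=(2, 5))
X = latent @ mixing + 0.01 * rng.normal(size=(200, 5))

Z, components, variances, mean = pca_project(X, n_components=2)
X_reconstructed = Z @ components + mean
print(Z.shape)
print(np.abs(X - X_reconstructed).max())
```

Because the synthetic data are nearly two-dimensional, two components should reconstruct them with only a small residual; on real data the number of components to keep is a modelling decision.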

Official source

https://en.wikipedia.org/wiki/Dimensionality_reduction

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related courses (32)

PHYS-467: Machine learning for physicists

Machine learning and data analysis are becoming increasingly central in sciences including physics. In this course, fundamental principles and methods of machine learning will be introduced and practi

DH-406: Machine learning for DH

This course aims to introduce the basic principles of machine learning in the context of the digital humanities. We will cover both supervised and unsupervised learning techniques, and study and imple

CS-433: Machine learning

Machine learning methods are becoming increasingly central in many sciences and applications. In this course, fundamental principles and methods of machine learning will be introduced, analyzed and pr

Related lectures (32)

Dimensionality Reduction: PCA & LDA

Covers PCA and LDA for dimensionality reduction, explaining variance maximization, eigenvector problems, and the benefits of Kernel PCA for nonlinear data.

Neural Networks Recap: Activation Functions

Covers the basics of neural networks, activation functions, training, image processing, CNNs, regularization, and dimensionality reduction methods.

Understanding Autoencoders

Explores autoencoders, from linear mappings in PCA to nonlinear mappings, deep autoencoders, and their applications.

Related publications (30)

Relaxing the Additivity Constraints in Decentralized No-Regret High-Dimensional Bayesian Optimization

Patrick Thiran

Bayesian Optimization (BO) is typically used to optimize an unknown function f that is noisy and costly to evaluate, by exploiting an acquisition function that must be maximized at each optimization step. Even if provably asymptotically optimal BO algorith ...

2024

Learning the intrinsic dynamics of spatio-temporal processes through Latent Dynamics Networks

Alfio Quarteroni, Francesco Regazzoni, Stefano Pagani

Predicting the evolution of systems with spatio-temporal dynamics in response to external stimuli is essential for scientific progress. Traditional equations-based approaches leverage first principles through the numerical approximation of differential equ ...

Nature Portfolio, 2024

High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization

Volkan Cevher, Fanghui Liu

This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a bias-variance decomposi ...

2024

About this result

Ontological neighbourhood

Information engineering

Data science: Dimensionality reduction

Related people (9)

Volkan Cevher

Volkan Cevher received the B.Sc. (valedictorian) in electrical engineering from Bilkent University in Ankara, Turkey, in 1999 and the Ph.D. in electrical and computer engineering from the Georgia Institute of Technology in Atlanta, GA, in 2005. He was a Research Scientist with the University of Maryland, College Park, from 2006 to 2007 and also with Rice University in Houston, TX, from 2008 to 2009. Currently, he is an Associate Professor at the Swiss Federal Institute of Technology Lausanne and a Faculty Fellow in the Electrical and Computer Engineering Department at Rice University. His research interests include machine learning, signal processing theory, optimization theory and methods, and information theory. Dr. Cevher is an ELLIS fellow and was the recipient of the Google Faculty Research award in 2018, the IEEE Signal Processing Society Best Paper Award in 2016, a Best Paper Award at CAMSAP in 2015, a Best Paper Award at SPARS in 2009, and an ERC CG in 2016 as well as an ERC StG in 2011.

Pascal Frossard

Jean-Philippe Thiran

Jean-Philippe Thiran was born in Namur, Belgium, in August 1970. He received the Electrical Engineering degree and the PhD degree from the Université catholique de Louvain (UCL), Louvain-la-Neuve, Belgium, in 1993 and 1997, respectively. From 1993 to 1997, he was the co-ordinator of the medical image analysis group of the Communications and Remote Sensing Laboratory at UCL, mainly working on medical image analysis. Dr Jean-Philippe Thiran joined the Signal Processing Institute (ITS) of the Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, in February 1998 as a senior lecturer. He was promoted to Assistant Professor in 2004 and to Associate Professor in 2011, and has been a Full Professor since 2020. He also holds a 20% position at the Department of Radiology of the University of Lausanne (UNIL) and of the Lausanne University Hospital (CHUV) as Associate Professor ad personam. Dr Thiran's current scientific interests include computational medical imaging (acquisition, reconstruction and analysis of imaging data, with emphasis on regularized linear inverse problems: compressed sensing, convex optimization), with applications to medical imaging (diffusion MRI, ultrasound imaging, inverse planning in radiotherapy, etc.), and computer vision and machine learning (image and video analysis), with application to facial expression recognition, eye tracking, lip reading, industrial inspection, medical image analysis, etc.

Pierre Vandergheynst

Pierre Vandergheynst received the M.S. degree in physics and the Ph.D. degree in mathematical physics from the Université catholique de Louvain, Louvain-la-Neuve, Belgium, in 1995 and 1998, respectively. From 1998 to 2001, he was a Postdoctoral Researcher with the Signal Processing Laboratory, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland. He was Assistant Professor at EPFL (2002-2007), where he is now a Full Professor of Electrical Engineering and, by courtesy, of Computer and Communication Sciences. As of 2015, Prof. Vandergheynst serves as EPFL’s Vice-Provost for Education. His research focuses on harmonic analysis, sparse approximations and mathematical data processing in general with applications covering signal, image and high dimensional data processing, computer vision, machine learning, data science and graph-based data processing. He was co-Editor-in-Chief of Signal Processing (2002-2006), Associate Editor of the IEEE Transactions on Signal Processing (2007-2011), the flagship journal of the signal processing community and currently serves as Associate Editor of Computer Vision and Image Understanding and SIAM Imaging Sciences. He has been on the Technical Committee of various conferences, serves on the steering committee of the SPARS workshop and was co-General Chairman of the EUSIPCO 2008 conference. Pierre Vandergheynst is the author or co-author of more than 70 journal papers, one monograph and several book chapters. He has received two IEEE best paper awards. Professor Vandergheynst is a laureate of the Apple 2007 ARTS award and of the 2009-2010 De Boelpaepe prize of the Royal Academy of Sciences of Belgium.

Jan Sickmann Hesthaven

Prof. Hesthaven received an M.Sc. in computational physics from the Technical University of Denmark (DTU) in August 1991. During his studies, he spent the last six months of 1989 at JET, the European fusion laboratory in Culham, UK. Following graduation, he was awarded a three-year fellowship to begin work towards a Ph.D. at Riso National Laboratory in the Department of Optics and Fluid Dynamics. During these three years of study, he spent the academic year 1993-1994 in the Division of Applied Mathematics at Brown University and three months during the summer of 1994 in the Department of Mathematics and Statistics at the University of New Mexico. In August 1995, he received a Ph.D. in Numerical Analysis from the Institute of Mathematical Modelling (DTU). Following graduation in August 1995, he was awarded an NSF Postdoctoral Fellowship in Advanced Scientific Computing and was appointed Visiting Assistant Professor in the Division of Applied Mathematics at Brown University. In December 1996, he was appointed consultant to the Institute of Computer Applications in Science and Engineering (ICASE) at NASA Langley Research Center (NASA LaRC). In July 1999, he was appointed Assistant Professor of Applied Mathematics; in September 2000 he was awarded an Alfred P. Sloan Fellowship; in July 2001 he was awarded a Manning Assistant Professorship; and in March 2002 he was awarded an NSF Career Award. In January 2003, he was promoted to Associate Professor of Applied Mathematics with tenure, and in May 2004 he was awarded the Philip J. Bray Award for Excellence in Teaching in the Sciences (the highest award given for teaching excellence in all sciences at Brown University). He was promoted to Professor of Applied Mathematics as of July 2005. From October 2006 to June 2013, he was the Founding Director of the Center for Computation and Visualization (CCV) at Brown University. As of October 2007, he holds the (honorary) title of Professor (Adjunct) at the Technical University of Denmark. In November 2009, he successfully defended his dr.techn. thesis at the Technical University of Denmark and was awarded the degree of Doctor Technices, the highest academic distinction, awarded based on ... substantial and lasting contributions that have helped to move the research area forward and penetrated into applications. As grant Co-PI, he served from August 2010 to June 2013 as Deputy Director of the Institute of Computational and Experimental Research in Mathematics (ICERM), the newest NSF Mathematical Sciences Research Institute. After having spent his entire academic career at Brown University, Prof. Hesthaven decided to pursue new challenges and joined the Mathematics Institute of Computational Science and Engineering (MATHICSE) at Ecole Polytechnique Fédérale de Lausanne (EPFL) in Switzerland in July 2013. In March 2014, he was elected SIAM Fellow for contributions to high-order methods for partial differential equations.

Alfio Quarteroni

Of Italian nationality, Alfio Quarteroni was born on May 30th 1952. He pursued his studies in mathematics at the University of Pavia and at the University of Paris VI. In 1986 he was appointed full professor at the Catholic University of Brescia, later professor in mathematics at the University of Minnesota at Minneapolis and professor in numerical analysis at Politecnico di Milano. He was appointed full professor in 1997 and joined EPFL in 1998. At EPFL, he teaches numerical analysis to engineers and mathematicians and holds specialized courses on mathematical modelling and scientific computing for master and PhD students. He has been scientific director of CRS4 and plenary speaker at more than two hundred international conferences; he is a member of the European Academy of Sciences, the Italian Academy of Sciences, and the Lombard Academy of Science and Letters. He is Editor in Chief of two book series (MS&A and Unitext) by Springer and associate editor of 25 international journals. He was a plenary speaker at the International Congress of Mathematicians ICM2006. He has been responsible for several European research networks. His team carried out the aerodynamic and hydrodynamic simulations for the optimization of Alinghi, the Swiss sailing yacht that won two editions of the America's Cup, in 2003 and 2007.

Hervé Bourlard

Related units (9)

Signal Processing Laboratory 5

Laboratory for Information and Inference Systems

IEM - Administration

Related concepts (27)

Nonlinear dimensionality reduction

Nonlinear dimensionality reduction, also known as manifold learning, refers to various related techniques that aim to project high-dimensional data onto lower-dimensional latent manifolds, with the goal of either visualizing the data in the low-dimensional space, or learning the mapping (either from the high-dimensional space to the low-dimensional embedding or vice versa) itself. These techniques can be understood as generalizations of linear decomposition methods used for dimensionality reduction, such as singular value decomposition and principal component analysis.
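
One way to see how a linear method generalizes to a nonlinear one is kernel PCA, which applies the PCA eigendecomposition to a centered kernel matrix instead of a covariance matrix. The sketch below is only an illustration under stated assumptions: kernel PCA is chosen here as one example among many manifold-learning techniques, and the function name rbf_kernel_pca, the RBF kernel, the value of gamma, and the two-circle data are all hypothetical choices.

```python
import numpy as np


def rbf_kernel_pca(X, n_components=2, gamma=1.0):
    """Embed the rows of X with kernel PCA using an RBF kernel (hypothetical helper)."""
    # Pairwise squared Euclidean distances between rows.
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    K = np.exp(-gamma * np.maximum(sq_dists, 0.0))
    # Center the kernel matrix, i.e. center the data in feature space.
    n = K.shape[0]
    one_n = np.full((n, n), 1.0 / n)
    K_centered = K - one_n @ K - K @ one_n + one_n @ K @ one_n
    # eigh returns eigenvalues in ascending order, so pick the largest ones.
    eigvals, eigvecs = np.linalg.eigh(K_centered)
    order = np.argsort(eigvals)[::-1][:n_components]
    top_vals = np.clip(eigvals[order], 0.0, None)
    # Embedding coordinates: eigenvectors scaled by the square root of the eigenvalues.
    return eigvecs[:, order] * np.sqrt(top_vals), top_vals


rng = np.random.default_rng(0)
# Synthetic data (an assumption): two concentric circles of radius 1 and 3.
angles = rng.uniform(0.0, 2.0 * np.pi, size=100)
radii = np.where(np.arange(100) < 50, 1.0, 3.0)
X = np.column_stack([radii * np.cos(angles), radii * np.sin(angles)])

Z, top_vals = rbf_kernel_pca(X, n_components=2, gamma=0.5)
print(Z.shape)
```

With a linear kernel (K = X Xᵀ on centered data) the same procedure reduces to ordinary PCA, which is the sense in which it generalizes the linear method.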

Canonical correlation

In statistics, canonical-correlation analysis (CCA), also called canonical variates analysis, is a way of inferring information from cross-covariance matrices. If we have two vectors X = (X1, ..., Xn) and Y = (Y1, ..., Ym) of random variables, and there are correlations among the variables, then canonical-correlation analysis will find linear combinations of X and Y which have maximum correlation with each other.
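
The search for maximally correlated linear combinations can be sketched numerically by whitening both sets of variables and taking a singular value decomposition of the whitened cross-covariance matrix. This is one standard formulation, shown here as an illustration only; the function name canonical_correlations, the small ridge term reg, and the synthetic data are assumptions made for the example.

```python
import numpy as np


def canonical_correlations(X, Y, reg=1e-8):
    """Return canonical correlations and weight matrices for X and Y (hypothetical helper)."""
    n = X.shape[0]
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # Sample covariance and cross-covariance; reg is a tiny ridge for numerical stability.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)
    # Cholesky factors: Sxx = Lx Lx^T and Syy = Ly Ly^T.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    # Whitened cross-covariance M = Lx^{-1} Sxy Ly^{-T}.
    M = np.linalg.solve(Lx, Sxy)
    M = np.linalg.solve(Ly, M.T).T
    # Singular values of M are the canonical correlations.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    A = np.linalg.solve(Lx.T, U)      # columns: weights applied to X
    B = np.linalg.solve(Ly.T, Vt.T)   # columns: weights applied to Y
    return s, A, B


rng = np.random.default_rng(0)
# Synthetic data (an assumption): X and Y share one latent signal z.
z = rng.normal(size=500)
X = np.column_stack([z + 0.1 * rng.normal(size=500), rng.normal(size=500)])
Y = np.column_stack([rng.normal(size=500),
                     -z + 0.1 * rng.normal(size=500),
                     rng.normal(size=500)])

s, A, B = canonical_correlations(X, Y)
u = X @ A[:, 0]   # first canonical variate of X
v = Y @ B[:, 0]   # first canonical variate of Y
print(s)
print(np.corrcoef(u, v)[0, 1])
```

The first singular value should match the empirical correlation between the first pair of canonical variates, and it should be high here because both blocks contain the same latent signal.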

Multidimensional scaling

Multidimensional scaling (MDS) is a means of visualizing the level of similarity of individual cases of a dataset. MDS is used to translate "information about the pairwise 'distances' among a set of objects or individuals" into a configuration of points mapped into an abstract Cartesian space. More technically, MDS refers to a set of related ordination techniques used in information visualization, in particular to display the information contained in a distance matrix. It is a form of non-linear dimensionality reduction.
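
The step from a distance matrix to a point configuration can be illustrated with classical (Torgerson) MDS, which double-centers the squared distances and keeps the leading eigenvectors. This is a sketch of one specific variant only, not of the whole family of MDS techniques; the function name classical_mds and the synthetic points are assumptions made for the example.

```python
import numpy as np


def classical_mds(D, n_components=2):
    """Embed points from a matrix of pairwise distances D (hypothetical helper)."""
    n = D.shape[0]
    # Double centering turns squared distances into a Gram (inner product) matrix.
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ (D ** 2) @ J
    # eigh returns eigenvalues in ascending order, so pick the largest ones.
    eigvals, eigvecs = np.linalg.eigh(B)
    order = np.argsort(eigvals)[::-1][:n_components]
    top_vals = np.clip(eigvals[order], 0.0, None)
    return eigvecs[:, order] * np.sqrt(top_vals)


rng = np.random.default_rng(0)
# Synthetic data (an assumption): 30 points in the plane, seen only through their distances.
points = rng.normal(size=(30, 2))
D = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)

Y = classical_mds(D, n_components=2)
D_recovered = np.linalg.norm(Y[:, None, :] - Y[None, :, :], axis=-1)
print(np.abs(D - D_recovered).max())
```

For distances that are exactly Euclidean in two dimensions, as in this example, the recovered configuration reproduces the distance matrix up to rotation, reflection and translation; for general dissimilarities the embedding is only an approximation.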
