Concept

False discovery rate

In statistics, the false discovery rate (FDR) is a method of conceptualizing the rate of type I errors in null hypothesis testing when conducting multiple comparisons. FDR-controlling procedures are designed to control the FDR, which is the expected proportion of "discoveries" (rejected null hypotheses) that are false (incorrect rejections of the null). Equivalently, the FDR is the expected ratio of the number of false positive classifications (false discoveries) to the total number of positive classifications (rejections of the null). The total number of rejections of the null include both the number of false positives (FP) and true positives (TP). Simply put, FDR = FP / (FP + TP). FDR-controlling procedures provide less stringent control of Type I errors compared to family-wise error rate (FWER) controlling procedures (such as the Bonferroni correction), which control the probability of at least one Type I error. Thus, FDR-controlling procedures have greater power, at the cost of increased numbers of Type I errors. The modern widespread use of the FDR is believed to stem from, and be motivated by, the development in technologies that allowed the collection and analysis of a large number of distinct variables in several individuals (e.g., the expression level of each of 10,000 different genes in 100 different persons). By the late 1980s and 1990s, the development of "high-throughput" sciences, such as genomics, allowed for rapid data acquisition. This, coupled with the growth in computing power, made it possible to seamlessly perform a very high number of statistical tests on a given data set. The technology of microarrays was a prototypical example, as it enabled thousands of genes to be tested simultaneously for differential expression between two biological conditions. As high-throughput technologies became common, technological and/or financial constraints led researchers to collect datasets with relatively small sample sizes (e.g. few individuals being tested) and large numbers of variables being measured per sample (e.

Source officielle

https://en.wikipedia.org/wiki/False_discovery_rate

À propos de ce résultat

Cette page est générée automatiquement et peut contenir des informations qui ne sont pas correctes, complètes, à jour ou pertinentes par rapport à votre recherche. Il en va de même pour toutes les autres pages de ce site. Veillez à vérifier les informations auprès des sources officielles de l'EPFL.

Cours associés (4)

BIO-449: Understanding statistics and experimental design

This course is neither an introduction to the mathematics of statistics nor an introduction to a statistics program such as R. The aim of the course is to understand statistics from its experimental d

CS-401: Applied data analysis

This course teaches the basic techniques, methodologies, and practical skills required to draw meaningful insights from a variety of data, with the help of the most acclaimed software tools in the dat

MATH-232: Probability and statistics (for IC)

A basic course in probability and statistics

Afficher plus

Séances de cours associées (25)

Essais d'hypothèse en statistique

Déplacez-vous dans les tests d'hypothèses, couvrant les statistiques d'essais, les régions critiques, les fonctions de puissance, les valeurs p, les tests multiples et les statistiques non paramétriques.

Problème de tests multiples

Explore les défis que posent les essais multiples dans l'analyse des données génomiques, y compris le contrôle des taux d'erreur, les valeurs de p ajustées, les tests de permutation et les pièges dans les essais d'hypothèses.

Sélection de la stratégie

Explore les défis de sélection de la stratégie, l'évaluation de la performance et les tests statistiques en finance, en soulignant l'importance des portefeuilles de stratégies.

Afficher plus

Publications associées (27)

Density estimation in RKHS with application to Korobov spaces in high dimensions

Fabio Nobile, Yoshihito Kazashi

A kernel method for estimating a probability density function from an independent and identically distributed sample drawn from such density is presented. Our estimator is a linear combination of kernel functions, the coefficients of which are determined b ...

2023

Evaluation and optimization of novel extraction algorithms for the automatic detection of atrial activations recorded within the pulmonary veins during atrial fibrillation

Jean-Marc Vesin, Adrian Luca, Yann Prudat, Sasan Yazdani, Etienne Pruvot

Background and objective The automated detection of atrial activations (AAs) recorded from intracardiac electrograms (IEGMs) during atrial fibrillation (AF) is challenging considering their various amplitudes, morphologies and cycle length. Activation time ...

BMC2022

Social Learning with Disparate Hypotheses

Ali H. Sayed, Virginia Bordignon, Stefan Vlaski, Konstantinos Ntemos

In this paper we study the problem of social learning under multiple true hypotheses and self-interested agents. In this setup, each agent receives data that might be generated from a different hypothesis (or state) than the data other agents receive. In c ...

IEEE2022

Afficher plus

Source officielle

https://en.wikipedia.org/wiki/False_discovery_rate

À propos de ce résultat

Cours associés (4)

BIO-449: Understanding statistics and experimental design

CS-401: Applied data analysis

MATH-232: Probability and statistics (for IC)

A basic course in probability and statistics

Afficher plus

Séances de cours associées (25)

Essais d'hypothèse en statistique

Problème de tests multiples

Sélection de la stratégie

Explore les défis de sélection de la stratégie, l'évaluation de la performance et les tests statistiques en finance, en soulignant l'importance des portefeuilles de stratégies.

Afficher plus

Publications associées (27)

Density estimation in RKHS with application to Korobov spaces in high dimensions

Fabio Nobile, Yoshihito Kazashi

2023

Evaluation and optimization of novel extraction algorithms for the automatic detection of atrial activations recorded within the pulmonary veins during atrial fibrillation

Jean-Marc Vesin, Adrian Luca, Yann Prudat, Sasan Yazdani, Etienne Pruvot

BMC2022

Social Learning with Disparate Hypotheses

Ali H. Sayed, Virginia Bordignon, Stefan Vlaski, Konstantinos Ntemos

IEEE2022

Afficher plus

Personnes associées (3)

Stephan Morgenthaler

EDUCATION Ph.D., Statistics, Princeton University, Princeton, 1983 Diplôme, Mathématiques, Ecole polytechnique fédérale de Zurich, 1979 CARRIÈRE ACADEMIQUE Professeur de statistique appliquée, EPFL, 1991-présent Professeur extraordinaire, statistique appliquée, EPFL, 1988-1991 Professeur associé, statistique, Yale University, 1987-1988 Professeur assistant, statistique, Yale University, 1984-1987 Instructor, mathématiques, Massachusetts Institute of Technology, 1983-1984

Jean-Philippe Thiran

Jean-Philippe Thiran was born in Namur, Belgium, in August 1970. He received the Electrical Engineering degree and the PhD degree from the Université catholique de Louvain (UCL), Louvain-la-Neuve, Belgium, in 1993 and 1997, respectively. From 1993 to 1997, he was the co-ordinator of the medical image analysis group of the Communications and Remote Sensing Laboratory at UCL, mainly working on medical image analysis. Dr Jean-Philippe Thiran joined the Signal Processing Institute (ITS) of the Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland, in February 1998 as a senior lecturer. He was promoted to Assistant Professor in 2004, to Associate Professor in 2011 and is now a Full Professor since 2020. He also holds a 20% position at the Department of Radiology of the University of Lausanne (UNIL) and of the Lausanne University Hospital (CHUV) as Associate Professor ad personam. Dr Thiran's current scientific interests include Computational medical imaging: acquisition, reconstruction and analysis of imaging data, with emphasis on regularized linear inverse problems (compressed sensing, convex optimization). Applications to medical imaging: diffusion MRI, ultrasound imaging, inverse planning in radiotherapy, etc.Computer vision & machine learning: image and video analysis, with application to facial expression recognition, eye tracking, lip reading, industrial inspection, medical image analysis, etc.

Afficher plus

Unités associées (3)

Laboratoire de traitement des signaux 5

Groupe de scientifiques IEL

IEM - Gestion

Concepts associés (5)

Multiple comparisons problem

In statistics, the multiple comparisons, multiplicity or multiple testing problem occurs when one considers a set of statistical inferences simultaneously or infers a subset of parameters selected based on the observed values. The more inferences are made, the more likely erroneous inferences become. Several statistical techniques have been developed to address that problem, typically by requiring a stricter significance threshold for individual comparisons, so as to compensate for the number of inferences being made.

Valeur p

vignette|redresse=1.5|Illustration de la valeur-p. X désigne la loi de probabilité de la statistique de test et z la valeur calculée de la statistique de test. Dans un test statistique, la valeur-p (en anglais p-value pour probability value), parfois aussi appelée p-valeur, est la probabilité pour un modèle statistique donné sous l'hypothèse nulle d'obtenir une valeur au moins aussi extrême que celle observée. L'usage de la valeur-p est courant dans de nombreux domaines de recherche comme la physique, la psychologie, l'économie et les sciences de la vie.

Data dredging

vignette|Exemple de Data dredging. Le data dredging (littéralement le dragage de données mais mieux traduit comme étant du triturage de données) est une technique statistique qui . Une des formes du data dredging est de partir de données ayant un grand nombre de variables et un grand nombre de résultats, et de choisir les associations qui sont « statistiquement significatives », au sens de la valeur p (on parle aussi de p-hacking).

Afficher plus