Concept

Bootstrapping (statistics)

Bootstrapping is any test or metric that uses random sampling with replacement (e.g. mimicking the sampling process), and falls under the broader class of resampling methods. Bootstrapping assigns measures of accuracy (bias, variance, confidence intervals, prediction error, etc.) to sample estimates. This technique allows estimation of the sampling distribution of almost any statistic using random sampling methods. Bootstrapping estimates the properties of an estimand (such as its variance) by measuring those properties when sampling from an approximating distribution. One standard choice for an approximating distribution is the empirical distribution function of the observed data. In the case where a set of observations can be assumed to be from an independent and identically distributed population, this can be implemented by constructing a number of resamples with replacement, of the observed data set (and of equal size to the observed data set). It may also be used for constructing hypothesis tests. It is often used as an alternative to statistical inference based on the assumption of a parametric model when that assumption is in doubt, or where parametric inference is impossible or requires complicated formulas for the calculation of standard errors. The bootstrap was published by Bradley Efron in "Bootstrap methods: another look at the jackknife" (1979), inspired by earlier work on the jackknife. Improved estimates of the variance were developed later. A Bayesian extension was developed in 1981. The bias-corrected and accelerated (BCa) bootstrap was developed by Efron in 1987, and the ABC procedure in 1992. The basic idea of bootstrapping is that inference about a population from sample data (sample → population) can be modeled by resampling the sample data and performing inference about a sample from resampled data (resampled → sample). As the population is unknown, the true error in a sample statistic against its population value is unknown.

Official source

https://en.wikipedia.org/wiki/Bootstrapping_(statistics)

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related courses (18)

MATH-412: Statistical machine learning

A course on statistical machine learning for supervised and unsupervised learning

MATH-600: Optimization and simulation

Master state-of-the art methods in optimization with heuristics and simulation. Work involves:

reading the material beforehand
class hours to discuss the material and solve problems
homework

EE-209: Elements of statistics for data science

Related lectures (32)

Model Evaluation

Delves into model evaluation, covering theory, training error, prediction error, resampling methods, and information criteria.

Assumption-lean Inference: Generalised Linear Model Parameters

Explores assumption-lean inference for statistical estimands in generalised linear models, emphasizing robust and generic approaches.

Extreme Value Theory: Applications and Estimation

Explores Extreme Value Theory applications, estimation strategies, and modelling techniques for statistical analysis of extremes in time series.

Related publications (29)

Bootstrapping smooth conformal defects in Chern-Simons-matter theories

Barak Gabai, Amit Sever

The expectation value of a smooth conformal line defect in a CFT is a conformal invariant functional of its path in space-time. For example, in large N holographic theories, these fundamental observables are dual to the open-string partition function in Ad ...

Springer Nature2024

We Need Subject Matter Expertise to Choose and Identify Causal Estimands: Comment on "Estimands for Recurrent Event Endpoints in the Presence of a Terminal Event"

Matias Janvin, Mats Julius Stensrud

We summarize what we consider to be the two main limitations of the "Estimands for Recurrent Event Endpoints in the Presence of a Terminal Event" (Schmidli et al. 2022). First, the authors did not give detailed guidance on how to choose an appropriate esti ...

TAYLOR & FRANCIS INC2023

The two-point correlation function covariance with fewer mocks

Cheng Zhao

We present FITCOV an approach for accurate estimation of the covariance of two-point correlation functions that requires fewer mocks than the standard mock-based covariance. This can be achieved by dividing a set of mocks into jackknife regions and fitting ...

Oxford Univ Press2023

Official source

https://en.wikipedia.org/wiki/Bootstrapping_(statistics)

About this result

Ontological neighbourhood

Statistics

Statistical inference: Mathematical statistics

Related courses (18)

MATH-412: Statistical machine learning

A course on statistical machine learning for supervised and unsupervised learning

MATH-600: Optimization and simulation

Master state-of-the art methods in optimization with heuristics and simulation. Work involves:

reading the material beforehand
class hours to discuss the material and solve problems
homework

EE-209: Elements of statistics for data science

Related lectures (32)

Model Evaluation

Delves into model evaluation, covering theory, training error, prediction error, resampling methods, and information criteria.

Assumption-lean Inference: Generalised Linear Model Parameters

Explores assumption-lean inference for statistical estimands in generalised linear models, emphasizing robust and generic approaches.

Extreme Value Theory: Applications and Estimation

Explores Extreme Value Theory applications, estimation strategies, and modelling techniques for statistical analysis of extremes in time series.

Related publications (29)

Bootstrapping smooth conformal defects in Chern-Simons-matter theories

Barak Gabai, Amit Sever

Springer Nature2024

We Need Subject Matter Expertise to Choose and Identify Causal Estimands: Comment on "Estimands for Recurrent Event Endpoints in the Presence of a Terminal Event"

Matias Janvin, Mats Julius Stensrud

TAYLOR & FRANCIS INC2023

The two-point correlation function covariance with fewer mocks

Cheng Zhao

Oxford Univ Press2023

Related concepts (20)

T-statistic

In statistics, the t-statistic is the ratio of the departure of the estimated value of a parameter from its hypothesized value to its standard error. It is used in hypothesis testing via Student's t-test. The t-statistic is used in a t-test to determine whether to support or reject the null hypothesis. It is very similar to the z-score but with the difference that t-statistic is used when the sample size is small or the population standard deviation is unknown.

Resampling (statistics)

In statistics, resampling is the creation of new samples based on one observed sample. Resampling methods are: Permutation tests (also re-randomization tests) Bootstrapping Cross validation Permutation test Permutation tests rely on resampling the original data assuming the null hypothesis. Based on the resampled data it can be concluded how likely the original data is to occur under the null hypothesis.

Nuisance parameter

In statistics, a nuisance parameter is any parameter which is unspecified but which must be accounted for in the hypothesis testing of the parameters which are of interest. The classic example of a nuisance parameter comes from the normal distribution, a member of the location–scale family. For at least one normal distribution, the variance(s), σ2 is often not specified or known, but one desires to hypothesis test on the mean(s).