Concept

Posterior predictive distribution

In Bayesian statistics, the posterior predictive distribution is the distribution of possible unobserved values conditional on the observed values.

Given a set of $N$ i.i.d. observations $\mathbf{X} = \{x_1, \dots, x_N\}$, a new value $\tilde{x}$ will be drawn from a distribution $p(\tilde{x} \mid \theta)$ that depends on a parameter $\theta \in \Theta$, where $\Theta$ is the parameter space. It may seem tempting to plug in a single best estimate $\hat{\theta}$ for $\theta$, but this ignores uncertainty about $\theta$, and because a source of uncertainty is ignored, the resulting predictive distribution will be too narrow. Put another way, extreme values of $\tilde{x}$ will be assigned a lower probability than they would be if the uncertainty in the parameters, as given by their posterior distribution, were accounted for.

A posterior predictive distribution accounts for uncertainty about $\theta$. The posterior distribution of possible $\theta$ values depends on $\mathbf{X}$:

$$p(\theta \mid \mathbf{X})$$

The posterior predictive distribution of $\tilde{x}$ given $\mathbf{X}$ is calculated by marginalizing the distribution of $\tilde{x}$ given $\theta$ over the posterior distribution of $\theta$ given $\mathbf{X}$:

$$p(\tilde{x} \mid \mathbf{X}) = \int_{\Theta} p(\tilde{x} \mid \theta) \, p(\theta \mid \mathbf{X}) \, d\theta$$

Because it accounts for uncertainty about $\theta$, the posterior predictive distribution will in general be wider than a predictive distribution which plugs in a single best estimate for $\theta$.

The prior predictive distribution, in a Bayesian context, is the distribution of a data point marginalized over its prior distribution $G$. That is, if $\tilde{x} \sim F(\tilde{x} \mid \theta)$ and $\theta \sim G(\theta \mid \alpha)$, then the prior predictive distribution is the corresponding distribution $H(\tilde{x} \mid \alpha)$, where

$$p_H(\tilde{x} \mid \alpha) = \int_{\Theta} p_F(\tilde{x} \mid \theta) \, p_G(\theta \mid \alpha) \, d\theta$$

This is similar to the posterior predictive distribution except that the marginalization (or equivalently, expectation) is taken with respect to the prior distribution instead of the posterior distribution.

Furthermore, if the prior distribution $G(\theta \mid \alpha)$ is a conjugate prior, then the posterior predictive distribution will belong to the same family of distributions as the prior predictive distribution. To see this, note that if the prior is conjugate, then

$$p(\theta \mid \mathbf{X}, \alpha) = p_G(\theta \mid \alpha')$$

i.e. the posterior distribution also belongs to $G$, but with a different parameter $\alpha'$ instead of the original parameter $\alpha$. Then,

$$p(\tilde{x} \mid \mathbf{X}, \alpha) = \int_{\Theta} p_F(\tilde{x} \mid \theta) \, p(\theta \mid \mathbf{X}, \alpha) \, d\theta = \int_{\Theta} p_F(\tilde{x} \mid \theta) \, p_G(\theta \mid \alpha') \, d\theta = p_H(\tilde{x} \mid \alpha')$$

Hence, the posterior predictive distribution follows the same distribution $H$ as the prior predictive distribution, but with the posterior values of the hyperparameters substituted for the prior ones.
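
The contrast between a plug-in prediction and the posterior predictive distribution can be illustrated with a small conjugate example. The sketch below is not from the source: it assumes a Bernoulli likelihood with a Beta prior, hypothetical data (7 successes in 10 trials), and the task of predicting the number of successes in 20 future trials. Under those assumptions the prior and posterior predictive distributions are both beta-binomial, differing only in their hyperparameters, and the plug-in prediction is a binomial with a fixed success probability. All function and variable names are illustrative.

```python
import math
import random


def log_beta(a: float, b: float) -> float:
    """Log of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def beta_binomial_pmf(k: int, m: int, a: float, b: float) -> float:
    """P(k successes in m trials) with theta ~ Beta(a, b) integrated out."""
    return math.comb(m, k) * math.exp(log_beta(k + a, m - k + b) - log_beta(a, b))


def binomial_pmf(k: int, m: int, theta: float) -> float:
    """P(k successes in m trials) for a fixed success probability theta."""
    return math.comb(m, k) * theta**k * (1.0 - theta) ** (m - k)


def variance(pmf: list) -> float:
    """Variance of a distribution on 0..len(pmf)-1 given as a list of probabilities."""
    mean = sum(k * p for k, p in enumerate(pmf))
    return sum((k - mean) ** 2 * p for k, p in enumerate(pmf))


# Hypothetical data (an assumption): 7 successes in N = 10 trials, Beta(1, 1) prior.
a, b = 1.0, 1.0
successes, n_obs = 7, 10
a_post, b_post = a + successes, b + (n_obs - successes)  # conjugate update

m = 20  # number of future trials to predict
theta_hat = a_post / (a_post + b_post)  # posterior mean used as the plug-in estimate

plug_in = [binomial_pmf(k, m, theta_hat) for k in range(m + 1)]
prior_pred = [beta_binomial_pmf(k, m, a, b) for k in range(m + 1)]
post_pred = [beta_binomial_pmf(k, m, a_post, b_post) for k in range(m + 1)]

# Monte Carlo check of the marginalization: draw theta from the posterior,
# then average p(k | theta) over the draws.
rng = random.Random(0)
draws = [rng.betavariate(a_post, b_post) for _ in range(20000)]
mc_pred = [sum(binomial_pmf(k, m, t) for t in draws) / len(draws) for k in range(m + 1)]

print(f"plug-in variance:              {variance(plug_in):.3f}")
print(f"posterior predictive variance: {variance(post_pred):.3f}")
max_gap = max(abs(p - q) for p, q in zip(post_pred, mc_pred))
print(f"max |closed form - Monte Carlo|: {max_gap:.4f}")
```

In this setup the two predictions share the same mean, but the posterior predictive variance is larger than the plug-in variance, which is the widening described above. The closed-form posterior predictive reuses `beta_binomial_pmf` with `(a_post, b_post)` in place of `(a, b)`, mirroring the substitution of $\alpha'$ for $\alpha$.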

Official source

https://en.wikipedia.org/wiki/Posterior_predictive_distribution

Related courses (11)

EE-209: Elements of statistics for data science

COM-406: Foundations of Data Science

We discuss a set of topics that are important for the understanding of modern data science but that are typically not taught in an introductory ML course. In particular we discuss fundamental ideas an

MATH-232: Probability and statistics (for IC)

A basic course in probability and statistics

Related publications (29)

Valence can control the nonexponential viscoelastic relaxation of multivalent reversible gels

Hugo Camille Valentin Le Roy

Gels made of telechelic polymers connected by reversible cross-linkers are a versatile design platform for biocompatible viscoelastic materials. Their linear response to a step strain displays a fast, near-exponential relaxation when using low-valence cros ...

Amer Assoc Advancement Science, 2024

TIC-TAC: A Framework for Improved Covariance Estimation in Deep Heteroscedastic Regression

Mathieu Salzmann, Alexandre Massoud Alahi, Megh Hiren Shukla

Deep heteroscedastic regression involves jointly optimizing the mean and covariance of the predicted distribution using the negative log-likelihood. However, recent works show that this may result in sub-optimal convergence due to the challenges associated ...

2024

Influence of model uncertainty and long term deformations in action effects calculation in reinforced concrete structures

Xhemsi Malja

Most codes of practice adopt a semi probabilistic design approach for the dimensioning and assessment of structures. Accordingly, structural safety is ensured by performing limit state verifications using design values determined with adequately calibrated ...

EPFL, 2024

Related people (1)

Hervé Bourlard

Related concepts (7)

Compound probability distribution

In probability and statistics, a compound probability distribution (also known as a mixture distribution or contagious distribution) is the probability distribution that results from assuming that a random variable is distributed according to some parametrized distribution, with (some of) the parameters of that distribution themselves being random variables. If the parameter is a scale parameter, the resulting mixture is also called a scale mixture.

Dirichlet-multinomial distribution

In probability theory and statistics, the Dirichlet-multinomial distribution is a family of discrete multivariate probability distributions on a finite support of non-negative integers. It is also called the Dirichlet compound multinomial distribution (DCM) or multivariate Pólya distribution (after George Pólya). It is a compound probability distribution, where a probability vector $\mathbf{p}$ is drawn from a Dirichlet distribution with parameter vector $\boldsymbol{\alpha}$, and an observation is drawn from a multinomial distribution with probability vector $\mathbf{p}$ and number of trials $n$.

Categorical distribution

In probability theory and statistics, a categorical distribution (also called a generalized Bernoulli distribution or multinoulli distribution) is a discrete probability distribution that describes the possible results of a random variable that can take on one of K possible categories, with the probability of each category separately specified. There is no innate underlying ordering of these outcomes, but numerical labels (e.g. 1 to K) are often attached for convenience in describing the distribution.
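
The two-stage construction in the Dirichlet-multinomial entry above (draw a probability vector from a Dirichlet, then draw counts from a multinomial) can be sketched with the Python standard library alone. This is an illustrative sketch, not a reference implementation: the concentration parameters, the number of trials, and the function names are assumptions chosen for the example. The Dirichlet draw uses the standard normalized-Gamma construction, and each multinomial draw is built from n categorical draws.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw a probability vector p ~ Dirichlet(alpha) via normalized Gamma draws."""
    g = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(g)
    return [x / total for x in g]


def sample_dirichlet_multinomial(alpha, n, rng):
    """Draw p ~ Dirichlet(alpha), then category counts ~ Multinomial(n, p)."""
    p = sample_dirichlet(alpha, rng)
    counts = [0] * len(alpha)
    # Each of the n trials is one categorical draw with probabilities p.
    for category in rng.choices(range(len(alpha)), weights=p, k=n):
        counts[category] += 1
    return counts


rng = random.Random(0)
alpha = [2.0, 1.0, 1.0]  # hypothetical concentration parameters
n = 10                   # number of trials per observation
samples = [sample_dirichlet_multinomial(alpha, n, rng) for _ in range(5000)]

# The expected count for category i is n * alpha_i / sum(alpha).
empirical_mean = [sum(s[i] for s in samples) / len(samples) for i in range(len(alpha))]
expected_mean = [n * a / sum(alpha) for a in alpha]
print("empirical mean counts:", [round(v, 2) for v in empirical_mean])
print("expected mean counts: ", expected_mean)
```

With `n = 1` each sample reduces to a single categorical draw whose category probabilities are the normalized concentration parameters.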

Related lectures (30)

Exponential family: maximum entropy distribution

Covers exponential families, maximum entropy, and properties of the Maxwell-Boltzmann distribution.

Dirichlet-multinomial model

Discusses the Dirichlet distribution, Bayesian inference, the posterior mean and variance, conjugate priors, and the predictive distribution in the Dirichlet-multinomial model.

Bayesian estimation: overview and examples

Introduces Bayesian estimation, covering classical versus Bayesian inference, conjugate priors, MCMC methods, and practical examples such as temperature estimation and choice modeling.
