Explained variation

In statistics, explained variation measures the proportion to which a mathematical model accounts for the variation (dispersion) of a given data set. Often, variation is quantified as variance; then, the more specific term explained variance can be used. The complementary part of the total variation is called unexplained or residual variation. Following Kent (1983), we use the Fraser information (Fraser 1965) where is the probability density of a random variable , and with () are two families of parametric models. Model family 0 is the simpler one, with a restricted parameter space . Parameters are determined by maximum likelihood estimation, The information gain of model 1 over model 0 is written as where a factor of 2 is included for convenience. Γ is always nonnegative; it measures the extent to which the best model of family 1 is better than the best model of family 0 in explaining g(r). Assume a two-dimensional random variable where X shall be considered as an explanatory variable, and Y as a dependent variable. Models of family 1 "explain" Y in terms of X, whereas in family 0, X and Y are assumed to be independent. We define the randomness of Y by , and the randomness of Y, given X, by . Then, can be interpreted as proportion of the data dispersion which is "explained" by X. Fraction of variance unexplained The fraction of variance unexplained is an established concept in the context of linear regression. The usual definition of the coefficient of determination is based on the fundamental concept of explained variance. Let X be a random vector, and Y a random variable that is modeled by a normal distribution with centre . In this case, the above-derived proportion of explained variation equals the squared correlation coefficient . Note the strong model assumptions: the centre of the Y distribution must be a linear function of X, and for any given x, the Y distribution must be normal. In other situations, it is generally not justified to interpret as proportion of explained variance.

What drives electricity tariffs in Switzerland? Two-stage statistical and geospatial analysis of structural differences across 1913 municipalities

Yael Frischholz, Noémie Alice Yvonne Ségolène Jeannin, Fabian Heymann

We present a two-staged statistical and geospatial analysis exploring the discrepancies of household electricity tariffs across 1,913 Swiss municipalities. First, we perform a multilinear regression analysis, considering structural, sociodemographic data a ...

2023

Characterization of high harmonic frequencies in reactor noise experiments within the CORTEX project

Andreas Pautz, Vincent Pierre Lamirand

We present a novel technique of neutron noise detection and experimental data interpretation developed during the EU H2020 project CORTEX aiming to improve the capabilities for identification and localization of neutron noise sources. The experimental data ...

2022

Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

Vinitra Swamy, Bahar Radmehr, Mirko Marras, Natasa Krco

Neural networks are ubiquitous in applied machine learning for education. Their pervasive success in predictive performance comes alongside a severe weakness, the lack of explainability of their decisions, especially relevant in human-centric fields. We im ...

2022

What drives electricity tariffs in Switzerland? Two-stage statistical and geospatial analysis of structural differences across 1913 municipalities

Characterization of high harmonic frequencies in reactor noise experiments within the CORTEX project

Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

Graph Chatbot

Chat with Graph Search

Characterization of high harmonic frequencies in reactor noise experiments within the CORTEX project

What drives electricity tariffs in Switzerland? Two-stage statistical and geospatial analysis of structural differences across 1913 municipalities

Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs