Linear regressionIn statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is called simple linear regression; for more than one, the process is called multiple linear regression. This term is distinct from multivariate linear regression, where multiple correlated dependent variables are predicted, rather than a single scalar variable.
Semiregular variable starIn astronomy, a semiregular variable star, a type of variable star, is a giant or supergiant of intermediate and late (cooler) spectral type showing considerable periodicity in its light changes, accompanied or sometimes interrupted by various irregularities. Periods lie in the range from 20 to more than 2000 days, while the shapes of the light curves may be rather different and variable with each cycle. The amplitudes may be from several hundredths to several magnitudes (usually 1-2 magnitudes in the V filter).
Cepheid variableA Cepheid variable (ˈsɛfi.ɪd,_ˈsiːfi-) is a type of variable star that pulsates radially, varying in both diameter and temperature. It changes in brightness, with a well-defined stable period and amplitude. Cepheids are important cosmic benchmarks for scaling galactic and extragalactic distances. A strong direct relationship exists between a Cepheid variable's luminosity and its pulsation period. This characteristic of classical Cepheids was discovered in 1908 by Henrietta Swan Leavitt after studying thousands of variable stars in the Magellanic Clouds.
Empirical distribution functionIn statistics, an empirical distribution function (commonly also called an empirical cumulative distribution function, eCDF) is the distribution function associated with the empirical measure of a sample. This cumulative distribution function is a step function that jumps up by 1/n at each of the n data points. Its value at any specified value of the measured variable is the fraction of observations of the measured variable that are less than or equal to the specified value.
Regression analysisIn statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one or more independent variables (often called 'predictors', 'covariates', 'explanatory variables' or 'features'). The most common form of regression analysis is linear regression, in which one finds the line (or a more complex linear combination) that most closely fits the data according to a specific mathematical criterion.
Minimum-variance unbiased estimatorIn statistics a minimum-variance unbiased estimator (MVUE) or uniformly minimum-variance unbiased estimator (UMVUE) is an unbiased estimator that has lower variance than any other unbiased estimator for all possible values of the parameter. For practical statistics problems, it is important to determine the MVUE if one exists, since less-than-optimal procedures would naturally be avoided, other things being equal. This has led to substantial development of statistical theory related to the problem of optimal estimation.
Independent and identically distributed random variablesIn probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usually abbreviated as i.i.d., iid, or IID. IID was first defined in statistics and finds application in different fields such as data mining and signal processing. Statistics commonly deals with random samples. A random sample can be thought of as a set of objects that are chosen randomly.
Variable-star designationIn astronomy, a variable-star designation is a unique identifier given to variable stars. It uses a variation on the Bayer designation format, with an identifying label (as described below) preceding the Latin genitive of the name of the constellation in which the star lies. See List of constellations for a list of constellations and the genitive forms of their names. The identifying label can be one or two Latin letters or a V plus a number (e.g. V399). Examples are R Coronae Borealis, YZ Ceti, V603 Aquilae.
Asymptotically optimal algorithmIn computer science, an algorithm is said to be asymptotically optimal if, roughly speaking, for large inputs it performs at worst a constant factor (independent of the input size) worse than the best possible algorithm. It is a term commonly encountered in computer science research as a result of widespread use of big-O notation. More formally, an algorithm is asymptotically optimal with respect to a particular resource if the problem has been proven to require Ω(f(n)) of that resource, and the algorithm has been proven to use only O(f(n)).
Gamma distributionIn probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-squared distribution are special cases of the gamma distribution. There are two equivalent parameterizations in common use: With a shape parameter and a scale parameter . With a shape parameter and an inverse scale parameter , called a rate parameter. In each of these forms, both parameters are positive real numbers.