Another approach to robust estimation of regression models is to replace the normal distribution with a heavy-tailed distribution. A t-distribution with 4–6 degrees of freedom has been reported to be a good choice in various practical situations. Bayesian robust regression, being fully parametric, relies heavily on such distributions.
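As a rough illustration of this idea (my own sketch, not taken from the article), the snippet below fits a straight line by maximum likelihood with Student-t errors, with the degrees of freedom fixed at 5, using scipy. The function name `neg_log_lik` and the synthetic data are assumptions for the example; a fully Bayesian robust regression would additionally place priors on the parameters.

```python
# Minimal sketch: maximum-likelihood linear regression with Student-t errors (df = 5),
# one way to make the fit less sensitive to heavy-tailed noise than Gaussian least squares.
import numpy as np
from scipy import optimize, stats

rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2.0 + 0.5 * x + rng.standard_t(df=5, size=x.size)   # synthetic heavy-tailed noise

def neg_log_lik(params, x, y, df=5):
    intercept, slope, log_scale = params
    resid = y - (intercept + slope * x)
    # Student-t log-density of the residuals; the scale is kept positive via exp()
    return -np.sum(stats.t.logpdf(resid, df=df, scale=np.exp(log_scale)))

fit = optimize.minimize(neg_log_lik, x0=[0.0, 0.0, 0.0], args=(x, y))
intercept, slope, log_scale = fit.x
print(f"intercept={intercept:.3f}, slope={slope:.3f}, scale={np.exp(log_scale):.3f}")
```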
Robust Regression and Outlier Detection is a book on robust statistics, particularly focusing on the breakdown point of methods for robust regression. It was written by Peter Rousseeuw and Annick M. Leroy, and published in 1987 by Wiley.
First, an outlier detection method that relies on a non-robust initial fit can suffer from the effect of masking, that is, a group of outliers can mask each other and escape detection.[17] Second, if a high breakdown initial fit is used for outlier detection, the follow-up analysis might inherit some of the inefficiencies of the initial estimator.
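The following sketch (my own construction, not from the article) illustrates masking: a cluster of outliers drags the least-squares line toward itself and inflates the residual scale, so the outliers' standardized residuals look less extreme, while against a robust initial fit (here Theil–Sen via scipy, with a MAD-based scale) the same points stand out clearly.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
x = np.arange(30, dtype=float)
y = 1.0 + 0.8 * x + rng.normal(scale=0.5, size=x.size)
y[-5:] += 20.0                                   # a group of five gross outliers

# Non-robust initial fit: the outliers tilt the line and inflate the residual scale
slope_ls, intercept_ls = np.polyfit(x, y, deg=1)
r_ls = y - (intercept_ls + slope_ls * x)
print("OLS    |z| of outliers:", np.round(np.abs(r_ls[-5:]) / r_ls.std(), 1))

# Robust initial fit: the outliers' residuals are large relative to a robust scale
slope_r, intercept_r, _, _ = stats.theilslopes(y, x)
r_r = y - (intercept_r + slope_r * x)
mad = np.median(np.abs(r_r - np.median(r_r)))
print("Robust |z| of outliers:", np.round(np.abs(r_r[-5:]) / (1.4826 * mad), 1))
```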
Least trimmed squares (LTS), or least trimmed sum of squares, is a robust statistical method that fits a function to a set of data whilst not being unduly affected by the presence of outliers.[1] It is one of a number of methods for robust regression.
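The sketch below (a rough simplification of my own, not the full FAST-LTS algorithm) conveys the idea behind LTS: minimize the sum of the h smallest squared residuals by repeatedly refitting on the h best-fitting observations ("concentration steps") from several random starts. The function name `lts_fit` and all tuning constants are assumptions for this example.

```python
import numpy as np

def lts_fit(x, y, h, n_starts=50, n_csteps=20, seed=0):
    rng = np.random.default_rng(seed)
    X = np.column_stack([np.ones_like(x), x])
    best_coef, best_obj = None, np.inf
    for _ in range(n_starts):
        idx = rng.choice(len(x), size=2, replace=False)   # minimal random starting subset
        coef = np.linalg.lstsq(X[idx], y[idx], rcond=None)[0]
        for _ in range(n_csteps):
            resid2 = (y - X @ coef) ** 2
            keep = np.argsort(resid2)[:h]                 # h best-fitting points
            coef = np.linalg.lstsq(X[keep], y[keep], rcond=None)[0]
        obj = np.sort((y - X @ coef) ** 2)[:h].sum()      # trimmed sum of squares
        if obj < best_obj:
            best_obj, best_coef = obj, coef
    return best_coef

rng = np.random.default_rng(3)
x = np.linspace(0, 10, 40)
y = 1.0 + 2.0 * x + rng.normal(scale=0.3, size=x.size)
y[:10] = 50.0                                             # 25% gross outliers
print(lts_fit(x, y, h=30))                                # close to [1.0, 2.0]
```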
The squared loss has the disadvantage that it has the tendency to be dominated by outliers: when summing over a set of values a_i (as in Σ_i L(a_i)), the sample mean is influenced too much by a few particularly large values of a when the distribution is heavy-tailed. In terms of estimation theory, the asymptotic relative efficiency of the mean is poor for heavy ...
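A tiny numerical illustration of this point (my own, not from the article): with squared loss, a single large value contributes most of the summed loss and drags the mean, which minimizes that sum, toward itself; the median, which minimizes the summed absolute loss, barely moves.

```python
import numpy as np

a = np.array([1.1, 0.9, 1.0, 1.2, 0.8, 25.0])        # one heavy-tailed observation
sq_contrib = (a - a.mean()) ** 2
print("outlier's share of the squared loss:", sq_contrib[-1] / sq_contrib.sum())  # ~0.83
print("mean  :", a.mean())      # pulled toward 25
print("median:", np.median(a))  # stays near 1
```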
It has also been called Sen's slope estimator,[1][2] slope selection,[3][4] the single median method,[5] the Kendall robust line-fit method,[6] and the Kendall–Theil robust line.[7] It is named after Henri Theil and Pranab K. Sen, who published papers on this method in 1950 and 1968 respectively,[8] and after Maurice Kendall ...
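A minimal sketch of the estimator itself (the helper name `theil_sen` and the data are my own, not library code): the slope estimate is the median of the slopes over all pairs of points, and the intercept is the median of y_i − slope · x_i.

```python
import numpy as np
from itertools import combinations

def theil_sen(x, y):
    slopes = [(y[j] - y[i]) / (x[j] - x[i])
              for i, j in combinations(range(len(x)), 2)
              if x[j] != x[i]]                     # skip pairs with equal x
    slope = np.median(slopes)
    intercept = np.median(y - slope * x)
    return slope, intercept

rng = np.random.default_rng(2)
x = np.arange(20, dtype=float)
y = 3.0 + 1.5 * x + rng.normal(size=x.size)
y[:3] += 40.0                                      # a few gross outliers
print(theil_sen(x, y))                             # slope stays near 1.5
```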
First, the statistician may remove the suspected outliers from the data set and then use the arithmetic mean to estimate the location parameter. Second, the statistician may use a robust statistic, such as the median. Peirce's criterion is a statistical procedure for eliminating outliers.
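A small comparison of the two strategies (my own illustration; the trimmed mean stands in for a formal deletion rule such as Peirce's criterion and is not that procedure itself):

```python
import numpy as np
from scipy import stats

data = np.array([10.2, 9.8, 10.1, 10.0, 9.9, 10.3, 42.0])   # one gross outlier

print("plain mean   :", data.mean())                 # pulled toward 42
print("trimmed mean :", stats.trim_mean(data, 0.2))  # drop 20% from each tail, then average
print("median       :", np.median(data))             # robust without any deletion step
```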
Moreover, the MAD is a robust statistic, being more resilient to outliers in a data set than the standard deviation. In the standard deviation, the distances from the mean are squared, so large deviations are weighted more heavily, and thus outliers can heavily influence it. In the MAD, the deviations of a small number of outliers are irrelevant.
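A short numeric illustration of this resilience (my own, not from the article; the helper name `mad` is an assumption): adding a single outlier inflates the standard deviation by an order of magnitude but leaves the MAD essentially unchanged.

```python
import numpy as np

clean = np.array([2.0, 2.1, 1.9, 2.2, 2.0, 1.8, 2.1])
dirty = np.append(clean, 20.0)                       # add a single outlier

def mad(a):
    # median absolute deviation from the median
    return np.median(np.abs(a - np.median(a)))

print("std  clean vs dirty:", clean.std(ddof=1), dirty.std(ddof=1))   # ~0.14 vs ~6.3
print("MAD  clean vs dirty:", mad(clean), mad(dirty))                 # 0.1 vs 0.1
```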