Search results
Results from the WOW.Com Content Network
Box-and-whisker plot with four mild outliers and one extreme outlier. In this chart, outliers are defined as mild above Q3 + 1.5 IQR and extreme above Q3 + 3 IQR. The interquartile range is often used to find outliers in data. Outliers here are defined as observations that fall below Q1 − 1.5 IQR or above Q3 + 1.5 IQR.
Figure 2. Box-plot with whiskers from minimum to maximum Figure 3. Same box-plot with whiskers drawn within the 1.5 IQR value. A boxplot is a standardized way of displaying the dataset based on the five-number summary: the minimum, the maximum, the sample median, and the first and third quartiles.
The five-number summary gives information about the location (from the median), spread (from the quartiles) and range (from the sample minimum and maximum) of the observations. Since it reports order statistics (rather than, say, the mean) the five-number summary is appropriate for ordinal measurements , as well as interval and ratio measurements.
The 4-quantiles are called quartiles → Q; the difference between upper and lower quartiles is also called the interquartile range, midspread or middle fifty → IQR = Q 3 − Q 1. The 5-quantiles are called quintiles or pentiles → QU; The 6-quantiles are called sextiles → S; The 7-quantiles are called septiles → SP; The 8-quantiles are ...
The Interquartile Range (IQR), defined as the difference between the upper and lower quartiles (), may be used to characterize the data when there may be extremities that skew the data; the interquartile range is a relatively robust statistic (also sometimes called "resistance") compared to the range and standard deviation. There is also a ...
Box plot of the Michelson–Morley experiment, showing several summary statistics. In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in
where is the interquartile range of the data and is the number of observations in the sample . In fact if the normal density is used the factor 2 in front comes out to be ∼ 2.59 {\displaystyle \sim 2.59} , [ 4 ] but 2 is the factor recommended by Freedman and Diaconis.
The sample range is the difference between the maximum and minimum. It is a function of the order statistics: {, …,} = (). A similar important statistic in exploratory data analysis that is simply related to the order statistics is the sample interquartile range.