|
|
|
|||||||||||||||||||||
|
ANOVA
One-Way Analysis of Variance " ANOVA " is used to compare the means of two or more samples against each other to determine whether it is likely that the samples could come from populations with the same mean. This is similar to a 2-Sample t-Test except that three or more samples can be examined with(ANOVA). (ANOVA) can also be used to examine multiple Xs at the same time , but here the focus is primarily on the One-Way (ANOVA), which examines just one X. For example, a Team might need to determine if 3 operators: - A single X Operator - With 3 levels 3 Operators Take the same amount of time to perform a task. A data sample would be taken. For example : 15 points (times in this case) for each operator. ANOVA is used to make the judgment if all the operators' average (mean) task times are the same. The level of confidence in the answer depends on how far apart the means of the samples are, how much variability there is in the sample data, and how many data points there are. This is shown graphically in Figure below . The upper curves represent the distributions of all three operators' times (known as the populations). The exact nature of each of these distributions is unknown to the Team, because they represent all data points for all time. What the Team can see, however, are the samples taken, one from each population, shown as the lower curves. ANOVA examines the sample data with the aim of making an inference on the location of the population means (μ) relative to each other. It does this by breaking down the variation (using variances) in all the sample data into separate pieces, hence the name Analysis Of Variance. ANOVA compares the size of the variation between the samples versus the variationwithin the samples. Graphical representationof ANOVA.
If the between variation is not large compared with the within variation, then it is likely that the means of the parent distribution are about the same, or more specifically that the test cannot distinguish between them. The result of the test would be a degree of confidence (a p-value) that the samples come from populations with the same mean. In practical terms, the p-value gives an indication of the probability that the mean operator times are the same going forward. If the p-value is low, then at least one of the mean operator times is distinguishable from the others; if the p-value is high, they all are not distinguishable. RoadmapThe roadmap of the test analysis itself is shown graphically in Figure below
One-Way ANOVARoadmap.
Interpreting the outputCalculates a ratio of the signal(variation due to the X, the "between") relative to the noise (any other variation not due to the X, the "within"). If the signal-to-noise ratio gets large enough then this would be considered to be unlikely to have occurred purely by random chance and the X is thus considered statistically significant. This is achieved by looking up the signal-to-noise ratio in a reference distribution (F-Test), which returns a p-value. The p-value represents the likelihood that an effect this large could have occurred purely by random chance even if the populations were aligned. Based on the p-values, statements can be generally formed as follows: Example output from an ANOVA ANOVA results for a comparison of samples of Bob's vs Jane's vs Walt's performance(output from Minitab v14).
From the first table in the results:
| ||||||||||||||||||||||