Six Sigma Online

  • Increase font size
  • Default font size
  • Decrease font size
Home Six Sigma Heretic ANOVA by any other name

ANOVA by any other name

E-mail Print PDF
User Rating: / 0
PoorBest 
By the time you are reading this, you will have made your New Year’s resolutions, and I will have already broken mine. But I have an idea for a resolution that you might be interested in keeping, and one that could make your New Year happy and profitable. It is something that most Black Belts I talk to have never heard of. Before we get there, we need to talk about another powerful tool, with which you may be familiar.

Analysis of variance (ANOVA) is an elegant procedure—simple, economical, and powerful. Let's say we have this research question:

“Which of the four materials being considered for making a gear has the best wear characteristic?”

This will lead to the statistical question, “Which of the five materials has the highest average wear?” (I’ll discuss another statistical question—about the dispersion—in next month’s column.)

We make up eight gears out of each of the five materials and run them on our wear tester (on which, of course, we have performed our measurement system analysis and determined acceptability). You can get the data here.

We might be tempted to do multiple t-tests, but we would have to do 10 different t-tests (which is annoying). Even worse, when we do that we increase the chance of making an α error. If αFW is the chance of making a Type I error during all the tests, αPC is the Type I error for each test and c is the total number of tests:

So if our αPC for each t-test is 0.05, our actual αFW is inflated to about 40.13 percent.

Holy leftover fruit cake, Batman! That’s a serious chance of concluding that a significant difference exists in the materials when in fact they don’t.

Luckily, there’s a way to test for equality of all group averages in one test. ANOVA works because we have two potential sources of variation: variation within each group and variation between the averages of each group. If all the groups have the same average wear, then the variation between and within the groups is due only to random chance, and the different materials have no effect on wear. On the other hand, if the different materials do have different averages, the total variation we see will be higher than we would have guessed from the variability within each group.

I can therefore estimate the population variance in two ways. I can take the average of the variances within each group and I can find the variance between the means and divide by the sample size. Both ought to give me the same answer, within sampling error, if the true means of the groups are all the same. Here’s the genius part—if I make a ratio of these two estimates, I can use the good old F-statistic to test to see if they’re equal. If they’re different by an amount that could be due to sampling error, then as far as I can tell, the groups are all the same average and I calculate an F close to one. If the averages really are different, then the between estimate of the variance contains sample error and a variance component due to the differences in group averages.

Clearly, this F is going to be larger than one. Another cool thing is that it’s a one-tail test (see why?), which gives us additional power for the same α.

OK, fine, you knew all that. But sometimes it’s fun to just sit back and appreciate elegance when you find it.

Our null hypothesis for this ANOVA is:

H0: μ1 = μ2 = μ3 = μ4 = μ5

The alternative hypothesis is that the null statement is not true somehow.

First I check for normality, because that’s one of the assumptions in ANOVA.

Material

n

(A-D)A²*

p

(S-W)W

p

(L-M)r

p

Skew.

p

Kurt.

p

1.00

8

0.463

0.266

0.902

0.301

0.166

0.796

0.190

0.798

-1.301

>.10

2.00

8

0.654

0.089

0.859

0.117

0.701

0.161

-0.820

0.271

-0.924

>.10

3.00

8

0.392

0.396

0.897

0.274

0.000

1.000

0.000

1.000

-1.456

>.10

4.00

8

0.337

0.530

0.933

0.542

0.034

0.958

0.083

0.911

-0.438

>.10

5.00

8

0.777

0.043*

0.809

0.036*

0.750

0.111

-1.113

0.137

0.291

>.10

There’s one material that might not be normally distributed, as indicated by the Anderson-Darling and Shapiro-Wilk tests. ANOVA is fairly robust to departures from normality when n is large, but is highly affected by outliers. I reviewed a histogram of the data and found that, while it might be skewed, there are no outliers, therefore we are probably safe with ANOVA. Just to be sure, I ran a Kruskall-Wallace nonparametric test, which confirmed the results.

So we perform an ANOVA on our gear data, and generate something like this:

ONEWAY ANOVA

wear by material [1 to 5]

Source
dfSS
MS
F
p
Between 4 700.1500 175.037546.324
0.000*
Within 35 132.2500 3.7786
Total 39 832.4000

Fixed Effects Analysis:
ω² = 81.92%

I’m going to talk about dispersion analysis in March, so for now let’s assume we have equal variances within the five materials. The p-value on the end of the ANOVA table is the probability of getting an F-statistic of 46.324 (or more extreme) from an F-distribution with 4, 35 degrees of freedom and an average of 1, which is pretty dang unlikely. So we reject the hypothesis that all the averages are equal, and conclude that the different materials do in fact influence the gear wear. The ω2 number is an estimate of the percentage of the total variation explained by differences in material, so clearly those differences are large compared to the sampling error.

But now what? We’ve found a significant difference, and if you refer to the alternative hypothesis, you’ll notice that ANOVA doesn’t tell you where the differences are. Now we enter the world of post-hoc analysis. (Post-hoc just means “after the fact.” Ham hock is something else entirely, so stop drooling.) We will delve into this realm next month, when our managers intrude their reality on our nice antiseptic ANOVA. Then I will show you something that you might not have seen before that could save you oodles of money.

Then again, I could be wrong.

Thanks to Mike Petrovich, for his program MVPstats, which makes these types of analyses fast and easy. Mike now has a shareware version of his flexible SPC program available for download.

 

Random Testimonial

After reviewing several online courses, I finally told myself to make a decision based on the information I had.  I could have easily spent another 6 months reviewing my many options, but I really wanted to get started on a course this summer while I had a little more free time.  Your course definitely stood out among the others - especially in the area of personal mentoring which is very important to me.

My education includes a bachelor's degree in chemical engineering.  Even though I have over 10 years experience, I have not been a practicing engineer for the past 14 years (have been teaching).  I would like to get back into engineering and thought the black belt training would be very beneficial.  My goal is to become certified.

The optional slide guide will definitely be a bonus and time saver.  Taking notes with the guides saved 40% of the time required to view today's session!  

I appreciated your suggestion on nonprofits as an option for a project.  That opens up a lot of opportunities I never considered.  There are a few local organizations that we support - would love to consider offering my time to benefit their causes.

--CG

 


Our Economic Stimulus Package: $200 Off (click to learn more)

During this recent economic downturn, we have been contacted by a number of people who are looking to add more qualifications to their resume, either to increase their value at their current job or open some options as they seek employment after a layoff, as well as businesses looking for training with a high return on investment.  Six Sigma Online would like to do what little we can to help you out if this is your situation.

Six Sigma Online already offers one of the lowest price ways to get your Black Belt while still offering much greater depth of knowledge than you can get anywhere else.  In addition, for a limited time we will offer you a 10% discount to make it that much easier to get the skills you need to achieve your employment objectives.  Just go to the order page and it is automatically deducted during the time of this offer.

Businesses right now desperately need to find ways to improve profit and reduce the costs of quality.  Black Belts are the "edge of the blade" in this endeavor and so are in demand.  Our training will maximize the success of your projects with knowledge you will need as you encounter real-world problems to solve.  Here at Six Sigma Online, we hope that our training, and the discount above, will go some way to help individuals and businesses survive these tough times.

 

Heretic Articles

This article was originally published in Quality Digest. Subscribe to Quality Digest if you would like to receive these articles when they are published, or you can get an RSS feed that is updated two weeks after they are first published.

Unemployed in Colorado? Get state help for your Six Sigma training!

On October 15, 2010 The ROI Alliance (Six Sigma Online's parent company) became an approved provider of Six Sigma Black Belt and Master Black Belt training for Colorado's Workforce Investment Program (WIP), which helps people who are unemployed pay for the training they need to succeed in this competitive market place.  If you are looking for a job that is in high-demand, and you meet the requirements of the WIP, they may be able to help pay for your Six Sigma training!

 


Random Heresy

Random sampling distribution are really something delightful

Last month I wrote about how the random sampling distribution (RSD) of various sample statistics are the basis for pretty much everything in statistics. If you understand RSDs, you understand a lot about why we do what we do in hypothesis testing, inferential statistics, and estimation of confidence intervals. Understanding RSDs gives you a huge advantage as you seek to use data in business, so let's take a closer look.

Read more...

In the News

Six Sigma's lead instructor Steven Ouellette wrote an article with Dr. Jeffrey Luftig on "The Decline of Ethical Behavior in Business."

 


 

Six Sigma Online's lead instructor Steven Ouellette was profiled in the June 2008 issue of Quality Digest magazine. If you want to learn more about Steve's peculiar view of the world, as well as what he studied for a year in Europe, read the profile online.