Analysis for more than 2 group
Six Sigma – iSixSigma › Forums › Old Forums › General › Analysis for more than 2 group
 This topic has 4 replies, 4 voices, and was last updated 12 years, 8 months ago by Szentannai.

AuthorPosts

September 22, 2009 at 6:05 pm #52680
Dear All,
I have question, I have data with more than 2 groups, but my problem is some of them only have 1 sample, I want to see the difference of mean between group. I tried to use One way ANOVA but there was warning that I cannot continue for Post Hoc test because some of my groups only have 1 sample. Should I use non parametric statistic to test mean comparison?0September 23, 2009 at 8:39 am #185609why can’t you get more data and do the analysis correctly?
0September 23, 2009 at 10:19 am #185618Because I dont have more samples, this is for genetic research, so it is based on how many patients that I get for this research.
0September 23, 2009 at 11:48 am #185621
Robert ButlerParticipant@rbutler Include @rbutler in your post and this person will
be notified via email.If you are going to compare means then you need sufficient data to compute means in the first place. This is true regardless of test method. Your second post would suggest one of the following: you are working with a problem that is very rare in the population, you didn’t do a search for prior art to understand the issues concerned with your choice of groups (enrollment times, exclusion criteria, etc) or you are under pressure to “produce something”.
Reasons for sparse population of groups aside, if you have to produce something you will have to get out the old paper and pencil and grind it out by hand. One possible approach would be the following:
First – take those groups that have more than one sample and run an ANOVA on them – you will, of course, want to test for variance equality. If the variances are equal – generate an estimate of the pooled variance, pretend the numbers corresponding to your groups with a single sample are mean values and then manually run posthoc pairwise “means” tests using the actual mean values for those groups with actual means and the pretend mean values for those groups with a single entry.
You will have to invoke a multiple comparison correction and since the actual calculations for this, given your circumstances, would be of dubious value the simplest approximation to this would be a simplified Bonferroni correction – take the number of comparisons you want to run (you will have to decide this before you run the tests) generate the individual pvalues for each of the pairwise tests and multiply that value by the number of predetermined comparisons.
If the variances are unequal you could again pool their values and run the above or you could be conservative and use only the largest variance for your calculations. Given that you are pretending that a single measure is representative of the group mean the latter choice could be viewed as a way for compensating for this rather large assumption.
If the variances are unequal you could also run Welch’s test for unequal variance for those groups with more than one value. The problem would be how to manually modify the test to run such a test for single entry groups – I suppose there might be some kind of workaround but if it exists it something of which I’m not aware.0September 23, 2009 at 11:48 am #185622Hi santoso,
this seems to be pretty much hopeless. If you only have one sample point in a group, you have no way of judging how extreme or how typical that one measurement is, so basically you can not draw any conclusions from it.Can you, maybe aggregate some of the groups? E.g. instead of having cities with one sample from each, maybe look at regions, with several cities in each region, that sort of thing?Of course best would be to get more data. Not having enough to do a meaningful analysis could be a powerful argument. Regards
Sandor0 
AuthorPosts
The forum ‘General’ is closed to new topics and replies.