# DOE: Reducing the Model

Rainman
When we reduce the model in a DOE, we cannot take out an insignificant main effect if that main effect is part of a significant interaction that we are leaving in.
If we have a significant three-way interaction that we are leaving in the model, say A*B*C, do we also have to leave in all the contained two-way interactions (A*B, A*C, B*C) even if they are not significant? Obviously all three of the main effects would have to be left in.

Robert Butler
While there is a school of thought that insists you must keep an insignificant main effect in a model when an interaction involving that effect is significant, there is also a school of thought that disagrees with this practice.
With the exception of the issues of full and reduced models in mixture designs, I prefer the second method, so my answer to your question is: keep only those terms that are significant and disregard the rest.

melvin
Coming from the opposite school of thought – I can’t see ignoring the hierarchy of variables, i.e., leave the A, B, C, AB, BC, and AC in.
If you make the necessary interaction plots, they clearly indicate the importance of the variables.
Bob

Sorry about all the posts – had some problems when I was writing it!
The answer is simply no – you do not have to keep insignificant interactions in the model.
Let’s say you are using Minitab to analyse a 3-factor (A, B & C) design with the following results:
Factor    p-value
A         0.98
B         0.54
C         0.03
A*B       0.99
A*C       0.33
B*C       0.76
A*B*C     0.00
A*B*C turns out to be statistically significant, so all factors A, B and C have to be retained in the model even if they are not significant. This is because Minitab needs them to be included if it is to calculate the 3-way interaction. However, all the 2-way interactions are insignificant and can be removed. The 2-ways are not sub-sets of A*B*C.
Once the model has been reduced we’d end up with the following in Minitab:
Factor    p-value
A         0.98
B         0.54
C         0.03
A*B*C     0.00
A and B are not significant but are kept in to allow A*B*C to be determined.
Now, when you build the Y=f(x) model from the coefficients, you only use the significant terms:
Y = Constant + Ccoefficient*C + ABCcoefficient*A*B*C
What the practical significance of each of the terms is, is another question.
I hope this helps and again, sorry for all the incomplete posts I logged.
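To make the prediction equation in this post concrete, here is a minimal Python sketch in coded units (-1/+1). The three coefficient values are made up purely for illustration; they do not come from the thread or from any Minitab output.

```python
from itertools import product

# Hypothetical coefficients in coded (-1/+1) units; NOT taken from the thread.
CONSTANT = 10.0
C_COEF = 2.0
ABC_COEF = 1.5


def predict(a, b, c):
    """Evaluate Y = Constant + C_coef*C + ABC_coef*A*B*C at coded factor levels."""
    return CONSTANT + C_COEF * c + ABC_COEF * a * b * c


# Predicted response at each of the eight corners of the 2^3 design.
for a, b, c in product((-1, 1), repeat=3):
    print(a, b, c, predict(a, b, c))
```

In this form A and B influence the prediction only through the A*B*C term: flipping the sign of A (or B) flips the sign of the three-way contribution and changes nothing else.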

Mikel
Go try that in Minitab 14 – it does not work.
And again, why, except to get Minitab to do the analysis, do you want to include insignificant mains or 2-ways? Is it not possible for a coefficient to be equal to 0?

Stan,
I don’t follow you.   Please note that you responded to an incomplete message from me.   If you want to add your insight, then please respond to message 62713 which was my complete one.
Thanks

Mikel
Read the message – no insight.
Minitab 14 does not work the way you say – cannot remove 2-ways and leave 3-ways.
Modeling does not require leaving mains or 2-ways.

Participant

Just did it in Minitab 14.1 and it worked fine!

Mikel
In Minitab 14.1, I have a 3-factor experiment where I have a statistically significant 3-way interaction and some insignificant 2-way interactions. If I try to take 1, 2, or all 3 of the 2-ways out, I get a message that says:
MINITAB: General factorial model is non-hierarchical. [OK]
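For readers unsure what "non-hierarchical" means in that message, the following Python sketch lists the lower-order terms a hierarchical model would also contain. It is only an illustration of the concept, with hypothetical function names; it is not how Minitab implements the check.

```python
from itertools import combinations


def missing_lower_order_terms(terms):
    """Return lower-order terms that a hierarchical model would also need.

    Each term is a string such as "A" or "A*B*C".
    """
    present = {frozenset(t.split("*")) for t in terms}
    missing = set()
    for term in present:
        # Every proper, non-empty subset of a term's factors should be present.
        for size in range(1, len(term)):
            for subset in combinations(sorted(term), size):
                if frozenset(subset) not in present:
                    missing.add("*".join(subset))
    # Sort by order of the term (number of factors), then alphabetically.
    return sorted(missing, key=lambda t: (t.count("*"), t))


def is_hierarchical(terms):
    """True when no lower-order term is missing."""
    return not missing_lower_order_terms(terms)


print(missing_lower_order_terms(["A", "B", "C", "A*B*C"]))
# ['A*B', 'A*C', 'B*C']
print(is_hierarchical(["A", "B", "C", "A*B", "A*C", "B*C", "A*B*C"]))
# True
```

Under this definition the reduced model A, B, C, A*B*C is non-hierarchical, which is the situation the dialog box describes. Whether the software enforces the rule may depend on which analysis routine is used.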

Participant

Strange.   I just repeated the test on a 3 factor DOE (full factorial) with only the 3-way interaction significant.   I can take out any or all the 2-ways without any complaint from MTB.   I’ve never seen the message you mention.

BeenThereDoneThat
If you have a main effect for A of zero with a significant AB effect, there is most likely a lurking variable that was missed. The conclusion from the DOE will be that you can explain the data, but not the experimental reality. Social scientists deal with this kind of situation constantly when they gather data that is observable and make conclusions based on unobservable, lurking factors such as ‘empathy’ or ‘compassion’ by conducting factor analysis.
Paddy:
Correct – when you have ABC, include A, B, and C. You usually don’t need AB, BC, and AC. Some partial factorial designs might give you results that don’t make practical sense, but it is unlikely and depends on the design used.

Mikel
Nonsense, and don’t call me a cowboy again. Those are the Arizona non-Metrosexual guys.
There is a difference between analysis and modeling. Minitab even tells you so and uses multiple regression for the modeling. And this nonsense about a lurking variable – where do you come up with this stuff? By definition, a non-significant variable has a coefficient of 0 (or at least you can’t prove it does not). Use stepwise regression when deciding on your final model, with your adjusted R-squared being the determining factor.
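Since adjusted R-squared is proposed here as the deciding criterion, a short Python sketch of the standard formula may help. The run count and R-squared values in the example are hypothetical, chosen only to show how dropping terms that contribute little can raise the adjusted value.

```python
def adjusted_r_squared(r_squared, n_runs, n_terms):
    """Adjusted R^2 for a model with n_terms predictors (constant not counted)."""
    df_error = n_runs - n_terms - 1
    if df_error <= 0:
        raise ValueError("no error degrees of freedom left")
    return 1.0 - (1.0 - r_squared) * (n_runs - 1) / df_error


# Hypothetical numbers: 16 runs, full model with 7 terms vs. reduced model with 4.
full = adjusted_r_squared(0.920, n_runs=16, n_terms=7)
reduced = adjusted_r_squared(0.915, n_runs=16, n_terms=4)
print(round(full, 3), round(reduced, 3))
```

The penalty comes from the error degrees of freedom, so a reduced model with a slightly lower raw R-squared can still come out ahead on the adjusted figure.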

BeenThereDoneThat
Nonsense? I’m not sure what you mean.
I agree about the difference between analysis and modeling. I see analysis as the part from the mathematicians talking about tasting tea. The modeling part is more the practical side – what makes sense in the light of the experimental setup and the real situation.
MINITAB is great software and the defaults are well designed to make the software a good practical tool, but you can still run a t-test that violates the essential assumptions of the test.
This is not about the size of the coefficient – hence my point that this will make little practical difference to the model.
Errors in the number of degrees of freedom for alternate models will alter the mean-square-error, the p-values and r-factors, making Hamilton r-factor ratio tests a bit dicey. Stepwise regression is a bit of an art – not for people who are just learning DOE.
When you leave the main effects in the model, you can’t go wrong. If you neglect them automatically you could miss something important. If you propose a model with a 3-way interaction without acknowledging the independence of the 3 variables, then you are proposing a situation where they are not independent – they are likely linked to a variable not included in the experiment or the model.
I don’t like hand waving when it comes to the maths.
The cowboy metaphor was for fun and not directed at any active member of the forum. I’m glad someone picked up the reference to that well-known…. what was his name?

Ken Feldman
DrSeuss
Rainman,
The beauty of the DOE is that if you started your analysis with an orthogonal array, namely an arrangement of factor combinations established using typical software such as Minitab, then the independence of your factors and interactions is guaranteed. Basically, each main effect and interaction effect is independent of the others (one does not depend on the others being significant or not). Keep the main effects and 3-way interaction and kill the insignificant 2-ways.
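The orthogonality argument can be checked directly for a 2^3 full factorial. The Python sketch below builds the coded columns for all seven terms, confirms that every pair of columns has a zero dot product, and then estimates coefficients from a response. The response values are hypothetical and noise-free, generated from an assumed model, so treat this as an illustration rather than an analysis of real data.

```python
from itertools import combinations, product

# 2^3 full factorial in coded (-1/+1) units, one run per factor combination.
runs = list(product((-1, 1), repeat=3))
terms = ["A", "B", "C", "A*B", "A*C", "B*C", "A*B*C"]


def term_value(term, run):
    """Value of a model term (e.g. 'A*B') for one run."""
    levels = dict(zip("ABC", run))
    value = 1
    for factor in term.split("*"):
        value *= levels[factor]
    return value


columns = {t: [term_value(t, r) for r in runs] for t in terms}


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


# Every pair of distinct term columns should have a zero dot product.
all_orthogonal = all(
    dot(columns[s], columns[t]) == 0 for s, t in combinations(terms, 2)
)
print(all_orthogonal)  # True

# Hypothetical noise-free response from an assumed model: 10 + 2*C + 1.5*A*B*C.
y = [10.0 + 2.0 * c + 1.5 * a * b * c for a, b, c in runs]


def coefficient(term):
    """Least-squares coefficient of one term; valid because columns are orthogonal."""
    return dot(columns[term], y) / len(runs)


print({t: coefficient(t) for t in terms})
```

Because the columns are mutually orthogonal, each coefficient estimate is the same whichever other terms are kept or dropped. That holds for the full factorial shown here; in a fractional design some terms are aliased with each other, so the same check would fail for those pairs.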

