Random Variation

If you’re interested in the statistical concepts surrounding random variation, we will provide the statistical definition and explore how it might apply to your organization.

For those more interested in what it means in practical terms, we will explore the definition and application in terms of its benefits and how it can be used to better manage your organization.

Overview: What is random variation?

One of the best definitions for random variation appears in the dictionary of the iSixSigma.com website:

The tendency for the estimated magnitude of a parameter (e.g., based upon the average of a sample of observations of a treatment effect) to deviate randomly from the true magnitude of that parameter. Random variation is independent of the effects of systematic biases. In general, the larger the sample size is, the lower the random variation is of the estimate of a parameter. As random variation decreases, precision increases.

In other words, everything varies, whether it be the dimensions of your product, your personal weight, your manufacturing processing time, the time to get to work, or your blood pressure. Over time, you would expect the variation of those measurements to form some kind of statistical distribution that would approximate the underlying population of whatever you are measuring.

That underlying distribution will have a calculated central tendency, variation, and shape. At any point in time, the measurement you take will vary and can come from any place in that distribution. If there is random variation, you will not be able to predict the exact value of the next measurement. You might be able to calculate the probability of what the next value might be — or even calculate a range of values within the next measurement might fall. We can call that a confidence interval.

How random variation affects your processes

While the statistical properties are interesting, what might be more important for you is how the concept of random variation impacts your ability to manage your process. If your process is exhibiting random variation, or what Dr. W. Edwards Deming called common cause variation, then your process is predictable and in what might be called a steady state. Deming distinguished common cause from special cause. Special cause variation is unpredictable and a function of some unexpected intervention in your process.

For example, the fill level of your bottle will have some variation as a function of the variation in your fill equipment, liquid, temperature, and run speed. That is the steady state given the combined effects of the variation in your process elements. It is expected and, over time, will form some distribution.

However, if one of your fill nozzles starts to clog up, there will be variation in fill that is a function of a specific and assignable cause. That would not be expected or predicted until after its occurrence. That would be non-random variation — or special cause variation.

You can use a control chart to distinguish between a random (common cause, predictable, noise) variation and a non-random (special cause, unpredictable, signal) variation.

3 benefits of paying attention to your variation

Knowing whether your process is exhibiting random or non-random variation will help you properly respond to the signal you receive from your control chart.

1. Proper response

If your process is exhibiting random variation then any improvement will require a fundamental change in the process. If the process is exhibiting non-random variation, then you will need to identify the reason for that assignable cause and then take action to either eliminate or incorporate changes to maintain an improved state or eliminate a negative impact.

2. Predict

If you are taking sample measurements and the process is demonstrating random variation, you’ll be able to do some level of prediction of future values.

3. Assess changes

If your process is demonstrating random variation and you make a change, you will have confidence that, if you see an impact due to your change, it will be real and believable.

Why is random variation important to understand?

The concept of random variation, or noise, is a central concept in statistics. You will want to understand what random variation is and its implications for taking the appropriate actions on your process.

Underlying assumption

Most statistical tests will have an underlying assumption that the data you’re analyzing was created by a random process. If not, your results may be inaccurate because of the influence of non-random variation.

Desired state

You should strive to achieve random variation in your processes. Random variation does not imply that everything is OK or good, but merely that the process is predictable and steady state. From there, you will want to evaluate whether that steady state is satisfactory or needs to be improved.

For example, why do you think your doctor wants you to fast before a blood test? Is it to be mean (especially if your appointment is in the afternoon)? No, your doctor wants you to only exhibit random variation in your body processes and not have the influence of special cause variation, so your test results can be considered representative of your true steady state. That doesn’t mean an elevated blood pressure is good, but at least your doctor knows that it exists. From there, he or she can have the proper response.

Improper response

Unless you have a good understanding of random variation, you may inadvertently believe you have non-random variation when you don’t. This would cause you to try and find an assignable cause when none exists, or make changes as a result of an individual observation that would be tantamount to tampering with the process.

An industry example of random variation

Unfortunately, many managers don’t understand or appreciate the concept of random variation. For example, a manager in the finance department of a B2B online business was getting complaints from the CFO that invoices were slow getting out to the customer, and thus cash flow was being negatively impacted.

The LSS Master Black Belt (MBB) investigated and found out that, as a result of their LSS training, the manager was control charting the invoice processing time. That was a good thing. When the MBB started questioning the manager how he uses the control chart, he realized what the problem was. The control chart had all of the points within the upper and lower control limits so the process was demonstrating random variation.

The manager was reacting to high and low points without appreciating whether the process was exhibiting common or special cause variation. It turned out that when the manager saw a “high” point on the control chart he initiated a search for the root cause. And when he was happy with a “low” point, he didn’t do anything except to say “Great job!”

An example chart showing variation in process time

The manager should have realized that the process was stable and showing random variation so the appropriate response should have been to change the process to reduce the overall variation — and if desired, to lower the average processing time.

3 best practices when thinking about random variation

To manage your process by properly using the concept of random variation, you should consider the following best practices.

1. Collect your data in a random manner

To get a picture of the true random variation of your process, you should collect your data in a random manner. Introducing any bias in your data collection will impact the randomness of your variation.

2. Use the appropriate statistical tools to determine if you have random variation

As has been explained before, the statistical control chart is the best tool for determining whether your process is generating data in a random pattern or not.

3. Provide a proper response

You should react to random variation by seeking to improve your process if it’s not capable of meeting your specs, targets, or expectations. If you have non-random variation, you will need to investigate why and then take the appropriate steps to either incorporate or eliminate the reasons why.

Frequently Asked Questions (FAQ) about random variation

What is an example of random variation vs. non-random variation?

Let’s use a pair of fair dice as an example. If we throw our dice many times, we will experience variation in the numbers we throw. If we threw them even more times, we would get a distribution with an average of 7, a range of 10 (12-2) and a shape that is triangular. That is the hypothetical distribution.

But what if we started to see throws of 8, 7, 9, 10, 9, 12, 11, and 10. They are all above the average. We might be suspicious that this is not random variation. We would investigate and possibly find that the dice are loaded. We would then seek to correct the situation if we wanted the dice to represent random variation.

What is the best way to know if we are seeing random variation?

The statistical control chart is the best tool for distinguishing between random and non random variation.

Must I always react to random variation?

If your process is showing random variation and is operating at a desired level, there is no need for you to react. But if you wish to improve your process, you’ll want your process to be in a steady state of random variation. That way, when you observe a change, you can attribute it to what you did rather than some unknown source.