![]() |
|
| Home > Statistics > Data / Sampling / Descriptive Statistics | Search: | for |
|
Interpreting Anomalies Correctly Can Help Avoid Waste
B People say that Six Sigma is sometimes like using a rocket ship engine in an automobile. The techniques and statistical software tools are so powerful they can lead to anomalies in the data or produce “bad” results. These include:
Without the benefit of extensive project experience and strong statistical interpretive skills, a Green Belt or even Black Belt may improperly diagnose these results, leading to poor decisions. Poor decisions waste time, resources and capital – specifically through re-sampling data, re-analyzing data and improving processes not in control. Fortunately, there are some reliable, quick and cost-effective techniques that Six Sigma professionals can use in these situations to help them more properly analyze the data to make better decisions. Histogram AnomaliesA practitioner has collected some data that represents average call-handling time for a help desk (Table 1).
Using a statistical software package, the practitioner produced the following histogram (Figure 1).
At first glance, the practitioner might conclude the data is not normally distributed. It does not have the classic bell-shaped, symmetrical curve. They may decide to collect more data or even perform another data sampling. But before going to such extremes practitioners should try calculating the measures of central tendency. In other words, determining how the data clusters or centralizes around particular values. There are three measures of central tendency:
If these three values are equal or approximately the same, it is possible to conclude with a fairly high degree of confidence that the data is normally distributed. A quick calculation of the mean, median and mode in this case show they are 7.3, 7.0 and 7.0, respectively. Thus, the data is normally distributed. Another simple but effective technique is to increase or decrease the number of classes in the histogram. In this example, reducing the class size from 6 to 5 produces the following normal histogram (Figure 2).
Sometimes practitioners will produce histograms that are skewed in either the positive or negative direction. Before jumping to conclusions and re-sampling the data, think about the type of process being measured. A positively skewed histogram could represent accounts receivables days outstanding or late deliveries, in which case a practitioner should expect the distribution to be one-sided with a tail to the right. Conversely, a negatively skewed histogram could reflect accounts payable days outstanding, where a one-sided distribution with a tail to the left could be expected. Scatter Plot AnomaliesSimple linear regression is a great statistical tool to determine if there is a relationship between two variables. A scatter plot graphically shows this linear relationship through a straight line connecting the data points. The equation for that linear relationship can be determined as follows: Y = mx+ b, where Y is the dependent variable, m is the slope, b is the intercept and x is the independent variable. If there is linear correlation (i.e., a straight line) it is possible to make predictions about Y given x. Correlation (r) ranges from +1 (perfect direct or positive relationship) to -1 (perfect indirect or negative relationship). Further, R2 (coefficient of determination) is the percent of variation that can be explained by the regression equation. A higher R2 value means that x is a better predictor variable of Y, and high correlation indicates a strong relationship. Taking the square root of R2 produces r. Sometimes, however, the relationship between two variables may be represented by a curve instead of a straight line. Seeing the data is not linear, a practitioner may attempt to calculate more complex regression models such as polynomial, power or exponential functions. They might even conclude there is no strong relationship between the two variables, and, therefore, decide to select another input variable. Before going down these paths, practitioners should consider transforming the data to create a straighter line to fit the data. A simple yet highly effective approach to transforming the data into a straighter line is to square the x values and calculate new R2 values until reaching a point where the R2 is at a maximum. Control Chart AnomaliesControl charts are powerful techniques to determine if a process is in control or out of control. Six Sigma professionals are all familiar with the general rules of control charts:
Consider the control chart shown in Figure 3.
None of the data points are touching or exceeding the control limits. The process appears to be in control, in which case the Six Sigma practitioner should continue to gradually improve the process. But think again. Even though the points are within the control limits, this control chart shows a trend (six or more successive points in ascending or descending direction) that is evidence of a special cause. Consequently, the practitioner should stop, identify and eliminate any special causes before improving the process. Other examples of control charts exhibiting distinct patterns within the control limits are:
Here is another call center example: A practitioner is trying to assess how well calls are resolved during the first call, and they collect the data in Table 2. In this scenario, p represents the proportion of calls not resolved.
From this data, a p-chart is produced (Figure 4).
The practitioner’s first reaction is that the entire process is out of control. Virtually every data point is either below or above the control limits – this should raise red flags. They might want to stop to identify and resolve all the special causes before proceeding to improve the process. Or, the pracitioner may think there are errors in the data and decide to re-perform the collection sampling. But before doing so, it is important to look again at the data to see if there are any shifts occurring. In this example, there are two distinct sets of p values: Weeks 1-9 and Weeks 10-21 (Table 3). Accordingly, two separate p-bar values are needed: 0.10896 for Weeks 1-9 and 0.18236 for Weeks 10-21.
The resulting p-chart (Figure 5) indicates the process is in control during Weeks 1-9. During Week 10, however, there is a process shift that significantly increases the p values. During Weeks 10-21, the average proportion of calls not resolved hovers around 18 percent. Thus, efforts to identify special causes should be focused only on what happened during Week 10 and beyond, in order to save considerable time and resources.
Another example of a strange control chart, this time involving the number of calls per customer. The practitioner collects the following data over a period of 13 weeks (Table 4), and decides to use a u-chart because there could be more than one call per person.
Figure 7 shows the resulting u-chart. While the process appears in control, the control limits during the initial eight weeks are highly irregular and quite wide. During Weeks 9-13, the control limits appear more normal (regular and narrower).
Before jumping to conclusions that the data is invalid, the practitioner should consider the type of chart they are working with. U-charts (and p-charts) are control charts for data containing defects and defectives, respectively. The control limits for these types of charts are highly sensitive to the number of samples collected. Notice the average number of samples for Weeks 1 to 8 is nearly 17 calls. The average for Weeks 9 to 13 is 347 calls. The higher the sample number, the more regular and narrow the control limits. In this example, it appears the process had not reached a “steady-state” of calls until week 9. Consequently, the practitioner might consider using only the data for Weeks 9 to 13, rather than re-sample the entire 13 weeks. Again, this can save considerable time and resources. Making Proper InterpretationsPoor decisions waste time, resources and capital. This waste occurs in re-sampling data, collecting more data, re-analyzing data or improving a process that is not in control. Generally, the problem is not related to the tool itself, but how the results are interpreted. In the absence of extensive project experience, Six Sigma professionals can use the techniques described here to identify these anomalies and transform the data in order to make better quality decisions. About the Author: Peter J. Sherman is a Black Belt and quality engineer with 21 years of experience, including serving as senior Black Belt for AT&T’s Product Development Group. He has a master’s degree in engineering from the Massachusetts Institute of Technology (MIT) and an MBA from Georgia State University. As a visiting scholar to Japan while at MIT, he worked with quality expert W. Edwards Deming. Sherman is lead Instructor at Emory University’s Six Sigma Certificate Program in Atlanta, and is a member of the American Society for Quality (ASQ) and the International Society of Six Sigma Professionals (ISSSP). He can be reached at psherm1@bellsouth.net. Reproduction Without Permission Is Strictly Prohibited Copyright Requests Publish an Article: Do you have a Six Sigma tip, learning or case study? Share it with the largest community of Six Sigma professionals, and be recognized by your peers. It's a great way to promote your expertise and/or build your resume. Read more about submitting an article. "The Bottom Line" Links
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Home | Discussion Forum | Event Calendar | Job Shop | |
| Link To iSixSigma | Rate This Page | Report A Problem | Free Content For Your Site | Submit Article For Publishing | |
| Terms of Service. ©2000-2008 iSixSigma. All rights reserved. v3.0lb, 1.8-A-244 |
About iSixSigma · Contact Us · Privacy Policy · Site Map. |