Turning Judgment Calls into Reliable Data with Gage R&R

    By Michael Mueller

    One of the biggest challenges in improving transactional processes is getting data that can be relied upon.

    There is an abundance of categorical data in transactional areas – situations where a judgment call is required: Is something right or wrong? Is the application complete? What type of error was made on the request form? Six Sigma project teams often use whatever data they can gather on issues like these without questioning its reliability. That is a mistake.

    An often-overlooked tool in the Lean Six Sigma toolbox, Gage R&R, can help improve data reliability. It is a method for checking the reproducibility of a measurement system (how closely data from different data collectors match) and repeatability (the likelihood that measurements taken by the same person at different times will match). Gage R&R has been more commonly applied to evaluate continuous data gathering with a measurement instrument of some sort, but the basic approach works extremely well for "judgment call data" in financial services.

    Here are two examples that provide insight into using Gage R&R:

    Case 1: Validating General Ledgers

    A Black Belt at a global manufacturing and service company was assigned a project to streamline and reduce the cycle time for validating general ledger accounts. As is typical of these situations, about eight auditors were involved in reviewing the accounts and making two critical judgment calls: 1) Was the ledger prepared correctly? 2) If not, what was wrong with it?

    Well-trained in Six Sigma, this Black Belt knew that she could not proceed very far into the project unless she had confidence in the data. So one of her first steps was to perform a Gage R&R test. The Black Belt first had an expert auditor review 10 accounts to establish the "master" values for each ledger. (Was it accurate? If not, in what way was it wrong?)

    She then had four auditors from the department also review the accounts using their standard procedures. These reviews were compared against the master and scored accordingly. (Did the person reach the right decision about pass/fail? And if it failed, did he or she give the correct reason why?) Two weeks later, she repeated the scoring exercise using the same four auditors and the same 10 ledgers. This would allow her to gauge repeatability. The results were surprising:
     
    Repeatability was only 50 to 60 percent. That meant almost half the time each auditor got a different result when they scored the same ledger two weeks apart.

    Reproducibility was 40 to 70 percent. In 30 to 60 percent of the cases, the auditors did not agree with each other.
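The scoring behind figures like these can be sketched in a few lines of Python. This is a minimal illustration, not the Black Belt's actual worksheet: the pass/fail ratings below are invented, the auditor names are placeholders, and the choice to count an item as "reproduced" only when every auditor gives the same rating is an assumption (pairwise agreement is another common convention).

```python
def repeatability(trial_1, trial_2):
    """Share of items one appraiser rated the same way in both trials."""
    matches = sum(a == b for a, b in zip(trial_1, trial_2))
    return matches / len(trial_1)


def reproducibility(ratings_by_appraiser):
    """Share of items on which all appraisers gave the same rating.

    Assumption: an item counts as agreed only if every appraiser matches.
    """
    per_item = list(zip(*ratings_by_appraiser.values()))
    agreed = sum(len(set(item)) == 1 for item in per_item)
    return agreed / len(per_item)


def accuracy(ratings, master):
    """Share of items where an appraiser matched the expert's master value."""
    return sum(r == m for r, m in zip(ratings, master)) / len(master)


# Hypothetical data: 10 ledgers, "P" = pass, "F" = fail.
master = list("PFPPFPFPPF")           # expert auditor's master values
auditor_a_week_0 = list("PFPFFPFPPP")
auditor_a_week_2 = list("PFFFFPPPPF")
auditor_b_week_0 = list("PPPFFPFPFP")
auditor_b_week_2 = list("PPPFFPFFFP")

rep_a = repeatability(auditor_a_week_0, auditor_a_week_2)
rep_b = repeatability(auditor_b_week_0, auditor_b_week_2)
repro = reproducibility({"A": auditor_a_week_0, "B": auditor_b_week_0})
acc_a = accuracy(auditor_a_week_0, master)

print(f"Repeatability, auditor A: {rep_a:.0%}")    # 70%
print(f"Repeatability, auditor B: {rep_b:.0%}")    # 90%
print(f"Reproducibility, week 0:  {repro:.0%}")    # 80%
print(f"Accuracy vs. master, A:   {acc_a:.0%}")    # 80%
```

The same functions work for the "reason for failure" judgment if each rating is a category label instead of pass/fail.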

    Acceptable levels of repeatability and reproducibility vary depending on the situation, but in this case the target was 80 to 90 percent. Obviously this group had some work to do. Based on her experience with "judgment call data," the Black Belt knew that the root cause was likely in the operational definitions used to make decisions. When the auditors looked at the operational definitions, they realized this was definitely the case. The definitions of what constituted an acceptable ledger and what constituted an "error" were so vague that it was no wonder people interpreted them differently.

    In measurement systems involving discrete (categorical) data, the goal is to have operational definitions that allow any item to be put into one, and only one, category. There cannot be any overlap between categories, and it must be clear how to decide what category something goes into. In this case, the Black Belt helped the team of auditors refine the definitions of various accounting mistakes. They then repeated the Gage R&R exercise and scored numbers well over 80 percent.
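The "one, and only one, category" rule can be spot-checked mechanically when the definitions can be written as explicit rules. The sketch below is illustrative only: the category names, the 100-unit threshold, and the idea of classifying a ledger by its unreconciled variance are all hypothetical, not taken from the case. It flags any sample item that matches no category or more than one.

```python
def matching_categories(item, definitions):
    """Names of every category whose rule accepts the item."""
    return [name for name, rule in definitions.items() if rule(item)]


def find_ambiguous(items, definitions):
    """Map each item that fits zero or several categories to its matches."""
    problems = {}
    for item in items:
        matches = matching_categories(item, definitions)
        if len(matches) != 1:
            problems[item] = matches
    return problems


# Hypothetical sample: unreconciled variance per ledger, in currency units.
sample_variances = [0, 50, 100, 250]

# Vague definitions: a clean ledger fits nothing, a large variance fits both.
vague = {
    "has_variance": lambda v: v != 0,
    "material_error": lambda v: abs(v) > 100,
}

# Refined definitions: the ranges do not overlap and leave no gaps.
refined = {
    "clean": lambda v: v == 0,
    "minor_variance": lambda v: 0 < abs(v) <= 100,
    "material_variance": lambda v: abs(v) > 100,
}

vague_problems = find_ambiguous(sample_variances, vague)
refined_problems = find_ambiguous(sample_variances, refined)

print(vague_problems)    # {0: [], 250: ['has_variance', 'material_error']}
print(refined_problems)  # {}
```

A check like this only covers the sample items supplied, so it complements rather than replaces having the appraisers discuss and rerun the Gage R&R exercise.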

    Case 2: Classifying Calls to a Call Center

    Call centers are probably one of the most data-rich environments in any company. Most call centers already track data like the duration and reason for the call. The Black Belt working in a call center for one company thought he had a leg up on his project because he could quickly construct a Pareto chart on "reason for the call" based on existing data. Surely that meant he could skip the Measure phase of DMAIC and go right into examining the reasons for the largest bars on the Pareto.

    Fortunately, a Master Black Belt suggested he step back and do a Gage R&R study on classification of calls before proceeding. So he selected a few of the people responsible for reviewing calls. (These call appraisers are the reason for the recorded message, "This call may be monitored....") The Black Belt had the call raters review a set of taped calls and evaluate whether the phone operator classified it correctly. That is, whether the call was forwarded to the right group. The exercise was repeated with the same call appraisers a few weeks later.

    Much to the Black Belt's shock, the gage scores showed only 40 to 60 percent repeatability. That meant the raters changed their decisions about half the time. The adjacent figure illustrates the problem. Reproducibility had similarly low ratings. (The call appraisers did not agree with each other, either.) By implication, that meant that all the historical data on "call classification" was useless from a DMAIC viewpoint.

    To fix the situation, the Black Belt followed the same approach as the Six Sigma practitioner in Case 1. He got the raters to discuss the definitions they used to classify calls, refine and rewrite them. Then he ran the tests again. Though the Gage R&R scores did improve in the second round, they were still unacceptably low. So the process was repeated and the definitions were revised again. Finally the gage scores rose to acceptable levels.

    Conclusion: Lessons About Gage R&R

    The Black Belts in both of these companies learned a valuable lesson – never assume that a set of data can be relied upon unless it is proved to be trustworthy. Trying to calibrate what is essentially expert judgment can be tricky. The auditors in the first company, for example, were initially quite resistant to the proposed exercise. After all, what could someone who had never taken an accounting class tell them about auditing? But a strong project sponsor said, "We're going to do this." And it got done. Once they saw the results, the auditors' professional instincts kicked in and they worked well together to discuss their differences and develop improved definitions.

    Measurement systems analysis (MSA), the broader discipline to which Gage R&R belongs, provides more complicated techniques to evaluate the reliability of data. But as the cases cited here show, relatively simple experiments where people compare their decisions can work well. This technique can be used easily by Green Belts or Black Belts.

    Belts and other Six Sigma project team members also need to remember that evaluating data is a process, not a single event. Measurement systems tend to degrade over time. Thus it is important to regularly assess measurement systems to validate that they continue to provide reliable data – whether those systems are people's judgment calls or measuring instruments.

    About the Author: Michael Mueller is a Master Black Belt at George Group with extensive Lean Six Sigma experience in a variety of businesses. He has applied Lean Six Sigma and business process management in areas such as sales and marketing, acquisitions integration, accounts receivable, card services, and regulatory compliance. He can be reached at mmueller@georgegroup.com.

     
    Copyright © 2000-2009 iSixSigma – All Rights Reserved