Statit Support Articles
Administration
Frequently Asked Questions
How-to
ODBC
Sample Code
 
Quality Practice Tips

Subscribe to Statit Bulletin, our quarterly SPC/Quality Resource e-newsletter

Your Name:
E-mail:
Company:
 

The Xbar & R Chart Says You're in Control. Are you?


Robert F. Hart, Ph.D.
Marilyn K. Hart, Ph.D.

The purpose of process data analysis is to answer the question "might the observed data fluctuations reasonably be attributed to common-cause variation?"

The principal use of the Shewhart control chart is to guide efforts for process improvement by serving as an aid to the discovery of peculiarities in the data which suggest nonrandom, special-cause influence. Process data typically tend to wander somewhat over time so the long-term standard deviation estimate is greater than the short-term estimate. The time-ordered Xbar & R chart capitalizes on this by using small subgroups of data to obtain a smaller within-subgroup estimate of sigma than might otherwise be obtained. This is accomplished by taking observations that are homogeneous (as much alike as possible). The result is that the variation within the subgroups will be minimized and the variation between the subgroups will be maximized giving the tightest possible control limits and the greatest opportunity for discovering nonrandom special-cause variation. Hence, with a single-stream process (i.e., no parallel paths such as a multiple cavity die), a good sampling plan would be to take five consecutive pieces to make each subgroup. Subgroups should be taken at those times where expert knowledge of the process would give the greatest concern for manufacturing problems, such as:

1. At every startup and shutdown
2. Just before and after interruptions such as lunch breaks
3. Just before after shift changes
4. At all manufacturing or inspection personnel changes

Such a sampling scheme will provide the smallest possible variation within the subgroups and hence the greatest opportunity to give a statistical signal to serve as a roadmap for process improvement.

Unfortunately, improper sampling plans are sometimes used which tend to inflate the control limits. A serious error in the use of the Xbar & R chart is to blindly believe a control chart which indicates that a process is in control. This error is serious because it robs you of your "roadmap for improvement." Artificially "inflated" control limits may easily give a false indication that a process is free from nonrandom variation when such is not the case at all. This renders the control chart worse than useless -- no information is better than misinformation.

The usual cause of inflated control limits is the existence of a systematic within-subgroup pattern (e.g., the first reading of a subgroup is usually the highest within that subgroup). The problem of inflated control limits due to systematic within-subgroup stratification is so common that one of the Nelson's rules of evidence of lack of control deals with it: 15 points in a row inside the 1-sigma limits [1]. Subgroup stratification is even found in the published literature as shown in the following example from Ishikawa [2].

In the manufacture of resin parts, a critical measurement was made on five parts each day for 25 days. The first part was measured at 6:00 AM and another each four hours thereafter. The data were kept in time order to make an Xbar & R chart with the usual 3-sigma limits, having 25 subgroups (days) of size five (times of the day), Figure 1. It is noted in passing that the highest values on the Xbar chart and on the R chart both occurred on Day 19. With random normal data the Xbar values and R values are independent so the probability of such a coincidence is too small to attribute the signal to chance. However we will overlook this, as Ishikawa did, and proceed with the problem of inflated control limits.

Ishikawa concluded that the process was in a state of control with only common-cause variation arising from what appears to be random data. It will be shown that this conclusion was grossly in error, the result of inflated control limits, and that the mistake easily could have been avoided.


Figure 1. Resin parts data. Xbar & R chart subgrouped by day with subgroups of size 5.

Dr. W. Edwards Deming strongly advocated the use of run charts, simple time-ordered plots of the individual measurements, for data analysis. While we were conversing with Dr. Deming in 1980, one of his associates stated that a particular client "did not have enough sense to make a run chart before he made a control chart." The run chart for the resin data is shown in Figure 2 where it has been interrupted between days making the within-day pattern clearly visible. The run chart enables you to focus on peculiarities in the individual measurements which might suggest nonrandom special-cause influence.


F igure 2. Resin parts data. Run chart interrupted by day.

The most glaring feature of the run chart is that in 21 of the 25 days the startup measurement was the highest. Not random. The false message delivered by the Xbar & R chart was due to poor sampling, resulting in within-subgroup stratification and inflated control limits. The variation between times of the day, which should have been made part of the between-subgroup variation, was improperly made part of the within-subgroup variation, inflating the control limits and rendering the control chart impotent.

We have achieved our initial purpose for analysis. The observed data fluctuations can not be attributed to common-cause variation. But there is still considerably more to be learned from the interrupted run chart in Figure 2.

In addition to the nonrandom within-day variation, Figure 2 shows that there was also nonrandom between-day variation. A reference line can be opportunely be set at 13.65 skimming off the twelve highest points. Eleven of the twelve are 6:00AM startup measurements but the twelfth point is particularly interesting. This very high 10:00 reading occurred on Day 19, the day which also had the highest of the 125 readings and the highest range of 5 readings. Not random.

The numerical data for this example were without explanatory notations. Totally lacking is the sorely needed information on day of the week, date, periods when shut down, and the conditions under which the measurements were taken. Without this information one might reasonably suppose that the startup problem was most severe on Day 19, possibly after a prolonged shutdown. Clearly such conjecture after the fact is no substitute for complete annotation of the numerical data!

Finally, reconsider the R chart in Figure 1, noting that the highest subgroup range value (well within the inflated control limits) occurred on Day 19, the same day as the other problems above. It may now be seen that this range would very likely have been out of control had the limits not been so severely inflated.

An alternative to the interrupted run chart is to identify patterns in the extreme values of the data (having nothing to do with whether or not theses very high of very low points might be considered as statistical outliers). This has been done for the 25 by 2 array of the resin data in Table 2 where all values greater than 13.65 have been put in parentheses. The highest of the 125 values, the 6:00 reading on Day 19, has been put in double parentheses. The only value in parentheses that was not a 6:00 startup measurement was the 10:00 measurement on the same day. Not random. As a helpful variation of this technique, the highest (and again, the lowest) values in each subgroup of five may be identified.

The alternate approach of searching out patterns in the extreme data points gives results equivalent to the interrupted run chart. It is important to note that the ability of these two methods to discover systematic within-subgroup stratification is not hampered by the occurrence of even very large spikes of between-day variation. That is to say, these two methods work well in problems with "two-way" variation. This will not be the case in the next method of detecting within-subgroup stratification.

Day Number 6:00AM 10:00AM 2:00PM 6:00PM 10:00PM
1 (14) 12.6 13.2 13.1 12.1
2 13.2 13.3 12.7 13.4 12.1
3 13.5 12.8 13 12.8 12.4
4 (13.9) 12.4 13.3 13.1 13.2
5 13 13 12.1 12.2 13.3
6 (13.7) 12 12.5 12.4 12.4
7 (13.9) 12.1 12.7 13.4 13
8 13.4 13.6 13 12.4 13.5
9 (14.4) 12.4 12.2 12.4 12.5
10 13.3 12.4 12.6 12.9 12.8
11 13.3 12.8 13 13 13.1
12 13.6 12.5 13.3 13.5 12.8
13 13.4 13.3 12 13 13.1
14 (13.9) 13.1 13.5 12.6 12.8
15 (14.2) 12.7 12.9 12.9 12.5
16 13.6 12.6 12.4 12.5 12.2
17 (14) 13.2 12.4 13 13
18 13.1 12.9 13.5 12.3 12.8
19 ((14.6)) (13.7) 13.4 12.2 12.5
20 (13.9) 13 13 13.2 12.6
21 13.3 12.7 12.6 12.8 12.7
22 (13.9) 12.4 12.7 12.4 12.8
23 13.2 12.3 12.6 13.1 12.7
24 13.2 12.8 12.8 12.3 12.6
25 13.3 12.8 12.2 12.3 13

Table 1. Resin parts data. Xbar & s chart subgrouped by day with subgroups of size 5.

A third method of detecting within-subgroup stratification is to make an Xbar & s chart on the transpose of the data, Figure 3. In this example that means subgrouping the data by time-of-day rather than by day. This gives five subgroups of size 25 (days). An R chart is not suitable because the subgroup sizes are too large. 2.5 sigma limits are the proper choice for use with 5 subgroups in order to keep the false-alarm (alpha risk) similar to that when using 3-sigma limits with 25 subgroups [3]. The Xbar & s chart, with the startup readings far beyond the upper control limit, does an excellent job of showing the source of the inflated control limits in Figure 1. A study of six methods for the "detection of lack of within-subgroup homogeneity" in the absence of two-way variation [4] found this method the best of the six. However, this method may not be helpful in the presence of severe two-way variation.


Figure 3. Xbar & s chart subgrouped by time-of-day, 2.5-sigma limits

In retrospect we see that two processes, not one, were present in this example; the startup process and the normal process. When the two processes were wrongfully analyzed as a single process as in Figure 1, the "noise" of the differences between the two processes blocked the information needed to make improvement to either.

The cardinal rule here is that unless an assignable cause of variation is removed, it must be taken into account in the data analysis. The two processes must be tracked separately. For improvement, the best course of action would probably be to concentrate on the startup problem to learn how to minimize the startup impact. In any event, the "bad" startup product has to be separated from the bulk of the "good" product, reassigning the "bad" product or reworking or scrapping as necessary.

All of the above methods for discovering within-subgroup stratification depend upon this stratification being "systematic." In this example, if the time-order of the observations within the day had been lost, the stratification and inflated control limits would not have changed but the source would just have been hidden. Note how critical it is to preserve any natural order that exists in the data. The usual tests for stratification or "centerline-hugging" are seldom powerful enough to be helpful.

One more example from the literature will be mentioned briefly, this one involving three-way variation [5]. The silicon content was measured for each of five heats within each of 3 shifts for five consecutive days. An Xbar & R chart, as well as an interrupted I chart for the individual measurements all led to the erroneous conclusion that there was only common-cause variation present. The analysis error was due to inflated control limits. It could readily be seen by looking at the pattern of the extreme values in the data table that the highest measurement on four of the five days was on the last heat of the third shift. Further, the three highest readings of the 75 were on the final heats of days 1, 3, and 5. Not random. This was apparently a "shut-down" problem. The data aberrations could probably have been found also by using the interrupted run chart on individuals once the inflated I chart limits had been removed. A reference line to skim off the high values like the one in Figure 2 would have been helpful.

When you think a process is in control and it isn't, you are at a loss to know how to go about improving it. Searching out the irregularities of the process is a fun game, but it is also bread and butter. It tells YOU how to improve the process.

References:

[1] Nelson, Lloyd S. "The Shewhart Control Chart-Tests for Special Causes," Journal of Quality Technology, vol. 16(4), October 1984, pp. 237 - 239.
[2] K. Ishikawa. Guide to Quality Control. Asian Productivity Organization, Tokyo, 1982.
[3] M. Hart and R. Hart. Statistical Process Control for Health Care, Duxbury Press, 2002, p. 60
[4] M. Hart, R. Hart, and G. Philip. "Control Chart False Alarm Risks with Three Sigma Limits." Quality Engineering, 4(3) 1992, pp. 413-418.
[5] American Society for Testing and Materials. ASTM Manual on Presentation of Data and Control Chart Analysis -- STP15D. Philadelphia: American Society for Testing and Materials, 1976. p. 128

For more information, contact Drs. Robert and Marilyn Hart at robthart@aol.com or (541)412-0425.

If you would like additional information, please call our Support staff at (541) 752-4100 or send email to .