We wish to do a quality control check on two laboratories.
Six samples of water are selected, and divided into two parts. One half is randomly chosen and sent to Lab 1; the other half is sent to Lab 2. Each lab is supposed to measure the impurities in the water.
Here is the raw data:
Sample Lab 1 Lab 2
------------------
1 31.4 28.1
2 37.0 37.1
3 44.0 40.6
4 28.8 27.3
5 59.9 58.4
6 37.6 38.9
Notice that the difference between the laboratories is small relative to variation among the samples. You are really interested in examining if the differences among the laboratories is zero.
Does this satisfy the conditions laid out earlier for a paired design? Yes, the measurements from each laboratory are related by the sample. There are two readings from each sample - one from each laboratory.
It would be quite inappropriate to analyze this design using the `two-independent samples t-test'as the readings for lab 1 are not independent of the readings for lab 2.
Let:
The hypothesis testing proceeds in much the same fashion as before:
We are not really interested in the actual impurity readings for the various samples, but rather only interested in the difference. This can be specified as
H:
or
or
A:
or
or
.
where
is the mean difference
in the readings. Note again that the hypotheses are in terms of
population parameters and we are interested in testing if the
difference is 0. [A difference of 0 would imply no difference
in the readings between laboratories, on average]
The data is entered into JMP in the usual fashion. Because the
data is paired, the data should be entered as two separate columns,
one for each laboratory. Also, another column should be computed
to compute the difference between the two labs (use a formula).
Summary statistics can be analyzed in two different, but equivalent ways.
Using the Analyze->Fit Y by X platform, use Lab 2 as the Y
variable, Lab 1 as the X variable, and then use the `Paired
t-test' from the Analyze pop-up menu. The following plot
occurs:
If both labs are measuring the impurities identically, all the points should fall along the line Y=X (the dark, thick line). But there is variation in the readings, due to technical problems, etc. and so the points will not fall exactly on the line. If there is no systematic bias, the points should be randomly scattered above and below this thick dark line. However, if there is a systematic `bias' in one lab, the points will consistently fall below the line or above the line. The line which shows the estimated systematic bias is the second line plotted (in color). It appears that lab 1 is showing a systematic positive bias relative to lab 2. However, at this point, we don't know if there is sufficient evidence to conclude that a systematic bias exists in the population value.
Use the Analyze->Distribution of Y to obtain summary statistics
on the difference:
The box plot shows that the mean diff is around 0. The `Moment' report indicates that the mean difference is about 1.383 units, but the standard error is rather large. Indeed, the 95% confidence interval indicates that the mean difference might be zero since 0 is included in the interval.
There are two different (but equivalent) ways of doing the test.
Method 2a automatically generates the following table; in Method
2b you will need to generate a hypothesis test for the population
mean difference is zero. Here are the two reports:
Both report give the same test statistic:
T = (estimate of diff - hypothesized diff)/ estimated se of diff = (1.38 - 0)/.756 = 1.83
Both are compared to a t-distribution with df=number of pairs -1 = 6-1 =5.
Once again, you must decide between the two sided or one-sided p-values. We were doing a two sided test, so the two-sided p-value of .1270 is used.
Because the p-value is large, we fail to reject the null hypothesis and conclude that there is insufficient evidence against the hypothesis that both laboratories give the same mean readings on the samples.
As before, we haven't proven that both labs give the same reading. We only failed to detect a difference in the mean readings.
Notes: