Section 7.2 Inference for paired data
¶When we have two observations on each person or each case, we can answer questions such as the following:
Do students do better on reading or writing sections of standardized tests?
How do the number of days with temperature above 90°F compare between 1948 and 2018?
Are Amazon textbook prices lower than the college bookstore prices? If so, how much lower, on average?
Subsection 7.2.1 Learning objectives
Distinguish between paired and unpaired data.
Recognize that inference procedures for paired data use the same onesample \(t\)procedures as in the previous section, and that these procedures are applied to the differences of the paired observations.
Carry out a complete hypothesis test for paired differences.
Carry out a complete confidence interval procedure for paired differences.
Subsection 7.2.2 Paired observations and samples
In the previous edition of this textbook, we found that Amazon prices were, on average, lower than those of the UCLA Bookstore for UCLA courses in 2010. It's been several years, and many stores have adapted to the online market, so we wondered, how is the UCLA Bookstore doing today?
We sampled 201 UCLA courses. Of those, 68 required books that could be found on Amazon. A portion of the data set from these courses is shown in Table 7.2.1, where prices are in U.S. dollars.
subject  course_number  bookstore  amazon  price_difference  
1  American Indian Studies  M10  47.97  47.45  0.52 
2  Anthropology  2  14.26  13.55  0.71 
3  Arts and Architecture  10  13.50  12.53  0.97 
\(\vdots\)  \(\vdots\)  \(\vdots\)  \(\vdots\)  \(\vdots\)  \(\vdots\) 
67  Korean  1  24.96  23.79  1.17 
68  Jewish Studies  M10  35.96  32.40  3.56 
textbooks
data set.Each textbook has two corresponding prices in the data set: one for the UCLA Bookstore and one for Amazon. Therefore, each textbook price from the UCLA bookstore has a natural correspondence with a textbook price from Amazon. When two sets of observations have this special correspondence, they are said to be paired.
Paired data.
Two sets of observations are paired if each observation in one set has a special correspondence or connection with exactly one observation in the other data set.
To analyze paired data, it is often useful to look at the difference in outcomes of each pair of observations. In the textbook
data set, we look at the differences in prices, which is represented as the diff
variable. Here, for each book, the differences are taken as
It is important that we always subtract using a consistent order; here Amazon prices are always subtracted from UCLA prices. A histogram of these differences is shown in Figure 7.2.2. Using differences between paired observations is a common and useful way to analyze paired data.
Checkpoint 7.2.3.
The first difference shown in Table 7.2.1 is computed as: \(47.97  47.45 = 0.52\text{.}\) What does this difference tell us about the price for this textbook on Amazon versus the UCLA bookstore? ^{ 1 }
Subsection 7.2.3 Hypothesis tests for paired data
¶To analyze a paired data set, we simply analyze the differences. We can use the same \(t\)distribution techniques we applied in the last section.
\(n_{\text{diff}}\)  \(\bar{x}_{\text{diff}}\)  \(s_{\text{diff}}\)  
68  3.58  13.42  
We will set up and implement a hypothesis test to determine whether, on average, there is a difference in textbook prices between Amazon and the UCLA bookstore. We are considering two scenarios: there is no difference in prices or there is some difference in prices.
\(\mu_{\text{diff}}=0\text{.}\) On average, there is no difference in textbook prices.
\(\mu_{\text{diff}} \neq 0\text{.}\) On average, there is some difference in textbook prices.
Can the \(t\)distribution be used for this application? The observations are based on a random sample from a large population, so independence is reasonable. While the distribution of the data is very strongly skewed, we do have \(n = 68\) observations. This sample size is large enough that we do not have to worry about whether the population distribution for difference in price might be nearly normal or not. Because the conditions are satisfied, we can use the \(t\)distribution to this setting.
We compute the standard error associated with \(\bar{x}_{\text{diff}}\) using the standard deviation of the differences (\(s_{\text{diff}} = 13.42\)) and the number of differences (\(n_{\text{diff}} = 68\)):
Next we compute the test statistic. The point estimate is the observed value of \(\bar{x}_{\text{diff}}\text{.}\) The null value is the value hypothesized under the null hypothesis. Here, the null hypothesis is that the true mean of the differences is 0.
The degrees of freedom are \(df = 68  1 = 67\text{.}\) To visualize the pvalue, the sampling distribution of \(\bar{x}_{\text{diff}}\) is drawn as though \(H_0\) is true. This is shown in Figure 7.2.5. Because this is a twosided test, the pvalue corresponds to the area in both tails. Using statistical software, we find the area in the tails to be 0.0312.
Because the pvalue of 0.0312 is less than 0.05, we reject the null hypothesis. We have evidence that, on average, there is a difference in textbook prices. In particular, we can say that, on average, Amazon prices are lower than the UCLA Bookstore prices for UCLA course textbooks.
Example 7.2.6.
We have evidence to conclude Amazon is, on average, less expensive. Does this mean that UCLA students should always buy their books on Amazon?
No. The fact that Amazon is, on average, less expensive, does not imply that it is less expensive for every textbook. Examining the distribution shown in Figure 7.2.2, we see that there are certainly a handful of cases where Amazon prices are much lower than the UCLA Bookstore's, which suggests it is worth checking Amazon or other online sites before purchasing. However, in many cases the Amazon price is above what the UCLA Bookstore charges, and most of the time the price isn't that different.
For reference, this is a very different result from what we (the authors) had seen in a similar data set from 2010. At that time, Amazon prices were almost uniformly lower than those of the UCLA Bookstore's and by a large margin, making the case to use Amazon over the UCLA Bookstore quite compelling at that time.
Hypothesis test for paired data.
To carry out a complete hypothesis test to test the claim that a mean of differences \(\mu_{\text{diff}}\) is equal to 0,
Identify: Identify the hypotheses and the significance level, \(\alpha\text{.}\)
\(H_0\text{:}\) \(\mu_{\text{diff}} = 0\)
\(H_A\text{:}\) \(\mu_{\text{diff}} \ne 0\text{;}\) \(H_A\text{:}\) \(\mu_{\text{diff}} > 0\text{;}\) or \(H_A\text{:}\) \(\mu_{\text{diff}} \lt 0\)
Choose: Choose the appropriate test procedure and identify it by name.
Here we choose the matched pairs \(t\)test.
Check: Check conditions for the sampling distribution of \(\bar{x}_{\text{diff}}\) to be nearly normal.
There is paired data from a random sample or from a matched pairs experiment.
\(n_{\text{diff}}\ge 30\) or the population of differences is nearly normal. If the number of differences is less than 30 and the distribution of the populationof differences is unknown, check for strong skew or outliers in the sample differences. If neither is found, then the condition that the population of differences is nearly normalis considered reasonable.
Calculate: Calculate the \(t\)statistic, \(df\text{,}\) and pvalue.

\(T = \frac{\text{ point estimate }  \text{ null value } }{SE \text{ of estimate } }\text{,}\) \(df=n_{\text{diff}} 1\)
point estimate: the sample mean of differences \(\bar{x}_{\text{diff}}\)
\(SE\) of estimate: \(\frac{s_{\text{diff}} }{\sqrt{n_{\text{diff}} }}\)
null value: \(0\)
pvalue = (based on the \(t\)statistic, the \(df\text{,}\) and the direction of \(H_A\))
Conclude: Compare the pvalue to \(\alpha\text{,}\) and draw a conclusion in context.
If the pvalue is \(\lt \alpha\text{,}\) reject \(H_0\text{;}\) there is sufficient evidence that [\(H_A\) in context].
If the pvalue is \(> \alpha\text{,}\) do not reject \(H_0\text{;}\) there is not sufficient evidence that [\(H_A\) in context].
Example 7.2.8.
An SAT preparation company claims that its students' scores improve by over 100 points on average after their course. A consumer group would like to evaluate this claim, and they collect data on a random sample of 30 students who took the class. Each of these students took the SAT before and after taking the company's course, so we have a difference in scores for each student. We will examine these differences \(x_1=57\text{,}\) \(x_2=133\text{,}\) ..., \(x_{30}=140\) as a sample to evaluate the company's claim. The distribution of the differences, shown in Figure 7.2.7, has a mean of 135.9 and a standard deviation of 82.2. Do these data provide convincing evidence to back up the company's claim? Use the five step framework to organize your work.
Identify: We will test the following hypotheses at the \(\alpha=0.05\) level:
\(H_0\text{:}\) \(\mu_{\text{diff}} = 100\text{.}\) Student scores improve by 100 points, on average.
\(H_A\text{:}\) \(\mu_{\text{diff}} > 100\text{.}\) Student scores improve by more than 100 points, on average.
Here, \({\text{diff}}\) = \({\text{ SAT score after course  SAT score before course } }\text{.}\)
Choose: Because we have paired data and the parameter to be estimated is a mean of differences, we will use a matched pairs \(t\)test.
Check: We have a random sample with paired observations and the number of differences is \(n_{\text{diff}}=30 \ge 30\text{,}\) so we can proceed with the matched pairs \(t\)test.
Calculate: We will calculate the test statistic, \(df\text{,}\) and pvalue.
The point estimate is the sample mean of differences: \(\bar{x}_{\text{diff}} = 135.9\)
The \(SE\) of the sample mean of differences is: \(\frac{s_{\text{diff}} }{\sqrt{n_{\text{diff}} }} = \frac{82.2}{\sqrt{30}} = 15.0\)
Since this is essentially a one sample \(t\)test, the degrees of freedom is \(n_{\textit{diff}}1\text{.}\)
The pvalue is the area to the right of 2.4 under the \(t\)distribution with 29 degrees of freedom. The pvalue = 0.012.
Conclude: pvalue \(=0.012 \lt \alpha\) so we reject the null hypothesis. The data provide convincing evidence to support the company's claim that students' scores improve by more than 100 points, on average, following the class.
Checkpoint 7.2.9.
Because we found evidence to support the company's claim, does this mean that a student will score more than 100 points higher on the SAT if they take the class than if they do not take the class? ^{ 2 }
Subsection 7.2.4 Calculator: the matched pairs \(t\)test
¶The matched pairs \(t\)test is a one sample \(t\)test. Instead of using the data or the summary statistics from a sample, make sure to use the data of differences or the summary statistics for the differences.
TI83/84: matched pairs Ttest.
Use STAT
, TESTS
, TTest
.
Choose
STAT
.Right arrow to
TESTS
.Down arrow and choose
2:TTest
.Choose
Data
if you have all the data orStats
if you have the mean and standard deviation.
Let \(\mu_{0}\) be the null or hypothesized value of \(\mu_{\text{diff}}\text{.}\)
If you choose
Data
, letList
beL3
or the list in which you entered the differences, and letFreq
be1
.If you choose
Stats
, enter the mean, \(SD\text{,}\) and sample size of the differences.
Choose \(\ne\text{,}\) \(\lt\text{,}\) or \(\gt\) to correspond to \(H_A\text{.}\)

Choose
Calculate
and hitENTER
, which returns:t
t statistic p
pvalue \(\bar{x}\) the sample mean of the differences Sx
the sample \(SD\) of the differences n
the sample size of the differences
Casio fx9750GII: matched pairs Ttest.
Compute the paired differences of the observations.
Using the computed differences, follow the instructions for a 1sample \(t\)test
Subsection 7.2.5 Confidence intervals for the mean of a difference
¶In the previous examples, we carried out a matched pairs \(t\)test, where the null hypothesis was that the true average of the paired differences is zero. Sometimes we want to estimate the true average of paired differences with a confidence interval, and we use a matched pairs \(t\)interval. Consider again the table summarizing data on: UCLA Bookstore price \(\) Amazon price, for each of the 68 books sampled.
\(n_{\text{diff}}\)  \(\bar{x}_{\text{diff}}\)  \(s_{\text{diff}}\)  
68  3.58  13.42  
We construct a 95% confidence interval for the average price difference between books at the UCLA Bookstore and on Amazon. Conditions have already verified, namely, that we have paired data from a random sample and that the number of differences is at least 30. We must find the critical value, \(t^{\star}\text{.}\) Since \(df = 67\) is not on the \(t\)table, round the \(df\) down to 60 to get a \(t^{\star}\) of 2.00 for 95% confidence. (See Subsection 7.2.6 for how to get a more precise interval using a calculator.) Plugging the \(t^{\star}\) value, point estimate, and standard error into the confidence interval formula, we get:
We are 95% confident that the UCLA bookstore is, on average, between $0.33 and $6.83 more expensive than Amazon for UCLA course books. This interval does not contain zero, so it is consistent with the earlier hypothesis test that rejected the null hypothesis that the average difference was 0. Because our interval is entirely above 0, we have evidence that the true average difference is greater than zero. Unlike the hypothesis test, the confidence interval gives us a good idea of how much more expensive the UCLA bookstore might be, on average.
Example 7.2.11.
Based on the interval, can we say that 95% of the books cost between $0.33 and $6.83 more at the UCLA Bookstore than on Amazon?
No. This interval is attempting to estimate the average difference with 95% confidence. It is not attempting to capture 95% of the values. A quick look at Figure 7.2.2 shows that much less than 95% of the differences fall between $0.32 and $6.84.
Constructing a confidence interval for paired data.
To carry out a complete confidence interval procedure to estimate a mean of differences \(\mu_{\text{diff}}\text{,}\)
Identify: Identify the parameter and the confidence level, C%.
The parameter will be a mean of differences, e.g. the true mean of the differences in county population (year 2018 \(\) year 2017).
Choose: Choose the appropriate interval procedure and identify it by name.
Here we choose the matched pairs \(t\)interval.
Check: Check conditions for the sampling distribution of \(\bar{x}_{\text{diff}}\) to be nearly normal.
There is paired data from a random sample or from a matched pairs experiment.
\(n_{\text{diff}}\ge 30\) or the population of differences is nearly normal. If the number of differences is less than 30 and the distribution of the population of differences is unknown, check for strong skew or outliers in the sample differences. If neither is found, then the condition that the population of differences is nearly normal is considered reasonable.
Calculate: Calculate the confidence interval and record it in interval form.

\(\text{ point estimate } \ \pm\ t^{\star} \times SE\ \text{ of estimate }\text{,}\) \(df\text{:}\) \(n_{\text{diff}}  1\)
point estimate: the sample mean of differences \(\bar{x}_{\text{diff}}\)
\(SE\) of estimate: \(\frac{s_{\text{diff}}}{\sqrt{n_{\text{diff}}}}\)
\(t^{\star}\text{:}\) use a \(t\)table at row \(df = n_{\textit{diff}}  1\) and confidence level C
(_______, _______)
Conclude: Interpret the interval and, if applicable, draw a conclusion in context.
We are C% confident that the true mean of the differences in [...] is between _______ and _______. If applicable, draw a conclusion based on whether the interval is entirely above, is entirely below, or contains the value 0.
Example 7.2.12.
An SAT preparation company claims that its students' scores improve by over 100 points on average after their course. A consumer group would like to evaluate this claim, and they collect data on a random sample of 30 students who took the class. Each of these students took the SAT before and after taking the company's course, so we have a difference in scores for each student. We will examine these differences \(x_1=57\text{,}\) \(x_2=133\text{,}\) ..., \(x_{30}=140\) as a sample to evaluate the company's claim. The distribution of the differences, shown in Figure 7.2.13, has a mean of 135.9 and a standard deviation of 82.2. Construct a confidence interval to estimate the true average increase in SAT after taking the company's course. Is there evidence at the 95% confidence level that students score an average of more than 100 points higher after the class? Use the five step framework to organize your work.
Identify: The parameter we want to estimate is \(\mu_{\text{diff}}\text{,}\) the true change in SAT score after taking the company's course. Here, \({\text{diff}}\) = \({\text{ SAT score after course }  \text{ SAT score before course } }\text{.}\) We will estimate this parameter at the 95% confidence level.
Choose: Because we have paired data and the parameter to be estimated is a mean of differences, we will use a matched pairs \(t\)interval.
Check: We have a random sample with paired observations and the number of differences is \(n_{\text{diff}}=30\ge 30\text{,}\) so we can proceed with the matched pairs \(t\)interval.
Calculate: We will calculate the confidence interval as follows.
The point estimate is the sample mean of differences: \(\bar{x}_{\text{diff}} = 135.9\)
The \(SE\) of the sample mean of differences is: \(\frac{s_{\text{diff}}}{\sqrt{n_{\text{diff}}}} = \frac{82.2}{\sqrt{30}}=15.0\)
We find \(t^{\star}\) for the one sample case using the \(t\)table at row \(df = n 1\) and confidence level C%. For a 95% confidence level and \(df = 30  1 = 29\text{,}\) \(t^{\star} = 2.045\text{.}\)
The 95% confidence interval is given by:
Conclude: We are 95% confident that the true average increase in SAT score following the company's course is between 105.2 points to 166.6 points. There is sufficient evidence that students score greater than 100 points higher, on average, after the company's course because the entire interval is above 100.
Checkpoint 7.2.14.
Based on the interval (105.2, 166.6), calculated previously, can we say that 95% of student scores increased between 105.2 and 166.6 points after taking the company's course? ^{ 3 }
Subsection 7.2.6 Calculator: the matched pairs \(t\)interval
¶TI83/84: matched pairs Tinterval.
Use STAT
, TESTS
, TInterval
.
Choose
STAT
.Right arrow to
TESTS
.Down arrow and choose
8:TInterval
.
Choose
Data
if you have all the data orStats
if you have the mean and standard deviation.If you choose
Data
, letList
beL3
or the list in which you entered the differences (don't forget to enter the differences!) and letFreq
be1
.If you choose
Stats
, enter the mean, \(SD\text{,}\) and sample size of the differences.
Let
CLevel
be the desired confidence level.
Choose
Calculate
and hitENTER
, which returns:(,) the confidence interval for the mean of the differences \(\bar{x}\) the sample mean of the differences Sx
the sample \(SD\) of the differences n
the number of differences in the sample
Casio fx9750GII: matched pairs Tinterval.
Compute the paired differences of the observations.
Using the computed differences, follow the instructions for a 1sample \(t\)interval.
Checkpoint 7.2.15.
In our UCLA textbook example, we had 68 paired differences. Because \(df=67\) was not on our \(t\)table, we rounded the \(df\) down to 60. This gave us a 95% confidence interval (0.325, 6.834). Use a calculator to find the more exact 95% confidence interval based on 67 degrees of freedom. How different is it from the one we calculated based on 60 degrees of freedom? ^{ 4 }
TInterval
or equivalent. We do not have all the data, so choose Stats
on a TI or Var
on a Casio. Enter x
\(= 3.58\) and Sx
\(= 13.42\text{.}\) Let n
\(= 68\) and CLevel
\(= 0.95\text{.}\) This should give the interval (0.332, 6.828). The intervals are equivalent when rounded to two decimal places.\(n_{\text{diff}}\)  \(\bar{x}_{\text{diff}}\)  \(s_{\text{diff}}\)  
68  3.58  13.42  
Subsection 7.2.7 Section summary
Paired data can come from a random sample or a matched pairs experiment. With paired data, we are often interested in whether the difference is positive, negative, or zero. For example, the difference of paired data from a matched pairs experiment tells us whether one treatment did better, worse, or the same as the other treatment for each subject.
We use the notation \(\bar{x}_{\text{diff}}\) to represent the mean of the sample differences. Likewise, \(s_{\text{diff}}\) is the standard deviation of the sample differences, and \(n_{\text{diff}}\) is the number of sample differences.
To carry out inference on paired data, we first find all of the sample differences. Then, we perform a onesample procedure on the differences. For this reason, the confidence interval and hypothesis test for paired data use the same \(t\)procedures as the onesample methods, where the degrees of freedom is given by \(n_{\text{diff}}1\text{.}\)

When there is paired data and the parameter of interest is a mean of the differences:
Estimate \(\mu_{\text{diff}}\) at the C% confidence level using a matched pairs \(t\)interval.
Test \(H_0\text{:}\) \(\mu_{\text{diff}}=0\) at the \(\alpha\) significance level using a matched pairs \(t\)test.

The conditions for the matched pairs \(t\)interval and \(t\)test are the same.
There is paired data from a random sample or a matched pairs experiment.
\(n_{\text{diff}}\ge 30\) or the population of differences is nearly normal. If the number of differences is less than 30 and it is not known that the population of differences is nearly normal, we argue that the population of differences could be nearly normal if there is no strong skew or outliers in the sample differences.

When the conditions are met, we calculate the confidence interval and the test statistic as we did in the previous section. Here, our data is a list of differences.
Confidence interval: \(\text{ point estimate } \ \pm\ t^{\star} \times SE\ \text{ of estimate }\)
Test statistic: \(T = \frac{\text{ point estimate }  \text{ null value } }{SE \text{ of estimate } }\)
Here the point estimate is the mean of sample differences: \(\bar{x}_{\textit{{diff}}}\text{.}\)
The \(SE\) of estimate is the \(SE\) of a mean of sample differences: \(\frac{s_{\textit{{diff}}}}{\sqrt{n_{\textit{{diff}}}}}\text{.}\)
The degrees of freedom is given by \(df = n_{\textit{{diff}}}1\text{.}\)
Exercises 7.2.8 Exercises
1. Air quality.
Air quality measurements were collected in a random sample of 25 country capitals in 2013, and then again in the same cities in 2014. We would like to use these data to compare average air quality between the two years. Should we use a paired or nonpaired test? Explain your reasoning.
Paired, data are recorded in the same cities at two different time points. The temperature in a city at one point is not independent of the temperature in the same city at another time point.
2. True / False: paired.
Determine if the following statements are true or false. If false, explain.
In a paired analysis we first take the difference of each pair of observations, and then we do inference on these differences.
Two data sets of different sizes cannot be analyzed as paired data.
Consider two sets of data that are paired with each other. Each observation in one data set has a natural correspondence with exactly one observation from the other data set.
Consider two sets of data that are paired with each other. Each observation in one data set is subtracted from the average of the other data set's observations.
3. Paired or not? Part I.
In each of the following scenarios, determine if the data are paired.
Compare pre (beginning of semester) and posttest (end of semester) scores of students.
Assess genderrelated salary gap by comparing salaries of randomly sampled men and women.
Compare artery thicknesses at the beginning of a study and after 2 years of taking Vitamin E for the same group of patients.
Assess effectiveness of a diet regimen by comparing the before and after weights of subjects.
(a) Since it's the same students at the beginning and the end of the semester, there is a pairing between the datasets, for a given student their beginning and end of semester grades are dependent.
(b) Since the subjects were sampled randomly, each observation in the men's group does not have a special correspondence with exactly one observation in the other (women's) group.
(c) Since it's the same subjects at the beginning and the end of the study, there is a pairing between the datasets, for a subject student their beginning and end of semester artery thickness are dependent.
(d) Since it's the same subjects at the beginning and the end of the study, there is a pairing between the datasets, for a subject student their beginning and end of semester weights are dependent.
4. Paired or not? Part II.
In each of the following scenarios, determine if the data are paired.
We would like to know if Intel's stock and Southwest Airlines' stock have similar rates of return. To find out, we take a random sample of 50 days, and record Intel's and Southwest's stock on those same days.
We randomly sample 50 items from Target stores and note the price for each. Then we visit Walmart and collect the price for each of those same 50 items.
A school board would like to determine whether there is a difference in average SAT scores for students at one high school versus another high school in the district. To check, they take a simple random sample of 100 students from each high school.
5. Global warming, Part I.
Let's consider a limited set of climate data, examining temperature differences in 1948 vs 2018. We randomly sampled 197 locations from the National Oceanic and Atmospheric Administration's (NOAA) historical data, where the data was available for both years of interest. We want to know: were there more days with temperatures exceeding 90°F in 2018 or 1948?^{ 5 } The difference in number of days exceeding 90°F (number of days in 2018  number of days in 1948) was calculated for each of the 197 locations. The average of these differences was 2.9 days with a standard deviation of 17.2 days. We are interested in determining whether these data provide strong evidence that there were more days in 2018 that exceeded 90°F from NOAA's weather stations.
Is there a relationship between the observations collected in 1948 and 2018? Or are the observations in the two groups independent? Explain.
Write hypotheses for this research in symbols and in words.
Check the conditions required to complete this test. A histogram of the differences is given to the right.
Calculate the test statistic and find the pvalue.
Use \(\alpha = 0.05\) to evaluate the test, and interpret your conclusion in context.
What type of error might we have made? Explain in context what the error means.
(a) For each observation in one data set, there is exactly one specially corresponding observation in the other data set for the same geographic location. The data are paired. Let \(diff = 2018  1948\text{.}\)
(b) \(H_{0}: \mu_{diff} = 0\) (On average, the number of days exceeding 90°F in 1948 and 2018 for NOAA stations was the same.) \(H_{A}: \mu_{diff} > 0\) (On average, there were more days exceeding 90°F in 2018 than 1948 for NOAA stations.)
(c) Locations were randomly sampled, so independence is reasonable. The sample size is at least 30, so we're just looking for particularly extreme outliers: none are present (the observation off left in the histogram would be considered a clear outlier, but not a particularly extreme one). Therefore, the conditions are satisfied.
(d) \(SE = 17.2/ \sqrt{197} = 1.23\text{.}\) \(T = \frac{2.90}{1.23} = 2.36\) with degrees of freedom \(df = 1971 = 196\text{.}\) This leads to a pvalue of about 0.019.
(e) Since the pvalue is less than 0.05, we reject \(H_{0}\text{.}\) The data provide strong evidence that NOAA stations observed more 90°F days in 2018 than in 1948.
(f) Type 1 Error, since we may have incorrectly rejected \(H_{0}\text{.}\) This error would mean that NOAA stations did not actually observe a decrease, but the sample we took just so happened to make it appear that this was the case.
6. High School and Beyond, Part I.
The National Center of Education Statistics conducted a survey of high school seniors, collecting test data on reading, writing, and several other subjects. Here we examine a simple random sample of 200 students from this survey. Sidebyside box plots of reading and writing scores as well as a histogram of the differences in scores are shown below.
Is there a clear difference in the average reading and writing scores?
Are the reading and writing scores of each student independent of each other?
Create hypotheses appropriate for the following research question: is there an evident difference in the average scores of students in the reading and writing exam?
Check the conditions required to complete this test.
The average observed difference in scores is \(\bar{x}_{readwrite} = 0.545\text{,}\) and the standard deviation of the differences is 8.887 points. Do these data provide convincing evidence of a difference between the average scores on the two exams?
What type of error might we have made? Explain what the error means in the context of the application.
Based on the results of this hypothesis test, would you expect a confidence interval for the average difference between the reading and writing scores to include 0? Explain your reasoning.
7. Global warming, Part II.
We considered the change in the number of days exceeding 90°F from 1948 and 2018 at 197 randomly sampled locations from the NOAA database in Exercise 7.2.8.5. The mean and standard deviation of the reported differences are 2.9 days and 17.2 days.
Calculate a 90% confidence interval for the average difference between number of days exceeding 90°F between 1948 and 2018. We've already checked the conditions for you.
Interpret the interval in context.
Does the confidence interval provide convincing evidence that there were more days exceeding 90°F in 2018 than in 1948 at NOAA stations? Explain.
(a) \(SE=1.23\) and \(z^{*}=1.65\text{.}\) \(2.9 \pm 1.65 \times 1.23 \rightarrow (0.87, 4.93)\text{.}\)
(b) We are 90% confident that there was an increase of 0.87 to 4.93 in the average number of days that hit 90°F in 2018 relative to 1948 for NOAA stations.
(c) Yes, since the interval lies entirely above 0.
8. High school and beyond, Part II.
We considered the differences between the reading and writing scores of a random sample of 200 students who took the High School and Beyond Survey in Exercise 7.2.8.6. The mean and standard deviation of the differences are \(\bar{x}_{readwrite} = 0.545\) and 8.887 points.
Calculate a 95% confidence interval for the average difference between the reading and writing scores of all students.
Interpret this interval in context.
Does the confidence interval provide convincing evidence that there is a real difference in the average scores? Explain.