Section 7.3 Inference for the difference of two means
Often we wish to compare two groups to answer questions such as the following:
Does treatment using embryonic stem cells (ESCs) help improve heart function following a heart attack?
Is there convincing evidence that newborns from mothers who smoke have a different average birth weight than newborns from mothers who don't smoke?
Is there statistically significant evidence that one variation of an exam is harder than another variation?
Are faculty willing to pay someone named “John” more than someone named “Jennifer”? If so, how much more?
Subsection 7.3.1 Learning objectives
Determine when it is appropriate to use a paired \(t\)-procedure versus a two-sample \(t\)-procedure.
State and verify whether or not the conditions for inference on the difference of two means using the \(t\)-distribution are met.
Be able to use a calculator or other software to find the degrees of freedom associated with a two-sample \(t\)-procedure.
Carry out a complete confidence interval procedure for the difference of two means.
Carry out a complete hypothesis test for the difference of two means.
Subsection 7.3.2 Sampling distribution for the difference of two means
Previously we explored the sampling distribution for the difference of two proportions. Here we consider the sampling distribution for the difference of two means. We are interested in the distribution of \(\bar{x}_1-\bar{x}_2\text{.}\) We know that it is centered on \(\mu_1-\mu_2\text{.}\) The standard deviation for the difference is \(\sqrt{\frac{\sigma_1^2}{n_1}+\frac{\sigma_2^2}{n_2}}\text{.}\)
Finally, we are interested in the shape of the sampling distribution of \(\bar{x}_1-\bar{x}_2\text{.}\) It will be nearly normal when the sampling distributions of \(\bar{x}_1\) and of \(\bar{x}_2\) are each nearly normal.
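This standard deviation formula can be checked with a short simulation. The Python sketch below uses hypothetical population values (chosen only for illustration) and compares the simulated standard deviation of \(\bar{x}_1-\bar{x}_2\) with the formula above.

```python
import numpy as np

# Hypothetical population parameters, chosen only for illustration.
mu1, sigma1, n1 = 50, 10, 40
mu2, sigma2, n2 = 45, 12, 35

rng = np.random.default_rng(seed=1)
reps = 100_000

# Simulate many pairs of samples and record xbar1 - xbar2 each time.
xbar1 = rng.normal(mu1, sigma1, size=(reps, n1)).mean(axis=1)
xbar2 = rng.normal(mu2, sigma2, size=(reps, n2)).mean(axis=1)
diffs = xbar1 - xbar2

print("simulated center:", diffs.mean())                          # close to mu1 - mu2 = 5
print("simulated SD:    ", diffs.std())                           # close to the formula value
print("formula SD:      ", np.sqrt(sigma1**2/n1 + sigma2**2/n2))  # about 2.57
```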
Subsection 7.3.3 Checking conditions for inference on a difference of means
When comparing two means, we carry out inference on a difference of means, \(\mu_1-\mu_2\text{.}\) We will use the \(t\)-distribution just as we did when carrying out inference on a single mean. The assumptions are that the observations are independent, both between groups and within groups, and that the sampling distribution of \(\bar{x}_1-\bar{x}_2\) is nearly normal. We check whether these assumptions are reasonable by verifying the following conditions.
Independent. Observations can be considered independent when the data are collected from two independent random samples or, in the context of experiments, from two randomly assigned treatments. Randomly assigning subjects to treatments is equivalent to randomly assigning treatments to subjects.
Nearly normal sampling distribution. The sampling distribution of \(\bar{x}_1-\bar{x}_2\) will be nearly normal when the sampling distribution of \(\bar{x}_1\) and of \(\bar{x}_2\) are nearly normal, that is when both population distributions are nearly normal or both sample sizes are at least 30.
As before, if the sample sizes are small and the population distributions are not known to be nearly normal, we look at the data for excessive skew or outliers. If we do not find excessive skew or outliers in either group, we consider the assumption that the populations are nearly normal to be reasonable.
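When the samples are small, a quick plot is the easiest way to look for excessive skew or outliers. A minimal sketch in Python (with made-up data, purely for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical small samples (both n < 30), used only to illustrate the check.
group1 = np.array([12.1, 14.3, 13.8, 15.0, 12.9, 14.7, 13.2, 15.4, 14.1, 13.6])
group2 = np.array([11.8, 13.0, 12.4, 14.2, 12.7, 13.5, 11.9, 13.8, 12.2, 13.1])

# Side-by-side boxplots (or dot plots / histograms) make strong skew
# and outliers easy to spot before trusting the t-procedure.
plt.boxplot([group1, group2])
plt.xticks([1, 2], ["group 1", "group 2"])
plt.ylabel("response")
plt.show()
```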
Subsection 7.3.4 Confidence intervals for a difference of means
What's in a name? Are employers more likely to offer interviews or higher pay to prospective employees when the name on a resume suggests the candidate is a man versus a woman? This is a challenging question to tackle, because employers are influenced by many aspects of a resume. Thinking back to Chapter 1 on data collection, we could imagine a host of confounding factors associated with name and gender. How could we possibly isolate just the factor of name? We would need an experiment in which name was the only variable and everything else was held constant.
Researchers at Yale carried out precisely this experiment. Their results were published in the Proceedings of the National Academy of Sciences (PNAS). 1 The researchers sent out resumes to faculty at academic institutions for a lab manager position. The resumes were identical, except that on half of them the applicant's name was John and on the other half, the applicant's name was Jennifer. They wanted to see if faculty, specifically faculty trained in conducting scientifically objective research, held implicit gender biases.
Unlike in the matched pairs scenario, each faculty member received only one resume. We are interested in comparing the mean salary offered to John relative to the mean salary offered to Jennifer. Instead of taking the average of a set of paired differences, we find the average of each group separately and take their difference. Let \(\bar{x}_1\) represent the mean salary offered to the applicants named John and let \(\bar{x}_2\) represent the mean salary offered to the applicants named Jennifer.
We will use \(\bar{x}_1 - \bar{x}_2\) as our point estimate for \(\mu_1-\mu_2\text{.}\) The data is given in the table below.
 | \(n\) | \(\bar{x}\) | \(s\) |
John | 63 | $30,238 | $5,567 |
Jennifer | 64 | $26,508 | $7,247 |
We can calculate the difference as \(\bar{x}_1 - \bar{x}_2 = \$30{,}238 - \$26{,}508 = \$3{,}730\text{.}\)
Example 7.3.1.
Interpret the point estimate 3730. Why might we want to construct a confidence interval?
The average salary offered to John was $3,730 higher than the average salary offered to Jennifer. Because there is randomness in which faculty ended up in the John group and which faculty ended up in the Jennifer group, we want to see if the difference of $3,730 is beyond what could be expected by random variation. In order to answer this, we will first want to calculate the \(SE\) for the difference of sample means.
Example 7.3.2.
Calculate and interpret the \(SE\) for the difference of sample means.
Using \(SE=\sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}}\) with the summary statistics above, the typical error in our estimate of \(\mu_1-\mu_2\text{,}\) the real difference in mean salary that the faculty would offer John versus Jennifer, is about $1,151.
We see that the difference of sample means of $3,730 is more than 3 \(SE\) above 0, which makes us think that the difference being 0 is unreasonable. We would like to construct a 95% confidence interval for the theoretical difference in mean salary that would be offered to John versus Jennifer. For this, we need the degrees of freedom associated with a two-sample \(t\)-interval.
For the one-sample \(t\)-procedure, the degrees of freedom is given by the simple expression \(n-1\text{,}\) where \(n\) is the sample size. For the two-sample \(t\)-procedures, however, there is a complex formula for calculating the degrees of freedom, which is based on the two sample sizes and the two sample standard deviations. In practice, we find the degrees of freedom using software or a calculator (see Subsection 7.3.5). If this is not possible, the alternative is to use the smaller of \(n_1-1\) and \(n_2-1\text{.}\)
Degrees of freedom for two-sample T-procedures.
Use statistical software or a calculator to compute the degrees of freedom for two-sample \(t\)-procedures. If this is not possible, use the smaller of \(n_1-1\) and \(n_2-1\text{.}\)
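For reference, the approximation that calculators and most software use (often called the Welch or Welch-Satterthwaite formula) is

\(df \approx \frac{\left(\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}\right)^{2}}{\frac{1}{n_1-1}\left(\frac{s_1^2}{n_1}\right)^{2}+\frac{1}{n_2-1}\left(\frac{s_2^2}{n_2}\right)^{2}}\text{.}\)

You are not expected to evaluate this by hand; it is simply the quantity that a calculator reports as \(df\text{.}\)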
Example 7.3.3.
Verify that conditions are met for a two-sample \(t\)-test. Then, construct the 95% confidence interval for the difference of means.
We noted previously that this is an experiment and that the two treatments (name Jennifer and name John) were randomly assigned. Also, both sample sizes are well over 30, so the distribution of \(\bar{x}_1-\bar{x}_2\) is nearly normal. Using a calculator, we find that \(df= 114.4\text{.}\) Since 114 is not on the \(t\)-table, we round the degrees of freedom down to 100. 2 Using a \(t\)-table at row \(df=100\) with 95% confidence, we get \(t^{\star} = 1.984\text{.}\) We calculate the confidence interval as follows: \(3730 \ \pm\ 1.984 \times 1151 \ \rightarrow\ 3730 \ \pm\ 2284 \ \rightarrow\ (1446,\ 6014)\text{.}\)
Based on this interval, we are 95% confident that the true difference in mean salary that these faculty would offer John versus Jennifer is between $1,446 and $6,014. That is, we are 95% confident that the mean salary these faculty would offer John for a lab manager position is between $1,446 and $6,014 more than the mean salary they would offer Jennifer for the position.
The results of these studies and others like it are alarming and disturbing. 3 One aspect that makes this bias so difficult to address is that the experiment, as well-designed as it was, cannot send us much signal about which faculty are discriminating. Each faculty member received only one of the resumes. A faculty member that offered “Jennifer” a very low salary may have also offered “John” a very low salary.
We might imagine an experiment in which each faculty member received both resumes, so that we could compare how much they would offer a Jennifer versus a John. However, the matched pairs scenario is clearly not possible in this case, because what makes the experiment work is that the resumes are exactly the same except for the name. An employer would notice something fishy if they received two identical resumes. It is only possible to say that overall, the faculty were willing to offer John more money for the lab manager position than Jennifer. Finding proof of bias for individual cases is a persistent challenge in enforcing anti-discrimination laws.
Constructing a confidence interval for the difference of two means.
To carry out a complete confidence interval procedure to estimate the difference of two means \(\mu_1 - \mu_2\text{,}\)
Identify: Identify the parameter and the confidence level, C%.
The parameter will be a difference of means, e.g. the true difference in mean cholesterol reduction (mean treatment A \(-\) mean treatment B).
Choose: Choose the appropriate interval procedure and identify it by name.
Here we choose the 2-sample \(t\)-interval.
Check: Check conditions for the sampling distribution of \(\bar{x}_1-\bar{x}_2\) to be nearly normal.
Data come from 2 independent random samples or 2 randomly assigned treatments.
\(n_1\ge 30\) and \(n_2\ge 30\) or both population distributions are nearly normal. If the sample sizes are less than 30 and the population distributions are unknown, check for strong skew or outliers in either data set. If neither is found, the condition that both population distributions are nearly normal is considered reasonable.
Calculate: Calculate the confidence interval and record it in interval form.
\(\text{ point estimate } \ \pm \ t^{\star} \times SE\ \text{ of estimate }\text{,}\) \(df\text{:}\) use calculator or other technology
point estimate: the difference of sample means \(\bar{x}_1 - \bar{x}_2\)
\(SE\) of estimate: \(\sqrt{\frac{s^2_1}{n_1}+\frac{s^2_2}{n_2}}\)
\(t^{\star}\text{:}\) use a \(t\)-table at row \(df\) and confidence level C
( _______ , _______ )
Conclude: Interpret the interval and, if applicable, draw a conclusion in context.
We are C% confident that the true difference in mean [...] is between _______ and _______ . If applicable, draw a conclusion based on whether the interval is entirely above, is entirely below, or contains the value 0.
Example 7.3.4.
An instructor decided to run two slight variations of the same exam. Prior to passing out the exams, she shuffled the exams together to ensure each student received a random version. Summary statistics for how students performed on these two exams are shown in the table below. Anticipating complaints from students who took Version B, she would like to evaluate whether the difference observed in the groups is so large that it provides convincing evidence that Version B was more difficult (on average) than Version A. Use a 95% confidence interval to estimate the difference in average score: version A - version B.
Version | \(n\) | \(\bar{x}\) | \(s\) | min | max |
A | 30 | 79.4 | 14 | 45 | 100 |
B | 30 | 74.1 | 20 | 32 | 100 |
Identify: The parameter we want to estimate is \(\mu_{1}-\mu_2\text{,}\) which is the true average score under Version A \(-\) the true average score under Version B. We will estimate this parameter at the 95% confidence level.
Choose: Because we are comparing two means, we will use a 2-sample \(t\)-interval.
Check: The data was collected from an experiment with two randomly assigned treatments, Version A and Version B of the exam. Both group sizes are 30, so the condition that they are at least 30 is met.
Calculate: We will calculate the confidence interval as follows.
The point estimate is the difference of sample means: \(\bar{x}_1 - \bar{x}_2 = 79.4 - 74.1 = 5.3\text{.}\)
The \(SE\) of a difference of sample means is: \(\sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}} = \sqrt{\frac{14^2}{30}+\frac{20^2}{30}} = 4.46\text{.}\)
In order to find the critical value \(t^{\star}\text{,}\) we must first find the degrees of freedom. Using a calculator, we find \(df=51.9\text{.}\) We round down to 50, and using a \(t\)-table at row \(df = 50\) and confidence level 95%, we get \(t^{\star} = 2.009\text{.}\)
The 95% confidence interval is given by: \(5.3 \ \pm\ 2.009 \times 4.46 \ \rightarrow\ 5.3 \ \pm\ 8.96 \ \rightarrow\ (-3.66,\ 14.26)\text{.}\)
Conclude: We are 95% confident that the true difference in average score between Version A and Version B is between -3.7 and 14.3 points. Because the interval contains both positive and negative values, the data do not convincingly show that one exam version is more difficult than the other, and the teacher should not be convinced that she should add points to the Version B exam scores.
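For readers working in software rather than on a calculator, the following Python sketch (using numpy and scipy.stats with the summary statistics from the table above) reproduces the degrees of freedom and a 95% interval; the small differences from the hand calculation come from rounding \(t^{\star}\) to the table value.

```python
import numpy as np
from scipy import stats

# Summary statistics for the two exam versions.
xbar1, s1, n1 = 79.4, 14, 30   # Version A
xbar2, s2, n2 = 74.1, 20, 30   # Version B

point_estimate = xbar1 - xbar2
se = np.sqrt(s1**2 / n1 + s2**2 / n2)

# Welch-Satterthwaite degrees of freedom (the df a calculator reports).
v1, v2 = s1**2 / n1, s2**2 / n2
df = (v1 + v2)**2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))

t_star = stats.t.ppf(0.975, df)            # critical value for 95% confidence
lower = point_estimate - t_star * se
upper = point_estimate + t_star * se

print(f"df = {df:.1f}")                        # about 51.9
print(f"95% CI: ({lower:.1f}, {upper:.1f})")   # about (-3.6, 14.2)
```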
Subsection 7.3.5 Calculator: the 2-sample \(t\)-interval
TI-83/84: 2-sample T-interval.

Use STAT, TESTS, 2-SampTInt.

1. Choose STAT.
2. Right arrow to TESTS.
3. Down arrow and choose 0:2-SampTInt.
4. Choose Data if you have all the data or Stats if you have the means and standard deviations.
   - If you choose Data, let List1 be L1 or the list that contains sample 1 and let List2 be L2 or the list that contains sample 2 (don't forget to enter the data!). Let Freq1 and Freq2 be 1.
   - If you choose Stats, enter the mean, SD, and sample size for sample 1 and for sample 2.
5. Let C-Level be the desired confidence level and let Pooled be No.
6. Choose Calculate and hit ENTER, which returns:

(,) | the confidence interval | Sx1 | SD of sample 1 |
df | degrees of freedom | Sx2 | SD of sample 2 |
\(\bar{x}_1\) | mean of sample 1 | n1 | size of sample 1 |
\(\bar{x}_2\) | mean of sample 2 | n2 | size of sample 2 |
Casio fx-9750GII: 2-sample T-interval.

1. Navigate to STAT (MENU button, then hit the 2 button or select STAT).
2. If necessary, enter the data into a list.
3. Choose the INTR option (F4 button).
4. Choose the t option (F2 button).
5. Choose the 2-S option (F2 button).
6. Choose either the Var option (F2) or enter the data in using the List option.
7. Specify the test details:
   - Confidence level of interest for C-Level.
   - If using the Var option, enter the summary statistics for each group. If using List, specify the lists and leave Freq values at 1.
   - Choose whether to pool the data or not.
8. Hit the EXE button, which returns:

Left, Right | ends of the confidence interval |
df | degrees of freedom |
\(\bar{x}1\text{,}\) \(\bar{x}2\) | sample means |
sx1, sx2 | sample standard deviations |
n1, n2 | sample sizes |
Checkpoint 7.3.5.
Use the data below and a calculator to find a 95% confidence interval for the difference in average scores between Version A and Version B of the exam from the previous example. 4
Version | \(n\) | \(\bar{x}\) | \(s\) | min | max |
A | 30 | 79.4 | 14 | 45 | 100 |
B | 30 | 74.1 | 20 | 32 | 100 |

Use 2-SampTInt or equivalent. Because we have the summary statistics rather than all of the data, choose Stats. Let x1 \(=79.4\text{,}\) Sx1 \(=14\text{,}\) n1 \(=30\text{,}\) x2 \(=74.1\text{,}\) Sx2 \(= 20\text{,}\) and n2 \(= 30\text{.}\) The interval is (-3.6, 14.2) with \(df = 51.9\text{.}\)
Subsection 7.3.6 Hypothesis testing for the difference of two means
Four cases from a data set called ncbirths, which represents mothers and their newborns in North Carolina, are shown in Table 7.3.6. We are particularly interested in two variables: weight and smoke. The weight variable represents the weights of the newborns and the smoke variable describes which mothers smoked during pregnancy. We would like to know, is there convincing evidence that newborns from mothers who smoke have a different average birth weight than newborns from mothers who don't smoke? The smoking group includes a random sample of 50 cases and the nonsmoking group contains a random sample of 100 cases, represented in Figure 7.3.7.
 | fAge | mAge | weeks | weight | sex | smoke |
1 | NA | 13 | 37 | 5.00 | female | nonsmoker |
2 | NA | 14 | 36 | 5.88 | female | nonsmoker |
3 | 19 | 15 | 41 | 8.13 | male | smoker |
\(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) |
150 | 45 | 50 | 36 | 9.25 | female | nonsmoker |

Table 7.3.6: Four cases from the ncbirths data set. The value “NA”, shown for the first two entries of the first variable, indicates pieces of data that are missing.

![](images/inference_means/babySmokePlotOfTwoGroupsToExamineSkew.png)
Example 7.3.8.
Set up appropriate hypotheses to evaluate whether there is a relationship between a mother smoking and average birth weight.
Let \(\mu_{1}\) represent the mean for mothers that did smoke and \(\mu_2\) represent the mean for mothers that did not smoke. We will take the difference as: smoker \(-\) nonsmoker. The null hypothesis represents the case of no difference between the groups.
\(H_{0}: \mu_{1} - \mu_{2} = 0\text{.}\) There is no difference in average birth weight for newborns from mothers who did and did not smoke.
\(H_{A}: \mu_{1} - \mu_{2} \neq 0\text{.}\) There is some difference in average newborn weights from mothers who did and did not smoke.
We check the two conditions necessary to model the difference in sample means using the \(t\)-distribution. (1) Because the data come from a sample, we need there to be two independent random samples. In fact, there was only one random sample, but it is reasonable that the two groups here are independent of each other, so we will consider the assumption of independence reasonable. (2) The sample sizes of 50 and 100 are well over 30, so we do not worry about the distributions of the original populations. Since both conditions are satisfied, the difference in sample means may be modeled using a \(t\)-distribution.
 | smoker | nonsmoker |
mean | 6.78 | 7.18 |
st. dev. | 1.43 | 1.60 |
samp. size | 50 | 100 |

Table 7.3.9: Summary statistics for the ncbirths data set.

Example 7.3.10.
We will use the summary statistics in Table 7.3.9 for this exercise.
(a) What is the point estimate of the population difference, \(\mu_{1} - \mu_{2}\text{?}\) (b) Compute the standard error of the point estimate from part (a).
(a) The point estimate is the difference of sample means: \(\bar{x}_{1} - \bar{x}_{2} = 6.78-7.18=-0.40\) pounds.
(b) The standard error for a difference of sample means is calculated analogously to the standard deviation for a difference of sample means: \(SE = \sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}} = \sqrt{\frac{1.43^2}{50}+\frac{1.60^2}{100}} = 0.26\) pounds.
Example 7.3.11.
Compute the test statistic.
We have already found the point estimate and the \(SE\) of estimate. The null hypothesis is that the two means are equal, or that their difference equals 0. The null value for the difference, therefore, is 0. We now have everything we need to compute the test statistic: \(T = \frac{\text{point estimate} - \text{null value}}{SE\text{ of estimate}} = \frac{-0.40 - 0}{0.26} = -1.54\text{.}\)
Example 7.3.12.
Draw a picture to represent the p-value for this hypothesis test, then calculate the p-value.
To depict the p-value, we draw the distribution of the point estimate as though \(H_0\) were true and shade areas representing at least as much evidence against \(H_0\) as what was observed. Both tails are shaded because it is a two-sided test.
![](images/inference_means/distOfDiffOfSampleMeansForBWOfBabySmokeData.png)
We saw previously that the degrees of freedom can be found using software or using the smaller of \(n_1-1\) and \(n_2-1\text{.}\) If we use \(50-1=49\) degrees of freedom, we find that the area in the upper tail is 0.065. The p-value is twice this, or \(2\times 0.065= 0.130\text{.}\) See Subsection 7.3.7 for a shortcut to compute the degrees of freedom and p-value on a calculator.
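The same computation can be done in software. Below is a sketch in Python (scipy.stats) using the summary statistics in Table 7.3.9; the results differ slightly from the hand calculation above because the hand version rounds the \(SE\) to 0.26 and uses the conservative \(df = 49\) rather than the software degrees of freedom.

```python
import numpy as np
from scipy import stats

# Summary statistics from Table 7.3.9: smokers vs nonsmokers.
xbar1, s1, n1 = 6.78, 1.43, 50     # smoker
xbar2, s2, n2 = 7.18, 1.60, 100    # nonsmoker

se = np.sqrt(s1**2 / n1 + s2**2 / n2)
t_stat = (xbar1 - xbar2 - 0) / se            # null value is 0

# Welch-Satterthwaite degrees of freedom.
v1, v2 = s1**2 / n1, s2**2 / n2
df = (v1 + v2)**2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))

p_value = 2 * stats.t.sf(abs(t_stat), df)    # two-sided test
print(f"T = {t_stat:.2f}, df = {df:.1f}, p-value = {p_value:.3f}")
# T is about -1.55 with df near 108; the p-value is about 0.12,
# in line with the 0.13 found by hand with the rounded SE and df = 49.
```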
Example 7.3.13.
What can we conclude from this p-value? Use a significance level of \(\alpha=0.05\text{.}\)
This p-value of 0.130 is larger than the significance level of 0.05, so we do not reject the null hypothesis. There is not sufficient evidence to say there is a difference in average birth weight of newborns from North Carolina mothers who did smoke during pregnancy and newborns from North Carolina mothers who did not smoke during pregnancy.
Example 7.3.14.
Does the conclusion to Example 7.3.13 mean that smoking and average birth weight are unrelated?
Not necessarily. It is possible that there is some difference but that we did not detect it. The result must be considered in light of other evidence and research. In fact, larger data sets do tend to show that women who smoke during pregnancy have smaller newborns.
Checkpoint 7.3.15.
If we made an error in our conclusion, which type of error could we have made: Type I or Type II? 5
Checkpoint 7.3.16.
If we made a Type II Error and there is a difference, what could we have done differently in data collection to be more likely to detect the difference? 6
Hypothesis test for the difference of two means.
To carry out a complete hypothesis test to test the claim that two means \(\mu_1\) and \(\mu_2\) are equal to each other,
Identify: Identify the hypotheses and the significance level, \(\alpha\text{.}\)
\(H_0\text{:}\) \(\mu_1=\mu_2\)
\(H_A\text{:}\) \(\mu_1\ne \mu_2\text{;}\) \(H_A\text{:}\) \(\mu_1>\mu_2\text{;}\) or \(H_A\text{:}\) \(\mu_1\lt \mu_2\)
Choose: Choose the appropriate test procedure and identify it by name.
Here we choose the 2-sample \(t\)-test.
Check: Check conditions for the sampling distribution of \(\bar{x}_1-\bar{x}_2\) to be nearly normal.
Data come from 2 independent random samples or 2 randomly assigned treatments.
\(n_1\ge 30\) and \(n_2\ge 30\) or both population distributions are nearly normal. If the sample sizes are less than 30 and the population distributions are unknown, check for excessive skew or outliers in either data set. If neither is found, the condition that both population distributions are nearly normal is considered reasonable.
Calculate: Calculate the \(t\)-statistic, \(df\text{,}\) and p-value.
\(T = \frac{\text{ point estimate } - \text{ null value } }{SE \text{ of estimate } }\) \(df\text{:}\) use calculator or other technology
point estimate: the difference of sample means \(\bar{x}_1 - \bar{x}_2\)
\(SE\) of estimate: \(\sqrt{\frac{s^2_1}{n_1}+\frac{s^2_2}{n_2}}\)
p-value = (based on the \(t\)-statistic, the \(df\text{,}\) and the direction of \(H_A\))
Conclude: Compare the p-value to \(\alpha\text{,}\) and draw a conclusion in context.
If the p-value is \(\lt \alpha\text{,}\) reject \(H_0\text{;}\) there is sufficient evidence that [\(H_A\) in context].
If the p-value is \(> \alpha\text{,}\) do not reject \(H_0\text{;}\) there is not sufficient evidence that [\(H_A\) in context].
Example 7.3.17.
Do embryonic stem cells (ESCs) help improve heart function following a heart attack? The following table and figure summarize results from an experiment to test ESCs in sheep that had a heart attack.
![](images/inference_means/stemCellTherapyForHearts.png)
 | \(n\) | \(\bar{x}\) | \(s\) |
ESCs | 9 | 3.50 | 5.17 |
control | 9 | -4.33 | 2.76 |
Each of these sheep was randomly assigned to the ESC or control group, and the change in their hearts' pumping capacity was measured. A positive value generally corresponds to increased pumping capacity, which suggests a stronger recovery. The sample data is also graphed. Use the given information and an appropriate statistical test to answer the research question.
Identify: Let \(\mu_1\) be the mean percent change for sheep that receive ESC and let \(\mu_2\) be the mean percent change for sheep in the control group. We will use an \(\alpha=0.05\) significance level.
\(H_0\text{:}\) \(\mu_{1} - \mu_{2} = 0\text{.}\) The stem cells do not improve heart pumping function.
\(H_A\text{:}\) \(\mu_{1} - \mu_{2} > 0\text{.}\) The stem cells do improve heart pumping function.
Choose: Because we are hypothesizing about a difference of means we choose the 2-sample \(t\)-test.
Check: The data come from an experiment with two randomly assigned treatments. The group sizes are small, but the data show no excessive skew or outliers, so the assumption that the population distributions are normal is reasonable.
Calculate: We will calculate the \(t\)-statistic and the p-value.
The point estimate is the difference of sample means: \(\bar{x}_1 - \bar{x}_2 = 3.50 - (-4.33) = 7.83\text{.}\)
The \(SE\) of a difference of sample means: \(\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}= \sqrt{\frac{(5.17)^2}{9} + \frac{(2.76)^2}{9}} = 1.95\)
The test statistic is \(T = \frac{7.83 - 0}{1.95} = 4.01\text{.}\) Because \(H_A\) is an upper tail test (\(>\)), the p-value corresponds to the area to the right of \(t=4.01\) with the appropriate degrees of freedom. Using a calculator, we find \(df=12.2\) and p-value \(= 8.4\times 10^{-4}=0.00084\text{.}\)
Conclude: The p-value is much less than 0.05, so we reject the null hypothesis. There is sufficient evidence that embryonic stem cells improve the heart's pumping function in sheep that have suffered a heart attack.
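As with the interval, this test can be reproduced in software. A Python sketch (scipy.stats) using the summary statistics above gives essentially the same \(T\text{,}\) \(df\text{,}\) and p-value as reported in the example.

```python
import numpy as np
from scipy import stats

# Summary statistics from the ESC experiment.
xbar1, s1, n1 = 3.50, 5.17, 9     # ESC group
xbar2, s2, n2 = -4.33, 2.76, 9    # control group

se = np.sqrt(s1**2 / n1 + s2**2 / n2)
t_stat = (xbar1 - xbar2 - 0) / se            # null value is 0

# Welch-Satterthwaite degrees of freedom.
v1, v2 = s1**2 / n1, s2**2 / n2
df = (v1 + v2)**2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))

p_value = stats.t.sf(t_stat, df)             # upper tail, since H_A: mu1 - mu2 > 0
print(f"T = {t_stat:.2f}, df = {df:.1f}, p-value = {p_value:.5f}")
# About T = 4.01 with df = 12.2 and a p-value near 0.0009, in line with the example.
```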
Subsection 7.3.7 Calculator: the 2-sample \(t\)-test
TI-83/84: 2-sample T-test.

Use STAT, TESTS, 2-SampTTest.

1. Choose STAT.
2. Right arrow to TESTS.
3. Choose 4:2-SampTTest.
4. Choose Data if you have all the data or Stats if you have the means and standard deviations.
   - If you choose Data, let List1 be L1 or the list that contains sample 1 and let List2 be L2 or the list that contains sample 2 (don't forget to enter the data!). Let Freq1 and Freq2 be 1.
   - If you choose Stats, enter the mean, SD, and sample size for sample 1 and for sample 2.
5. Choose \(\ne\text{,}\) \(\lt\text{,}\) or \(\gt\) to correspond to \(H_A\text{.}\)
6. Let Pooled be NO.
7. Choose Calculate and hit ENTER, which returns:

t | t statistic | Sx1 | SD of sample 1 |
p | p-value | Sx2 | SD of sample 2 |
df | degrees of freedom | n1 | size of sample 1 |
\(\bar{x}_1\) | mean of sample 1 | n2 | size of sample 2 |
\(\bar{x}_2\) | mean of sample 2 | | |
Casio fx-9750GII: 2-sample T-test.

1. Navigate to STAT (MENU button, then hit the 2 button or select STAT).
2. If necessary, enter the data into a list.
3. Choose the TEST option (F3 button).
4. Choose the t option (F2 button).
5. Choose the 2-S option (F2 button).
6. Choose either the Var option (F2) or enter the data in using the List option.
7. Specify the test details:
   - Specify the sidedness of the test using the F1, F2, and F3 keys.
   - If using the Var option, enter the summary statistics for each group. If using List, specify the lists and leave Freq values at 1.
   - Choose whether to pool the data or not.
8. Hit the EXE button, which returns:

\(\mu1 - \mu2\) | alt. hypothesis | \(\bar{x}1\text{,}\) \(\bar{x}2\) | sample means |
t | t statistic | sx1, sx2 | sample standard deviations |
p | p-value | n1, n2 | sample sizes |
df | degrees of freedom | | |
Checkpoint 7.3.18.
Use the data below and a calculator to find the test statistic and p-value for a one-sided test, testing whether there is evidence that embryonic stem cells (ESCs) help improve heart function for sheep that have experienced a heart attack. 7

 | \(n\) | \(\bar{x}\) | \(s\) |
ESCs | 9 | 3.50 | 5.17 |
control | 9 | -4.33 | 2.76 |

Use 2-SampTTest or equivalent. Because we have the summary statistics rather than all of the data, choose Stats. Let x1 \(=3.50\text{,}\) Sx1 \(=5.17\text{,}\) n1 \(=9\text{,}\) x2 \(=-4.33\text{,}\) Sx2 \(= 2.76\text{,}\) and n2 \(= 9\text{.}\) We get t \(= 4.01\) and p-value \(= 8.4 \times 10^{-4}= 0.00084\text{.}\) The degrees of freedom for the test is \(df = 12.2\text{.}\)
Subsection 7.3.8 Section summary
This section introduced inference for a difference of means, which is distinct from inference for a mean of differences. To calculate a difference of means, \(\bar{x}_1-\bar{x}_2\text{,}\) we first calculate the mean of each group, then we take the difference between those two numbers. To calculate a mean of differences, \(\bar{x}_{\text{diff}}\text{,}\) we first calculate all of the differences, then we find the mean of those differences.
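The computational distinction is easy to see in code. The sketch below uses hypothetical paired data (invented purely for illustration) to contrast the two calculations; the point estimates happen to coincide, but the standard errors, and therefore the inference procedures, do not.

```python
import numpy as np

# Hypothetical pre/post scores for the same five subjects (illustration only).
pre  = np.array([80.0, 75.0, 90.0, 62.0, 70.0])
post = np.array([84.0, 74.0, 95.0, 66.0, 71.0])
n = len(pre)

# Mean of differences (matched pairs): difference within each pair, then average.
diffs = post - pre
mean_of_diffs = diffs.mean()
se_paired = diffs.std(ddof=1) / np.sqrt(n)

# Difference of means (two-sample): average each group, then subtract.
diff_of_means = post.mean() - pre.mean()
se_two_sample = np.sqrt(post.var(ddof=1) / n + pre.var(ddof=1) / n)

print(mean_of_diffs, diff_of_means)    # the point estimates agree (both 2.6)
print(se_paired, se_two_sample)        # but the standard errors differ a lot
```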
Inference for a difference of means is based on the \(t\)-distribution. The degrees of freedom are complicated to calculate, so we rely on a calculator or other software to find them. 8
When there are two samples or treatments and the parameter of interest is a difference of means:

Estimate \(\mu_1-\mu_2\) at the C% confidence level using a 2-sample \(t\)-interval.

Test \(H_0\text{:}\) \(\mu_1-\mu_2=0\) (i.e. \(\mu_1=\mu_2\)) at the \(\alpha\) significance level using a 2-sample \(t\)-test.
The conditions for the two sample \(t\)-interval and \(t\)-test are the same.
The data come from 2 independent random samples or 2 randomly assigned treatments.
\(n_1\ge 30\) and \(n_2\ge 30\) or both population distributions are nearly normal.
If the sample sizes are less than 30 and it is not known that both population distributions are nearly normal, check for excessive skew or outliers in the data. If neither exists, the condition that both population distributions could be nearly normal is considered reasonable.
When the conditions are met, we calculate the confidence interval and the test statistic as follows.
Confidence interval: \(\text{ point estimate } \ \pm\ t^{\star} \times SE\ \text{ of estimate }\)
Test statistic: \(T = \frac{\text{ point estimate } - \text{ null value } }{SE \text{ of estimate } }\)
Here the point estimate is the difference of sample means: \(\bar{x}_1-\bar{x}_2\text{.}\)
The \(SE\) of estimate is the \(SE\) of a difference of sample means: \(\sqrt{\frac{s^2_1}{n_1}+\frac{s^2_2}{n_2}}\text{.}\)
Find and record the \(df\) using a calculator or other software.
Exercises 7.3.9 Exercises
1. Friday the 13th, Part I.
In the early 1990's, researchers in the UK collected data on traffic flow, number of shoppers, and traffic accident related emergency room admissions on Friday the \(13^{th}\) and the previous Friday, Friday the \(6^{th}\text{.}\) The histograms below show the distribution of number of cars passing by a specific intersection on Friday the \(6^{th}\) and Friday the \(13^{th}\) for many such date pairs. Also given are some sample statistics, where the difference is the number of cars on the 6th minus the number of cars on the 13th. 9
![](images/inference_means/friday_13th_traffic_hist.png)
 | \(6^{th}\) | \(13^{th}\) | Diff. |
\(\bar{x}\) | 128,385 | 126,550 | 1,835 |
\(s\) | 7,259 | 7,664 | 1,176 |
\(n\) | 10 | 10 | 10 |
Are there any underlying structures in these data that should be considered in an analysis? Explain.
What are the hypotheses for evaluating whether the number of people out on Friday the \(6^{th}\) is different than the number out on Friday the \(13^{th}\text{?}\)
Check conditions to carry out the hypothesis test from part (b).
Calculate the test statistic and the p-value.
What is the conclusion of the hypothesis test?
Interpret the p-value in this context.
What type of error might have been made in the conclusion of your test? Explain.
(a) These data are paired. For example, the Friday the 13th in say, September 1991, would probably be more similar to the Friday the 6th in September 1991 than to Friday the 6th in another month or year.
(b) Let \(\mu_{diff} = \mu_{sixth} - \mu_{thirteenth}\text{.}\) \(H_{0} : \mu_{diff} = 0\text{.}\) \(H_{A} : \mu_{diff} \ne 0\text{.}\)
(c) Independence: The months selected are not random. However, if we think these dates are roughly equivalent to a simple random sample of all such Friday 6th/13th date pairs, then independence is reasonable. To proceed, we must make this strong assumption, though we should note this assumption in any reported results. Normality: With fewer than 10 observations, we would need to see clear outliers to be concerned. There is a borderline outlier on the right of the histogram of the differences, so we would want to report this in formal analysis results.
(d) \(T = 4.94\) for \(df = 10 - 1 = 9 \rightarrow \text{p-value } = 0.001\text{.}\)
(e) Since \(\text{p-value } < 0.05\text{,}\) reject \(H_{0}\text{.}\) The data provide strong evidence that the average number of cars at the intersection is higher on Friday the \(6^{th}\) than on Friday the \(13^{th}\text{.}\) (We should exercise caution about generalizing the interpretation to all intersections or roads.)
(f) If the average number of cars passing the intersection actually was the same on Friday the \(6^{th}\) and \(13^{th}\text{,}\) then the probability that we would observe a test statistic so far from zero is less than 0.01.
(g) We might have made a Type 1 Error, i.e. incorrectly rejected the null hypothesis.
2. Diamonds, Part I.
Prices of diamonds are determined by what is known as the 4 Cs: cut, clarity, color, and carat weight. The prices of diamonds go up as the carat weight increases, but the increase is not smooth. For example, the difference between the size of a 0.99 carat diamond and a 1 carat diamond is undetectable to the naked human eye, but the price of a 1 carat diamond tends to be much higher than the price of a 0.99 carat diamond. In this question we use two random samples of diamonds, 0.99 carats and 1 carat, each sample of size 23, and compare the average prices of the diamonds. In order to be able to compare equivalent units, we first divide the price for each diamond by 100 times its weight in carats. That is, for a 0.99 carat diamond, we divide the price by 99. For a 1 carat diamond, we divide the price by 100. The distributions and some sample statistics are shown below. 10
Conduct a hypothesis test to evaluate if there is a difference between the average standardized prices of 0.99 and 1 carat diamonds. Make sure to state your hypotheses clearly, check relevant conditions, and interpret your results in context of the data.
 | 0.99 carats | 1 carat |
Mean | $44.51 | $56.81 |
SD | $13.32 | $16.13 |
n | 23 | 23 |
![](images/inference_means/diamonds_box.png)
3. Friday the 13th, Part II.
The Friday the 13\(^{th}\) study reported in Exercise 7.3.9.1 also provides data on traffic accident related emergency room admissions. The distributions of these counts from Friday the 6\(^{th}\) and Friday the 13\(^{th}\) are shown below for six such paired dates along with summary statistics. You may assume that conditions for inference are met.
![](images/inference_means/friday_13th_accident_hist.png)
![](images/inference_means/friday_13th_accident_hist_diff.png)
 | 6\(^{th}\) | 13\(^{th}\) | diff |
Mean | 7.5 | 10.83 | -3.33 |
SD | 3.33 | 3.6 | 3.01 |
n | 6 | 6 | 6 |
Conduct a hypothesis test to evaluate if there is a difference between the average numbers of traffic accident related emergency room admissions between Friday the 6\(^{th}\) and Friday the 13\(^{th}\text{.}\)
Calculate a 95% confidence interval for the difference between the average numbers of traffic accident related emergency room admissions between Friday the 6\(^{th}\) and Friday the 13\(^{th}\text{.}\)
The conclusion of the original study states, “Friday 13th is unlucky for some. The risk of hospital admission as a result of a transport accident may be increased by as much as 52%. Staying at home is recommended.” Do you agree with this statement? Explain your reasoning.
(a) \(H_{0} : \mu_{diff} = 0\text{.}\) \(H_{A} : \mu_{diff} \ne 0\text{.}\) \(T = -2.71\text{.}\) \(df = 5\text{.}\) \(\text{p-value } = 0.042\text{.}\) Since \(\text{p-value } < 0.05\text{,}\) reject \(H_{0}\text{.}\) The data provide strong evidence that the average numbers of traffic accident related emergency room admissions are different between Friday the \(6^{th}\) and Friday the \(13^{th}\text{.}\) Furthermore, the data indicate that the direction of that difference is that admissions are lower on Friday the \(6^{th}\) relative to Friday the \(13^{th}\text{.}\)
(b) \((-6.49, -0.17)\text{.}\)
(c) This is an observational study, not an experiment, so we cannot so easily infer the causal relationship implied by this statement. It is true that there is a difference. However, for example, this does not mean that a responsible adult going out on Friday the \(13^{th}\) has a higher chance of harm than on any other night.
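A quick sketch (Python, scipy.stats) that reproduces the numbers in this solution from the summary statistics; since the data are paired, this is a one-sample \(t\)-procedure applied to the differences.

```python
import numpy as np
from scipy import stats

# Summary statistics for the differences (6th minus 13th) from the table.
mean_diff, sd_diff, n = -3.33, 3.01, 6

se = sd_diff / np.sqrt(n)
t_stat = mean_diff / se
df = n - 1
p_value = 2 * stats.t.sf(abs(t_stat), df)        # two-sided test

t_star = stats.t.ppf(0.975, df)                  # for the 95% confidence interval
lower = mean_diff - t_star * se
upper = mean_diff + t_star * se

print(f"T = {t_stat:.2f}, df = {df}, p-value = {p_value:.3f}")   # about -2.71 and 0.042
print(f"95% CI: ({lower:.2f}, {upper:.2f})")                     # about (-6.49, -0.17)
```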
4. Diamonds, Part II.
In Exercise 7.3.9.2, we discussed diamond prices (standardized by weight) for diamonds with weights 0.99 carats and 1 carat. See the table for summary statistics, and then construct a 95% confidence interval for the average difference between the standardized prices of 0.99 and 1 carat diamonds. You may assume the conditions for inference are met.
 | 0.99 carats | 1 carat |
Mean | $44.51 | $56.81 |
SD | $13.32 | $16.13 |
n | 23 | 23 |
5. Chicken diet and weight, Part I.
Chicken farming is a multi-billion dollar industry, and any methods that increase the growth rate of young chicks can reduce consumer costs while increasing company profits, possibly by millions of dollars. An experiment was conducted to measure and compare the effectiveness of various feed supplements on the growth rate of chickens. Newly hatched chicks were randomly allocated into six groups, and each group was given a different feed supplement. Below are some summary statistics from this data set along with box plots showing the distribution of weights by feed type. 11
![](images/inference_means/chick_wts_box.png)
Describe the distributions of weights of chickens that were fed linseed and horsebean.
Do these data provide strong evidence that the average weights of chickens that were fed linseed and horsebean are different? Use a 5% significance level.
What type of error might we have committed? Explain.
Would your conclusion change if we used \(\alpha = 0.01\text{?}\)
(a) Chickens fed linseed weighed an average of 218.75 grams, while those fed horsebean weighed an average of 160.20 grams. Both distributions are relatively symmetric with no apparent outliers. There is more variability in the weights of chickens fed linseed.
(b) \(H_{0} : \mu_{ls} = \mu_{hb}\text{.}\) \(H_{A} : \mu_{ls} \ne \mu_{hb}\text{.}\) We leave the conditions to you to consider. \(T = 3.02\text{,}\) \(df = min(11, 9) = 9 \rightarrow \text{p-value } = 0.014\text{.}\) Since \(\text{p-value } < 0.05\text{,}\) reject \(H_{0}\text{.}\) The data provide strong evidence that there is a significant difference between the average weights of chickens that were fed linseed and horsebean.
(c) Type 1 Error, since we rejected \(H_{0}\text{.}\)
(d) Yes, since \(\text{p-value } > 0.01\text{,}\) we would not have rejected \(H_{0}\text{.}\)
6. Fuel efficiency of manual and automatic cars, Part I.
Each year the US Environmental Protection Agency (EPA) releases fuel economy data on cars manufactured in that year. Below are summary statistics on fuel efficiency (in miles/gallon) from random samples of cars with manual and automatic transmissions. Do these data provide strong evidence of a difference between the average fuel efficiency of cars with manual and automatic transmissions in terms of their average city mileage? Assume that conditions for inference are satisfied. 12
 | City MPG | |
 | Automatic | Manual |
Mean | 16.12 | 19.85 |
SD | 3.58 | 4.51 |
n | 26 | 26 |
![](images/inference_means/fuel_eff_city_box.png)
7. Chicken diet and weight, Part II.
Casein is a common weight gain supplement for humans. Does it have an effect on chickens? Using data provided in Exercise 7.3.9.5, test the hypothesis that the average weight of chickens that were fed casein is different than the average weight of chickens that were fed soybean. If your hypothesis test yields a statistically significant result, discuss whether or not the higher average weight of chickens can be attributed to the casein diet. Assume that conditions for inference are satisfied.
\(H_{0} : \mu_{C} = \mu_{S}\text{.}\) \(H_{A} : \mu_{C} \ne \mu_{S}\text{.}\) \(T = 3.27\text{,}\) \(df = 11 \rightarrow \text{p-value } = 0.007\text{.}\) Since \(\text{p-value } < 0.05\text{,}\) reject \(H_{0}\text{.}\) The data provide strong evidence that the average weight of chickens that were fed casein is different than the average weight of chickens that were fed soybean (with weights from casein being higher). Since this is a randomized experiment, the observed difference can be attributed to the diet.
8. Fuel efficiency of manual and automatic cars, Part II.
The table provides summary statistics on highway fuel economy of the same 52 cars from Exercise 7.3.9.6. Use these statistics to calculate a 98% confidence interval for the difference between average highway mileage of manual and automatic cars, and interpret this interval in the context of the data. 13
 | Hwy MPG | |
 | Automatic | Manual |
Mean | 22.92 | 27.88 |
SD | 5.29 | 5.01 |
n | 26 | 26 |
![](images/inference_means/fuel_eff_hway_box.png)
9. Prison isolation experiment, Part I.
Subjects from Central Prison in Raleigh, NC, volunteered for an experiment involving an “isolation” experience. The goal of the experiment was to find a treatment that reduces subjects' psychopathic deviant T scores. This score measures a person's need for control or their rebellion against control, and it is part of a commonly used mental health test called the Minnesota Multiphasic Personality Inventory (MMPI) test. The experiment had three treatment groups:
Four hours of sensory restriction plus a 15 minute “therapeutic” tape advising that professional help is available.
Four hours of sensory restriction plus a 15 minute “emotionally neutral” tape on training hunting dogs.
Four hours of sensory restriction but no taped message.
Forty-two subjects were randomly assigned to these treatment groups, and an MMPI test was administered before and after the treatment. Distributions of the differences between pre and post treatment scores (pre - post) are shown below, along with some sample statistics. Use this information to independently test the effectiveness of each treatment. Make sure to clearly state your hypotheses, check conditions, and interpret results in the context of the data. 14
![](images/inference_means/prison_isolation_hist.png)
 | Tr 1 | Tr 2 | Tr 3 |
Mean | 6.21 | 2.86 | -3.21 |
SD | 12.3 | 7.94 | 8.57 |
n | 14 | 14 | 14 |
Let \(\mu_{diff} = \mu_{pre}-\mu_{post}\text{.}\) \(H_{0}: \mu_{diff} = 0\text{:}\) Treatment has no effect. \(H_{A}: \mu_{diff} \ne 0\text{:}\) Treatment has an effect on P.D.T. scores, either positive or negative. Conditions: The subjects are randomly assigned to treatments, so independence within and between groups is satisfied. All three sample sizes are smaller than 30, so we look for clear outliers. There is a borderline outlier in the first treatment group. Since it is borderline, we will proceed, but we should report this caveat with any results. For all three groups: \(df = 13\text{.}\) \(T_{1} = 1.89 \rightarrow \text{p-value } = 0.081\text{,}\) \(T_{2} = 1.35 \rightarrow \text{p-value } = 0.200\text{,}\) \(T_{3} = -1.40 \rightarrow \text{p-value } = 0.185\text{.}\) We do not reject the null hypothesis for any of these groups. As noted earlier, there is some uncertainty about whether the method applied is reasonable for the first group.
10. True / False: comparing means.
Determine if the following statements are true or false, and explain your reasoning for statements you identify as false.
When comparing means of two samples where \(n_1 = 20\) and \(n_2 = 40\text{,}\) we can use the normal model for the difference in means since \(n_2 \ge 30\text{.}\)
As the degrees of freedom increases, the \(t\)-distribution approaches normality.
We use a pooled standard error for calculating the standard error of the difference between means when sample sizes of groups are equal to each other.
Subsection 7.3.10 Chapter Highlights
We've reviewed a wide set of inference procedures over the last 3 chapters. Let's revisit each and discuss the similarities and differences among them. The following confidence intervals and tests are structurally the same — they all involve inference on a population parameter, where that parameter is a proportion, a difference of proportions, a mean, a mean of differences, or a difference of means.
1-proportion \(z\)-test/interval
2-proportion \(z\)-test/interval
1-sample \(t\)-test/interval
matched pairs \(t\)-test/interval
2-sample \(t\)-test/interval
The above inferential procedures all involve a point estimate, a standard error of the estimate, and an assumption about the shape of the sampling distribution of the point estimate.
From Chapter 6, the \(\chi^2\) tests and their uses are as follows:
\(\chi^2\) goodness of fit - compares a categorical variable to a known/fixed distribution.
\(\chi^2\) test of homogeneity - compares a categorical variable across multiple groups.
\(\chi^2\) test of independence - looks for association between two categorical variables.
\(\chi^2\) is a measure of overall deviation between observed values and expected values. These tests stand apart from the others because when using \(\chi^2\) there is not a parameter of interest. For this reason there are no confidence intervals using \(\chi^2\text{.}\) Also, for \(\chi^2\) tests, the hypotheses are usually written in words, because they are about the distribution of one or more categorical variables, not about a single parameter.
While formulas and conditions vary, all of these procedures follow the same basic logic and process.
For a confidence interval, identify the parameter to be estimated and the confidence level. For a hypothesis test, identify the hypotheses to be tested and the significance level.
Choose the correct procedure.
Check that the conditions for its use are met.
Calculate the confidence interval or the test statistic and p-value, as well as the \(df\) if applicable.
Interpret the results and draw a conclusion based on the data.
For a summary of these hypothesis test and confidence interval procedures (including one more that we will encounter in Section 8.3), see the Inference Guide C.2.