AHSS Inference for a mean with the $t$-distribution

Section 7.1 Inference for a mean with the $t$-distribution

In this section, we turn our attention to numerical variables and answer questions such as the following:

How well can we estimate the mean income of people in a certain city, county, or state?
What is the average mercury content in various types of fish?
Are people's run times getting faster or slower, on average?
How does the sample size affect the expected error in our estimates?
When is it reasonable to model the sample mean $\bar{x}$ using a normal distribution, and when will we need to use a new distribution, known as the $t$-distribution?

Subsection 7.1.1 Learning objectives

Understand the relationship between a $t$-distribution and a normal distribution, and explain why we use a $t$-distribution for inference on a mean.
State and verify whether or not the conditions for inference for a mean based on the $t$-distribution are met. Understand when it is necessary to look at the distribution of the sample data.
Know the degrees of freedom associated with a one sample $t$-procedure.
Carry out a complete hypothesis test for a single mean.
Carry out a complete confidence interval procedure for a single mean.
Find the minimum sample size needed to estimate a mean with C% confidence and a margin of error no greater than a certain value.

Subsection 7.1.2 Using a normal distribution for inference when $\sigma$ is known

In Section 4.2 we saw that the distribution of a sample mean is normal if the population is normal or if the sample size is at least 30. In these problems, we used the population mean and population standard deviation to find a Z-score. However, in the case of inference, these values will be unknown. In rare circumstances we may know the standard deviation of a population, even though we do not know its mean. For example, in some industrial processes, the mean may be known to shift over time, while the standard deviation of the process remains the same. In these cases, we can use the normal model as the basis for our inference procedures. We use $\bar{x}$ as our point estimate for $\mu$ and the $SD$ formula for a sample mean calculated in Section 4.2: $\sigma_{\bar{x}} =\frac{\sigma}{\sqrt{n}}\text{.}$ That leads to a confidence interval and a test statistic as follows:

\begin{align*} \text{ CI: } \bar{x} \amp \ \pm \ z^{\star}\frac{\sigma}{\sqrt{n}} \amp \amp Z = \frac{\bar{x} - \text{ null value } }{\frac{\sigma}{\sqrt{n}}} \end{align*}

What happens if we do not know the population standard deviation $\sigma\text{,}$ as is usually the case? The best we can do is use the sample standard deviation, denoted by $s\text{,}$ to estimate the population standard deviation.

\begin{gather*} SE= \frac{s}{\sqrt{n}} \end{gather*}

However, when we do this we run into a problem: when carrying out our inference procedures, we will be trying to estimate two quantities: both the mean and the standard deviation. Looking at the $SD$ and $SE$ formulas, we can make some important observations that will give us a hint as to what will happen when we use $s$ instead of $\sigma\text{.}$

For a given population, $\sigma$ is a fixed number and does not vary.
$s\text{,}$ the standard deviation of a sample, will vary from one sample to the next and will not be exactly equal to $\sigma\text{.}$
The larger the sample size $n\text{,}$ the better the estimate $s$ will tend to be for $\sigma\text{.}$

For this reason, the normal model still works well when the sample size is large. For smaller sample sizes, we run into a problem: our use of $s\text{,}$ which is used when computing the standard error, tends to add more variability to our test statistic. It is this extra variability that leads us to a new distribution: the $t$-distribution.

Subsection 7.1.3 Introducing the $t$-distribution

When we use the sample standard deviation $s$ in place of the population standard deviation $\sigma$ to standardize the sample mean, we get an entirely new distribution - one that is similar to the normal distribution, but has greater spread. This distribution is known as the $t$-distribution. A $t$-distribution, shown as a solid line in Figure 7.1.1, has a bell shape. However, its tails are thicker than the normal model's. We can see that a greater proportion of the area under the $t$-distribution is beyond 2 standard units from 0 than under the normal distribution. These extra thick tails are exactly the correction we need to resolve the problem of a poorly estimated standard deviation.

Figure 7.1.1. Comparison of a $t$-distribution (solid line) and a normal distribution (dotted line).

The $t$-distribution, always centered at zero, has a single parameter: degrees of freedom. The degrees of freedom (df) describes the precise form of the bell-shaped $t$-distribution. Several $t$-distributions are shown in Figure 7.1.2. When there are more degrees of freedom, the $t$-distribution looks more like the standard normal distribution.

Degrees of freedom.

The degrees of freedom describes the shape of the $t$-distribution. The larger the degrees of freedom, the more closely the distribution resembles the standard normal distribution.

When the degrees of freedom is large, about 30 or more, the $t$-distribution is nearly indistinguishable from the normal distribution. In Subsection 7.1.5, we will see how degrees of freedom relates to sample size.

We will find it useful to become familiar with the $t$-distribution, because it plays a very similar role to the normal distribution during inference. We use a $t$-table, partially shown in Table 7.1.3, in place of the normal probability table when the population standard deviation is unknown, especially when the sample size is small. A larger table is presented in Section B.3.

	one tail	0.100	0.050	0.025	0.010	0.005

$df$	1	3.078	6.314	12.71	31.82	63.66
	2	1.886	2.920	4.303	6.965	9.925
	3	1.638	2.353	3.182	4.541	5.841
	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$
	17	1.333	1.740	2.110	2.567	2.898
	18	1.330	1.734	2.101	2.552	2.878
	19	1.328	1.729	2.093	2.539	2.861
	20	1.325	1.725	2.086	2.528	2.845
	$\vdots$	$\vdots$	$\vdots$	$\vdots$	$\vdots$
	1000	1.282	1.646	1.962	2.330	2.581
	$\infty$	1.282	1.645	1.960	2.326	2.576

Confidence level C		80%	90%	95%	98%	99%

Table 7.1.3. An abbreviated look at the $t$-table. Each row represents a different $t$-distribution. The columns describe the cutoffs for specific tail areas. The row with $df=18$ has been highlighted.

Each row in the $t$-table represents a $t$-distribution with different degrees of freedom. The columns correspond to tail probabilities. For instance, if we know we are working with the $t$-distribution with $df=18\text{,}$ we can examine row 18, which is highlighted in Table 7.1.3. If we want the value in this row that identifies the cutoff for an upper tail of 10%, we can look in the column where one tail is 0.100. This cutoff is 1.33. If we had wanted the cutoff for the lower 10%, we would use -1.33. Just like the normal distribution, all $t$-distributions are symmetric.

Example 7.1.4.

What proportion of the $t$-distribution with 18 degrees of freedom falls below -2.10?


\(n\)	\(\bar{x}\)	\(s\)	minimum	maximum
19	4.4	2.3	1.7	9.2

	alternative hypothesis	\(\bar{x}\)	sample mean
`t`	T statistic	`sx`	sample standard deviation
`p`	p-value	`n`	sample size


n	\(\bar{x}\)	s	min	max

25	7.73	0.77	6.17	9.78

Min	147.2
Q1	163.8
Median	170.3
Mean	171.1
SD	9.4
Q3	177.8
Max	198.1

Section 7.1 Inference for a mean with the \(t\)-distribution

Subsection 7.1.1 Learning objectives

Subsection 7.1.2 Using a normal distribution for inference when \(\sigma\) is known

Subsection 7.1.3 Introducing the \(t\)-distribution

Degrees of freedom.

Example 7.1.4.

Example 7.1.6.

Example 7.1.7.

Example 7.1.8.

Subsection 7.1.4 Calculator: finding area under the \(t\)-distribution

TI-84: Finding area under the T-curve.

Casio fx-9750GII: Finding area under the T-distribution.

Checkpoint 7.1.10.

Checkpoint 7.1.11.

Subsection 7.1.5 Checking conditions for inference on a mean using the \(t\)-distribution

The normality condition with small samples.

Subsection 7.1.6 One sample \(t\)-interval for a mean

Degrees of freedom for a single sample.

Example 7.1.14.

Example 7.1.15.

Constructing a confidence interval for a mean.

Example 7.1.16.

Example 7.1.17.

Subsection 7.1.7 Calculator: the 1-sample \(t\)-interval

TI-83/84: 1-sample T-interval.

Casio fx-9750GII: 1-sample T-interval.

Checkpoint 7.1.18.

Subsection 7.1.8 Choosing a sample size when estimating a mean

Example 7.1.19.

Identify a sample size for a particular margin of error.

Subsection 7.1.9 Hypothesis testing for a mean

Example 7.1.21.

The T-statistic.

Example 7.1.22.

Example 7.1.23.

Example 7.1.24.

Hypothesis test for a mean.

Example 7.1.25.

Checkpoint 7.1.26.

Subsection 7.1.10 Calculator: 1-sample \(t\)-test

TI-83/84: 1-sample T-test.

Casio fx-9750GII: 1-sample T-test.

Checkpoint 7.1.27.

Subsection 7.1.11 Section summary

Exercises 7.1.12 Exercises

1. Identify the critical \(t\).

2. \(t\)-distribution.

3. Find the p-value, Part I.

4. Find the p-value, Part II.

5. Working backwards, Part I.

6. Working backwards, Part II.

7. Sleep habits of New Yorkers.

8. Heights of adults.

9. Find the mean.

10. \(t^\star\) vs. \(z^\star\).

11. Play the piano.

12. Auto exhaust and lead exposure.

13. Car insurance savings.

14. SAT scores.