AHSS Inference for a single proportion

Section 6.1 Inference for a single proportion

In this section, we will apply the inferential procedures introduced in Chapter 5 to the context of a single proportion, and we will explore how to do sample size calculations for data collection purposes. We will answer questions such as the following:

Do greater than half of adults in the U.S. oppose nuclear energy?
What percent of adults in the U.S. approve of the way the Supreme Court is handling its job?
What is the standard error is associated with this estimate?
How do we construct a confidence interval for this value?
What sample size is required to estimate this within a 3% margin of error using a 95% confidence level?

Subsection 6.1.1 Learning objectives

State and verify whether or not the conditions for inference on a proportion using a normal distribution are met.
Recognize that the success-failure condition and the standard error calculation are different for the test and for the confidence interval and explain why this is the case.
Carry out a complete hypothesis test and confidence interval procedure for a single proportion.
Find the minimum sample size needed to estimate a proportion with C% confidence and a margin of error no greater than a certain value.
Recognize that margin of error calculations only measure sampling error, and that other types of errors may be present.

Subsection 6.1.2 Distribution of a sample proportion (review)

The distribution of a sample proportion, such as the distribution of all possible values for the proportion of people who share a particular opinion in a poll, was introduced in Section 4.5. When the sampling distribution of a sample proportion, $\hat{p}\text{,}$ is approximately normal, we can use confidence intervals and hypothesis tests based on a normal distribution. We call these Z-intervals and Z-tests for short. Here, we review the conditions necessary for a sample proportion to be modeled using a normal distribution.

Conditions for the sampling distribution of $\hat{p}$ being nearly normal.

The sampling distribution of a sample proportion, $\hat{p}\text{,}$ based on a random sample of size $n$ from a population with a true proportion $p\text{,}$ is nearly normal when

the sample observations are independent and
$np\geq10$ and $n(1-p)\geq10\text{.}$ This is called the success-failure condition.

If these conditions are met, then the sampling distribution of $\hat{p}$ is nearly normal with mean $\mu_{\hat{p}}=p$ and standard deviation $\sigma_{\hat{p}} = \sqrt{\frac{\ p(1-p)\ }{n}}\text{.}$

Subsection 6.1.3 Checking conditions for inference using a normal distribution

We can use a normal model for inference for a proportion when the observations are independent and the sampling distribution of the sample proportion is nearly normal. We check that these assumptions are reasonable by verifying the following conditions.

Independent. Observations can be considered independent when the data are collected from a random process, such as tossing a coin, or from a random sample. Without a random sample or process, the standard error formula would not apply, and it is unclear to what population the inference would apply. When sampling without replacement from a finite population, the observations can be considered independent when sampling less than 10% of the population. ¹
Nearly normal sampling distribution. We saw in Section 4.5 that the sampling distribution of a sample proportion will be nearly normal when the success-failure condition is met, i.e. when the expected number of success and failures are both at least 10.

When sampling without replacement and sampling greater than 10% of the population, a modified standard error formula should be used.

In our examples, we generally sample from large populations, such as the United States. In these cases, we do not explicitly verify that the sample size is less than 10% of the population size. However, in borderline cases, one should remember to check this condition as well to ensure that the standard error estimate is reasonable.

Subsection 6.1.4 Confidence intervals for a proportion

The Gallup organization began measuring the public's view of the Supreme Court's job performance in 2000, and has measured it every year since then with the question: “Do you approve or disapprove of the way the Supreme Court is handling its job?”. In 2018, the Gallup poll randomly sampled 1,033 adults in the U.S. and found that 53% of them approved.² We know that 53% is just a point estimate. What range of values are reasonable estimates for the percent of the population that approved of the job the Supreme Court is doing? We can use the confidence interval procedure introduced in the previous chapter to answer this question, but first we must clearly identify the parameter we're trying to estimate and be sure that a Z-interval will be appropriate. The following examples walk through the various steps for carrying out a confidence interval procedure using the Gallup poll data.

https://news.gallup.com/poll/237269/supreme-court-approval-highest-2009.aspx

Example 6.1.1.

Identify the population of interest and the parameter of interest for the Gallup poll about the U.S. Supreme Court.

`z`	Z-statistic
`p`	p-value
\(\hat{p}\)	the sample proportion
`n`	the sample size

`z`	Z-statistic
`p`	p-value
\(\hat{p}\)	the sample proportion
`n`	the sample size

Section 6.1 Inference for a single proportion

Subsection 6.1.1 Learning objectives

Subsection 6.1.2 Distribution of a sample proportion (review)

Conditions for the sampling distribution of \(\hat{p}\) being nearly normal.

Subsection 6.1.3 Checking conditions for inference using a normal distribution

Subsection 6.1.4 Confidence intervals for a proportion

Example 6.1.1.

Example 6.1.2.

Example 6.1.3.

Example 6.1.4.

Example 6.1.5.

Example 6.1.6.

Constructing a confidence interval for a proportion.

Example 6.1.7.

Checkpoint 6.1.8.

Subsection 6.1.5 Calculator: the 1-proportion Z-interval

TI-83/84: 1-proportion Z-interval.

Casio fx-9750GII: 1-proportion Z-interval.

Checkpoint 6.1.9.

Subsection 6.1.6 Choosing a sample size when estimating a proportion

Margin of error.

Example 6.1.10.

Example 6.1.11.

Identify a sample size for a particular margin of error.

Checkpoint 6.1.12.

Checkpoint 6.1.13.

Subsection 6.1.7 Hypothesis testing for a proportion

Example 6.1.14.

Confidence intervals versus hypothesis tests for a single proportion.

Example 6.1.15.

invalidlabel.

invalidlabel.

Example 6.1.17.

Hypothesis testing for a proportion.

Example 6.1.18.

Checkpoint 6.1.19.

Subsection 6.1.8 Calculator: the 1-proportion Z-test

TI-83/84: 1-proportion Z-test.

Casio fx-9750GII: 1-proportion Z-test.

Checkpoint 6.1.20.

Subsection 6.1.9 Section summary

Exercises 6.1.10 Exercises

1. Vegetarian college students.

2. Young Americans, Part I.

3. Orange tabbies.

4. Young Americans, Part II.

5. Gender equality.

6. Elderly drivers.

7.

8. Life rating in Greece.

9. Study abroad.

10. Legalization of marijuana, Part I.

11. National Health Plan, Part I.

12. Is college worth it? Part I.

13. Taste test.

14. Is college worth it? Part II.

15. National Health Plan, Part II.

16. Legalize Marijuana, Part II.