Section 3.5 Chapter 3 Review
Exercises 3.5.1 Review Exercises
1.
Portland Community College serves nearly 73,000 full-time and part-time students in the greater Portland area at four main campuses (SE, Cascade, Sylvania, and RC). Student Affairs would like to know how students get to campus. They randomly select 250 students from each of the four main campus and ask them how they got to classes on campus. The following are the results of their survey:
Public Transportation: 435
Driving: 475
Biking: 65
Walking: 30
Identify the population and state its size.
Identify the sample and state its size.
What sampling method was used?
What type of data was collected?
Give the statistic for the percentage of students who use public transportation.
The population is PCC students and 73,000 is the sample size.
The sample is 250 students from the 4 main campuses.
Stratified Sample
Categorical or Qualitative
\(\displaystyle 435/1000 = 0.435 = 43.5\%\)
2.
CNN conducted a survey of 500 American adults. 62% of those surveyed answered yes to the question, “Do you favor a law to ban the sale of assault weapons and semiautomatic rifles?” The reported margin of error was \(\pm 4\%\text{.}\)
What population is being studied?
What is the sample?
What type of data is this?
Is the 62% reported in the problem an example of a statistic or a parameter?
What is the confidence interval? Is the confidence interval about the statistic or the parameter?
Explain what the confidence interval tells you.
The population being studied is American adults.
The sample is 500 American adults.
Categorical or Qualitative
The 62% reported in the problem an example of a statistic because it comes from a sample.
\(62\% - 4\% = 58\%\) and \(62\% + 4\% = 66\%\text{.}\) The confidence interval is \((58\%, 66\%)\) and it is in relation to the parameter.
We are confident that the true proportion of all adult Americans who favor a law to ban the sale of assault weapons and semi automatic rifles is between 58% and 66%.
3.
A survey of 265 PCC students found that 23%, plus or minus 4% prefer to study at the library.
What population is being studied?
What type of data was collected?
Is the reported 23% a statistic or a parameter?
What is the margin of error?
What is the confidence interval?
Explain what the confidence interval tells you.
The population being studied is PCC Students.
Categorical or Qualitative
23% is a statistic because it is from a sample.
The margin of error is 4%.
\(23\% - 4\% = 19\%\) and \(23\% + 4\% = 27\%\text{.}\) The confidence interval is \((19\%, 27\%)\text{.}\)
We are confident that the true proportion of all PCC students who prefer to study at the library is between 19% and 27%
4.
Identify the sampling method. Just the name will suffice.
Researchers select every 5th customer who walks into the store to take a survey.
Raffle tickets are distributed and collected in a bag, where they are mixed and ten are drawn for prizes.
I asked the shoppers near me in the shoe department what size they wear.
An IRS auditor randomly selects 25 taxpayers in each filing status (single, head of household, married filing jointly, and married filing separately).
Systematic
Simple Random Sample
Convenience
Systematic
5.
Identify the most relevant source of bias in each situation.
An opinion poll is posted on Facebook and Twitter asking how you are most likely to vote for in the next election.
Keller Auditorium ask all the people in the front three rows if they enjoyed the Broadway play.
To determine opinions on voter support for a downtown farmers market, a surveyor randomly questions people working close to the park where the farmers market would be.
A survey asks people to report the number of hours they work out each week.
A survey randomly calls people on their landlines and ask them if they would support a school bond measure in the next election.
Voluntary Response Bias
Sampling Bias
Sampling Bias
Response bias
Perceived lack of anonymity
6.
Identify whether each situation describes an observational study or an experiment. If it is an experiment.
Subjects are asked to run a mile and record their time.
Fifty students were asked to go to a quiet space in the library to memorize a poem. Fifty students were asked to go to a noisy location in the cafeteria to memorize the poem. Each student recorded how much time it took to memorize the poem.
Observational
Experiment
7.
For the clinical trial of a migraine drug, subjects were randomly divided into two groups. The first received an inert pill, while the second received the test medicine. Patients were not aware of which group they were in. After one month, patients reported how many migraines they experienced.
Which is the treatment group?
Which is the control group (if there is one)?
Is this study blind, double-blind, or neither?
Is this best described as an experiment, a controlled experiment, or a placebo-controlled experiment?
The treatment group is the group receiving the test medicine for migraines.
The control group is the group receiving the inert pill.
This is a blind study.
This is a placebo-controlled experiment.
8.
In a recent study 1 , 380 high risk adolescents involved in the juvenile justice system were recruited to test an app designed to increase mindfulness and reduce substance use. Participants were randomly and equally assigned to use the app (Rewire) or receive services as usual from the Department of Youth Services. Participants were assessed to determine a baseline for substance use at the beginning of the study, and were asked to complete follow up assessments after 1 and 3 months. Assessments consisted of online surveys asking about substance use, emotion regulation, family demographics, and mindfulness practices. Urine samples were collected at each interview to verify self-reported substance use.
Describe the treatment group.
Describe the control group (if there is one).
Is this study blind, double-blind, or neither? Explain
Is this best described as an experiment, a controlled experiment, or a placebo-controlled experiment?
The treatment group is the group assigned to use the Rewire app.
The control group is the group that receives the usual services from the Department of Youth Services.
This study is neither blind nor double-blind because the researchers and the youth know whether they are using the Rewire app of receiving the usual services from the Department of Youth Services.
This is a controlled experiment.
9.
In a 2010 survey, US teens aged 12-18 were asked what their favorite movie genre was. The results are shown below.
Action: 351
Adventure: 171
Comedy: 651
Drama: 389
Horror: 287
Romance: 107
Undecided: 51
What is the implied population?
How many people were sampled?
What type of data is this?
Create a relative frequency bar chart of the results.
Create a pie chart of the results.
Explain the advantages/disadvantages of the two charts.
What is the statistic for the percentage of teens whose favorite movie genre is horror?
The implied population is United States teenagers.
There were 2007 teens surveyed.
Qualitative or Categorical
Answers will vary
About 14.3% of teens chose horror as their favorite movie genre.
10.
A survey of 5325 Portland residents was conducted to determine the primary purpose of using TriMet. The results are shown below.

How many people use TriMet for personal business?
How many people use TriMet to get to the airport?
426 people use TriMet for personal business.
479 people people use TriMet to get to the airport.
11.
A group of college students were asked what the price of gas would need to be before they would start using public transportation to get to school instead of driving. Their responses in $/gallon are listed below:
5.25, 5.00, 4.25, 3.75, 5.00, 4.50, 3.95, 3.75, 5.75, 4.75, 3.25, 3.75, 4.75, 5.00, 8.95
Find the mean and median. Round to two decimal places and include units.
Based on the mean and median, would you expect the distribution to be symmetric, skewed left, or skewed right? Explain.
Find the standard deviation. Round to two decimal places and include units.
Calculate the z-scores for the responses of $3.25 and $8.95. Are either of these values unusual?
Determine the 5-number summary for the data.
What is the range and IQR of the data set? Round to two decimal places and include units.
Use the 5-number summary to construct a box plot.
The mean is $4.78. The median is $4.75.
The mean and median is about the same value therefore the data is symmetric.
The standard deviation is $1.34.
-
\(z_{$3.25}=\frac{$3.25-$4.78}{$1.34}=\frac{-$1.53}{$1.34}=-1.14\)
\(z_{$8.95}=\frac{$8.95-$4.78}{$1.34}=\frac{$4.17}{$1.34}=3.11\)
-
\(\text{Range}=$8.95-$3.25=$5.70\)
\(\text{IQR}=$5-$3.75=$1.25\)
12.
The following is a sample of scores from a recent Math 105 exam:
32, 71, 72, 73, 73, 73, 76, 77, 78, 78, 79, 86, 88, 88, 88, 94, 94, 99
Find the mean of the data. Round to one decimal place if necessary.
Find the median of the data. Round to one decimal place if necessary.
Just comparing the mean and the median, do you expect the distribution to be skewed left, skewed right, or symmetric. Explain.
Find the standard deviation of the data. Round to one decimal place if needed.
Explain what the mean and standard deviation tell you about the sampled test scores.
Is the score of 99 unusual? Use z-scores to support your claim.
Find the 5-number summary.
Use the 5-number summary to create a box plot.
Create a histogram of the data. Start your scale at 0, and use a bin size of 10.
Describe the shape of the distribution. Be sure to address all three characteristics (modality, symmetry, and outliers).
The mean is about 78.9.
The median is 78.
The mean is greater than the median therefore the data is left skewed.
The standard deviation is about 14.5.
The majority of values are between 64.4 and 93.4.
\(z=\frac{(99-78.9)}{14.5}=1.39\text{.}\) This 99 is only 1.39 standard deviations above the mean. This is not unusual. Generally, a value should be more than three deviations above or below the mean to be considered unusual.
13.
The following table shows the cost of purchasing a car at a local dealership. Some of the cars sold were new and some were used.
Cost (Thousands of dollars) |
Frequency |
12 | 6 |
15 | 7 |
18 | 12 |
22 | 10 |
30 | 12 |
32 | 11 |
40 | 6 |
45 | 6 |
Find the mean and standard deviation of the data. Round to two decimal places and include units.
Explain what standard deviation tell you about how much cars are selling for at this dealership.
Determine the five-number summary.
What is the range and IQR?
Use the five-number summary to construct a boxplot of the data.
The mean is about $26,200 and the standard deviation is about $10,000.
The majority of cars sell for between $16,200 and $36,200.
-
\(\text{Range} = 45 - 12 = 33\)
\(\text{IQR}= 32 - 18 = 14\)
14.
The double box-and-whisker plot 2 shows the goals scored per game by two soccer teams during a 25 game season.

Estimate the 25th, 50th and 75th percentiles for Team A and Team B goals.
What is the median number of goals for Team A? Team B?
What percentage of the goals for Team B is more than the maximum number of Team A?
What Team data is more symmetric?
What is the shape of the distribution for Team B?
-
The 25th, 50th, and 75th percentile for Team A goals are 2, 4, and 5, respectively.
The 25th, 50th, and 75th percentile for Team B goals are 6, 8, and 9, respectively.
-
The Median for Team A is 4.
The Median for Team B is 8.
Fifty percent of the goals for Team B exceed the maximum number for goals for Team A.
The data for Team A is more symmetric than for Team B.
The data for Team B is skewed left.
15.
Suppose you buy a new car whose advertised gas mileage is 35 mpg (miles per gallon). After driving the car for several months, you find that you are getting only 30.4 mpg. You phone the manufacturer and learn that the standard deviation for that model is 1.35 mpg.
Find the z-score for the gas mileage of your car.
Does it appear that your car is getting unusually low gas mileage? Explain your answer using your z-score.
\(\displaystyle z=\frac{(30.4-35)}{1.35}=\frac{-4.6}{1.35} \approx -3.41\)
30.4 mpg is 3.41 standard deviations below the mean so the car is getting unusually low gas mileage.
16.
This data is a sample of the average number of minutes per week that a driver is delayed by road congestion in 13 cities:
66, 55, 53, 50, 36, 45, 34, 43, 52, 40, 76, 45, 63
Find the mean and the standard deviation, including units.
What is the z-score for the city with an average delay time of 42 hours per week?
Is an average delay time of 42 hours per week unusual? Explain using the calculated z-score.
The mean is approximately 50.6 minutes per week. The standard deviation approximately 12.2 minutes per week.
\(\displaystyle z=\frac{(42-50.6)}{12.2} \approx -0.70\)
42 minutes per week is 0.70 standard deviations below the mean, so it is not unusual.