SCSA Mathematics Methods Interval estimates for proportions
15 sample questions with marking guides and sample answers · Avg. score: 83.8%
A survey plans to draw conclusions based on a random sample of 1% of Queensland's adult population.
To be regarded as a random sample, every
adult in the population will be placed in an alphabetical list and every 100th person will be selected for the sample.
adult in the population can choose to participate until the sample size has been reached.
subgroup within the population will be represented in a similar proportion in the sample.
adult in the population will have an equal chance of being selected for the sample.
Reveal Answer
adult in the population will be placed in an alphabetical list and every 100th person will be selected for the sample.
This describes systematic sampling (selecting every th person), which is a specific technique distinct from simple random sampling.
adult in the population can choose to participate until the sample size has been reached.
This describes voluntary response sampling, which is a non-probability method that often leads to bias rather than a random sample.
subgroup within the population will be represented in a similar proportion in the sample.
This describes a representative sample or stratified sampling; while random samples aim to be representative, they are defined by the selection probability, not the final composition.
adult in the population will have an equal chance of being selected for the sample.
The fundamental definition of a simple random sample is that every member of the population has an equal probability of being selected.
Unexplained respiratory symptoms reported by athletes are sometimes incorrectly thought to be exercise-induced asthma. A researcher wants to investigate the proportion of Australian athletes with unexplained respiratory symptoms who do have exercise-induced asthma. Using a nationwide repository of medical records, the researcher collects a random sample of 71 athletes referred by their doctor for unexplained respiratory symptoms.
Identify and explain a possible source of bias in the sampling method.
Reveal Answer
Exclusion bias: Athletes with unexplained respiratory symptoms who do not seek medical intervention, or who do not identify themselves as athletes to their doctor, will not be represented in the sample.
| Descriptor | Marks |
|---|---|
correctly explains a valid source of bias | 1 |
Ignore the potential bias in the sampling method in the remaining parts of the question.
Suppose that 25 athletes from the sample were found to have exercise-induced asthma.
Calculate a 95% confidence interval for the true proportion of athletes with unexplained respiratory symptoms who do have exercise-induced asthma.
Reveal Answer
The sample proportion
Hence the 95% confidence interval is given by
| Descriptor | Marks |
|---|---|
calculates the correct sample proportion | 1 |
calculates the correct 95% confidence interval | 1 |
Determine the margin of error of the 95% confidence interval from part (b).
Reveal Answer
The margin of error is given by
| Descriptor | Marks |
|---|---|
correctly determines the margin of error | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the sample size was increased.
Reveal Answer
The margin of error would decrease if the sample size is increased.
| Descriptor | Marks |
|---|---|
states that the margin of error would decrease | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the confidence level was increased.
Reveal Answer
The margin of error would increase if the confidence level is increased.
| Descriptor | Marks |
|---|---|
states that the margin of error would increase | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the sample proportion of athletes with exercise-induced asthma decreased. Justify your answer.
Reveal Answer
Since the sample proportion is less than 0.5 (0.5 is the sample proportion yielding the largest margin of error), a decrease in sample proportion would move the margin of error away from being maximum. Hence the margin of error would decrease.
| Descriptor | Marks |
|---|---|
states that the margin of error would decrease | 1 |
provides a valid justification based on the sample proportion moving away from 0.5 | 1 |
Determine the minimum sample size required to guarantee a margin of error for the 95% confidence interval of at most 0.04.
Reveal Answer
We require
Since must be an integer the minimum value of the sample size is .
| Descriptor | Marks |
|---|---|
correctly substitutes values into the margin of error equation to obtain an equation for | 1 |
correctly solves for , rounding up to an integer value | 1 |
A separate large-scale study from the United States of America claims that 20% of American athletes with unexplained respiratory symptoms do have exercise-induced asthma.
Based on the 95% confidence interval calculated in part (b) on page 20, is the proportion of Australian athletes with unexplained respiratory symptoms who do have exercise-induced asthma different from the American proportion? Justify your answer.
Reveal Answer
Yes. The claimed American proportion is outside the confidence interval for Australian athletes, so it is possible to conclude that the proportions are different at the 95% level of confidence.
| Descriptor | Marks |
|---|---|
states that the proportion is different | 1 |
provides a justification based on the American proportion being outside the Australian confidence interval | 1 |
At council meetings in a particular town, new proposals are only discussed if more than 80% of the community are in favour of the proposal.
To discover community opinion on a new bus route proposal, the council conducted several surveys, each with a sample size of 120. The distribution of the sample proportions from the surveys had a standard deviation of 0.04.
Make a justified decision as to whether the new bus route proposal would be discussed at a council meeting.
Reveal Answer
Analytical procedure:
(formula book)
Use GDC equation solver or graph and the
square root function to find intersections:
or
Both population proportions (26% and 74%) are less than
80%.
Therefore, the bus route proposal would not be discussed
at a council meeting.
| Descriptor | Marks |
|---|---|
Correctly substitutes the given information into the standard deviation formula for sample proportion | 1 |
Determines a possible value for the population proportion | 1 |
Determines a second possible population proportion value | 1 |
Makes a justified decision regarding the proposal | 1 |
Ravi randomly sampled 200 different pet owners in Brisbane and found that 50 celebrate their pet's birthday.
Determine an approximate 95% confidence interval for the proportion of Brisbane pet owners who celebrate their pet's birthday.
Reveal Answer
Using GDC to determine confidence interval associated with
| Descriptor | Marks |
|---|---|
correctly identifies all of the information required to establish the confidence interval | 1 |
correctly determines the confidence interval | 1 |
Two of Ravi's friends also randomly sampled Brisbane pet owners. The results are shown in the table.
| Friend's name | Number sampled | Number who celebrate their pet's birthday |
|---|---|---|
| Khadija | 100 | 26 |
| Tim | 150 | 34 |
Khadija suggested a more precise estimate for the proportion of Brisbane pet owners who celebrate their pet's birthday could be obtained by combining their results.
Using all available data, determine an approximate 95% confidence interval for the proportion of Brisbane pet owners who celebrate their pet's birthday.
Reveal Answer
Combining results
Using GDC
| Descriptor | Marks |
|---|---|
correctly determines and for the combined sample | 1 |
determines confidence interval | 1 |
Use the results from Questions 14a) and 14b) to evaluate the reasonableness of Khadija's suggestion.
Reveal Answer
By combining the results, the sample size is increased and the confidence interval width is reduced.
The new sample statistic provides a better estimate for the population parameter.
| Descriptor | Marks |
|---|---|
identifies changed width of confidence interval | 1 |
evaluates the reasonableness of Khadija’s suggestion | 1 |
The proportion of all Brisbane pet owners who celebrate their pet's birthday is 0.24.
Using the normal approximation, determine the probability that in a randomly selected sample of size 200, more than 30% of pet owners celebrate their pet's birthday.
Reveal Answer
Using approximation to the normal distribution
Mean
Standard deviation
Using GDC
| Descriptor | Marks |
|---|---|
correctly determines the mean and standard deviation of the normal distribution | 1 |
determines the probability | 1 |
Assuming the approximate normality of sample proportions ( and ) and based on two independent samples, the approximate confidence interval for the difference of two proportions is given by
If the approximate confidence interval for the difference between two proportions does not contain 0, this provides evidence that the two proportions are not equal.
The data in the table shows the observed frequencies of two drink preferences for independent samples of people who live in Town A and Town B.
| Town | Tea | Coffee | Total |
|---|---|---|---|
| A | 111 | 105 | 216 |
| B | 150 | 107 | 257 |
Using the approximate 99% confidence interval for the difference of two proportions, determine if there is evidence to conclude that drink preference is associated with the town where the person lives.
Reveal Answer
proportion of Town A who prefer to drink tea
proportion of Town B who prefer to drink tea
The sample proportions are:
Using the 99% confidence interval for the difference of two proportions
This interval contains zero; therefore, there is no evidence in the data to say that the two proportions are different, i.e. preference to drink tea does not depend on where the person lives.
| Descriptor | Marks |
|---|---|
correctly determines the sample proportions | 1 |
establishes confidence interval for the difference of two proportions | 1 |
determines 99% confidence interval | 1 |
interprets 99% confidence interval to determine equality of proportions | 1 |
shows logical organisation communicating key steps | 1 |
Let be the random variable that represents the sample proportion of households in a given suburb that have solar panels installed.
From a sample of randomly selected households in a given suburb, an approximate 95% confidence interval for the proportion of households having solar panels installed was determined to be .
Use to approximate the 95% confidence interval.
Find the value of that was used to obtain this approximate 95% confidence interval.
Reveal Answer
| Descriptor | Marks |
|---|---|
Calculates the correct value of | 1 |
Find the size of the sample from which this 95% confidence interval was obtained.
Reveal Answer
| Descriptor | Marks |
|---|---|
Sets up a correct equation involving using the margin of error or confidence interval bounds | 1 |
Calculates the correct sample size, | 1 |
A larger sample of households is selected, with a sample size four times the original sample.
The sample proportion of households having solar panels installed is found to be the same.
By what factor will the increased sample size affect the width of the confidence interval?
Reveal Answer
Confidence interval width is halved (reduced or decreased by a factor of 2; altered by a factor of ).
| Descriptor | Marks |
|---|---|
States the correct factor by which the width is affected (e.g., , halved, or decreased by a factor of 2) | 1 |
A survey was conducted to understand whether people support a new policy.
Using a z-score of 2, the approximate confidence interval for the population proportion of people who support the policy was calculated as .
Determine the margin of error.
Reveal Answer
The confidence interval corresponds to , where E is the margin of error about .
subtracting:
| Descriptor | Marks |
|---|---|
Correctly determines the margin of error | 1 |
Determine the number of people surveyed.
Reveal Answer
Upper CI value =
25 people were surveyed.
| Descriptor | Marks |
|---|---|
Determines the value of the sample proportion | 1 |
Substitutes and z-score into the confidence interval formula | 1 |
Determines the number of people surveyed | 1 |
A suburban council hires a consultant to estimate the proportion of residents of the suburb who use its library.
The consultant decides to estimate a 95% confidence interval for the proportion to within an error of 0.01. What minimum sample size should be selected?
Reveal Answer
| Descriptor | Marks |
|---|---|
uses the correct z-value | 1 |
uses 0.5 in the expression for standard error | 1 |
determines the sample size (as an integer) | 1 |
If resource limitations dictate that the maximum sample size that can be managed is 500, what is the maximum margin of error in estimating a 99% confidence interval?
Reveal Answer
that is, within 5.8%
| Descriptor | Marks |
|---|---|
uses the correct z-value | 1 |
uses 0.5 in the expression for standard error | 1 |
calculates the error | 1 |
The consultant decides to select the sample by standing on the roadside outside the library at lunchtime and asking a random sample of the passers-by whether they use the library.
Identify and explain two possible sources of bias with this sampling scheme.
Reveal Answer
- The sample is at a fixed time, so only people around at that time will be sampled.
- The location is fixed, so:
(i) only people at that location will be sampled or
(ii) not everyone from the suburb will pass by that area, so this is not a random sample of the residents.
| Descriptor | Marks |
|---|---|
identifies one possible source of bias | 1 |
explains why it is a possible source of bias | 1 |
identifies another possible source of bias | 1 |
explains why it is a possible source of bias | 1 |
An online retailer claims that 90% of all orders are shipped within 12 hours of being received. On a particular day, 121 orders were received and 102 orders were shipped within 12 hours.
State the sample proportion of orders shipped within 12 hours.
Reveal Answer
| Descriptor | Marks |
|---|---|
Correctly determines the sample proportion | 1 |
The distribution of the sample proportion of all orders that are shipped within 12 hours of being received on any day is approximately normal.
Assuming the online retailer's claim is true, find the probability that, in a random sample of 121, less than 85% of all orders are shipped within 12 hours.
Reveal Answer
Using normal distribution
Using GDC
| Descriptor | Marks |
|---|---|
Correctly identifies the mean of the population | 1 |
Determines expression for standard deviation of the normal distribution | 1 |
Determines probability of producing sample proportion or less | 1 |
Use the result from 13b) to evaluate the reasonableness of the online retailer's claim.
Reveal Answer
There is 3.34% chance of shipping less than 85% of orders when the population parameter is 0.90.
Observing a sample proportion of 0.843 (or even lower) would have occurred by chance less than 3.34% of the time if the retailer's claim is true. Therefore, we suspect that the retailer's claim is dubious.
| Descriptor | Marks |
|---|---|
Identifies meaning of the probability of producing the sample proportion | 1 |
Evaluates reasonableness of the retailer's claim | 1 |
A book called 'Why I Love Mathematics' is having its first print run. This is scheduled to last for one week, using four printing presses. The publisher claims that, historically, 10 books have printing errors for every two hundred that are printed.
A sample of 200 books is to be chosen to determine how many contained errors. The proposed sampling procedure is to select the first 200 books printed over a 6-hour window using the newest printing press.
Identify and explain one possible source of bias in the proposed sampling procedure.
Reveal Answer
Answers could include:
- the sample only includes books from the newest printing press, which may perform differently to the other three presses
- the sample was gathered over a single narrow time period, so books printed later in the week are not included (this may involve different operators and the performance of the presses might change during their use over the week).
| Descriptor | Marks |
|---|---|
identifies a source of bias | 1 |
provides a correct explanation for that source | 1 |
Identify two changes to the sampling procedure that would reduce bias.
Reveal Answer
The sample could be randomly selected:
- from books printed across the entire duration of the print run
- from all printing presses.
| Descriptor | Marks |
|---|---|
suggests a change to improve randomisation across the entire print run | 1 |
suggests a second change to improve randomisation across the entire print run | 1 |
Assume that the sample was gathered using an improved procedure, and that the publisher's claim is correct.
Use the approximate normality of the distribution of sample proportions to determine the probability that the sample proportion of books with errors is less than 0.04.
Reveal Answer
The distribution of can be approximated as
Hence,
| Descriptor | Marks |
|---|---|
calculates the correct distribution parameters (mean and standard deviation/variance) | 1 |
calculates the correct probability | 1 |
In a different random sample of 600 books, it is found that the proportion of books containing an error is 0.1, with a margin of error of 0.024.
Determine a 95% confidence interval for the proportion of books that will have printing errors.
Reveal Answer
A 95% confidence interval is given by
| Descriptor | Marks |
|---|---|
correctly calculates confidence interval | 1 |
On the basis of the confidence interval determined in part (c), is the proportion of books with printing errors different from what was claimed by the publisher?
Reveal Answer
The proportion of books with errors claimed by the publisher was . This proportion is not within the confidence interval and so there is sufficient evidence to conclude that the claim of the publisher is incorrect at the 95% confidence level.
| Descriptor | Marks |
|---|---|
states that the claimed proportion is not within the confidence interval | 1 |
states that there is sufficient evidence at the above confidence level to conclude that the claimed proportion is incorrect | 1 |
Suggest two changes that could be made in order to decrease the margin of error of the confidence interval.
Reveal Answer
The margin of error could be decreased by
- increasing the size of the sample
- decreasing the confidence level.
| Descriptor | Marks |
|---|---|
states one possible change | 1 |
states a second possible change | 1 |
Determine the minimum sample size that would be necessary to guarantee that the margin of error of the resulting 95% confidence interval was at most 0.02.
Reveal Answer
For the worst-case scenario set . Solving for a margin of error equal to 0.02 gives
A sample size of at least 2401 would ensure the margin of error is at most 0.02.
| Descriptor | Marks |
|---|---|
sets and provides a correct expression for the sample size | 1 |
correctly solves for the minimum sample size. | 1 |
A researcher found that 17 out of 50 randomly selected people had used public transport in the past week.
Determine the sample proportion of people who had used public transport in the past week.
Reveal Answer
| Descriptor | Marks |
|---|---|
correctly determines the sample proportion | 1 |
Determine an approximate 95% confidence interval for the proportion of people who had used public transport in the past week.
Reveal Answer
Using the formula:
Variance to estimate variance
Standard deviation
Using GDC InvN
Area
Std dev
CI
| Descriptor | Marks |
|---|---|
identifies all the information required to establish the confidence interval | 1 |
determines an approximate 95% confidence interval | 1 |
Someone claims that: 50% of people use public transport each week.
Use your answer from Question 11b) to explain whether the data can or cannot support this claim.
Reveal Answer
From Q11b) we are 95% confident that the proportion of people using public transport is approximately between 21% and 47%. 50% is outside of this range.
The claim is not supported.
| Descriptor | Marks |
|---|---|
provides a valid evaluation of the claim that references their answer from 11b) | 1 |
An organisation randomly surveyed 1000 Australian adults and found that 55% of those surveyed were happy with their level of physical activity.
An approximate 95% confidence interval for the percentage of Australian adults who were happy with their level of physical activity is closest to
Reveal Answer
This interval is incorrect because it does not center around the sample proportion of 55%.
This interval is too wide. It corresponds to an approximate 99% confidence interval, which uses a z-score of 2.576 instead of 1.96.
This interval is too narrow. It corresponds to an approximate 90% confidence interval, which uses a z-score of 1.645 instead of 1.96.
This is correct. The 95% confidence interval is calculated using , which gives , or .
This interval is much too wide. It represents the 95% confidence interval if the sample size was instead of .
A sample of size was used to estimate a population proportion. An approximate margin of error of 3% was calculated using . Given the sample proportion was 0.6, determine .
Reveal Answer
This option incorrectly inverts the formula. The sample size is calculated by dividing the variance estimate by the squared term , not the other way around.
Rearranging the margin of error formula to solve for gives . With , the numerator is .
This option incorrectly inverts the fraction and uses the wrong value for the product . The calculation equals , not .
While the structure of the formula is correct, the numerator is calculated incorrectly. The value of is , not .
According to the Association of Poultry Farmers, 35% of people living in Melbourne purchase free-range eggs.
The company does not follow the suggestion of the junior employee and instead randomly samples 243 people living in Perth and asks them whether they purchase free-range eggs. On the basis of the results of their survey, a confidence interval for is calculated to be (0.2520, 0.3488).
If a random sample of 100 people living in Melbourne is surveyed, what is the probability that the sample proportion of people who purchase free-range eggs will be less than 0.28?
Reveal Answer
The sample proportion has distribution
Hence
| Descriptor | Marks |
|---|---|
states sample proportion is normally distributed | 1 |
determines correct distribution parameters | 1 |
calculates correct probability | 1 |
A market research company wants to know whether the proportion of people living in Perth who purchase free-range eggs is similar to that of Melbourne. A junior employee proposes that it gather a sample of shoppers by standing outside a particular shop between 9 am and 10 am on a Tuesday morning and asking all shoppers entering the shop if they purchase free-range eggs.
Identify and explain two sources of bias in the proposed sampling method.
Reveal Answer
- Single location: only sampling people who visit a particular store. People who live a long way from the store are less likely to shop there than people who live locally, and so are less likely to be included/represented in the sample
- Single time: only sampling between 9am and 10am on Tuesday. People who have 9am–5pm work commitments are less likely to shop at this time than people who do not work these hours and so are less likely to be included/represented in the sample.
| Descriptor | Marks |
|---|---|
identifies a source of bias | 1 |
explains how the source introduces bias | 1 |
identifies a second source of bias | 1 |
explains how the second source introduces bias | 1 |
Determine the number of people in the sample who purchase free-range eggs.
Reveal Answer
Hence the number of people who purchase free-range eggs is
| Descriptor | Marks |
|---|---|
calculates correct sample proportion | 1 |
calculates correct number of people | 1 |
Determine the level of confidence that was used to calculate the confidence interval.
Reveal Answer
The margin of error is
Hence we have
Then
Hence the confidence level was 90%.
| Descriptor | Marks |
|---|---|
correctly calculates the margin of error | 1 |
calculates the correct standardised score (-value) | 1 |
determines the correct confidence level | 1 |
The distribution of a certain sample proportion has a mean of 0.70 and a standard deviation of 0.02.
Determine the sample size.
525
750
1750
2500
Reveal Answer
525
Using the formula for the standard deviation of a sample proportion, , we solve to find .
750
This sample size is incorrect. If , the standard deviation would be , which does not match the given value of .
1750
This sample size is too large. Using in the standard deviation formula yields approximately , rather than the required .
2500
This sample size is incorrect. A sample size of would result in a much smaller standard deviation of .