QCAA Mathematical Methods Sampling and proportions
15 sample questions with marking guides and sample answers
A council wants to survey residents about a new dog park. Which sampling method would best minimise bias in the survey?
Questioning every third resident entering a supermarket near an existing dog park.
Collecting responses from residents who clicked a survey link on the website.
Asking residents visiting a dog park on a randomly selected day.
Selecting residents using a random number generator.
Reveal Answer
Questioning every third resident entering a supermarket near an existing dog park.
Surveying near an existing dog park introduces location bias, as people in that area might have stronger opinions about dog parks than the general population.
Collecting responses from residents who clicked a survey link on the website.
This relies on voluntary response sampling, which introduces self-selection bias because only residents with strong opinions are likely to participate.
Asking residents visiting a dog park on a randomly selected day.
Surveying at a dog park introduces selection bias by overrepresenting dog owners and excluding residents who do not currently use dog parks.
Selecting residents using a random number generator.
Using a random number generator creates a simple random sample, giving every resident an equal chance of being chosen and effectively minimizing bias.
At council meetings in a particular town, new proposals are only discussed if more than 80% of the community are in favour of the proposal.
To discover community opinion on a new bus route proposal, the council conducted several surveys, each with a sample size of 120. The distribution of the sample proportions from the surveys had a standard deviation of 0.04.
Make a justified decision as to whether the new bus route proposal would be discussed at a council meeting.
Reveal Answer
Analytical procedure:
(formula book)
Use GDC equation solver or graph and the
square root function to find intersections:
or
Both population proportions (26% and 74%) are less than
80%.
Therefore, the bus route proposal would not be discussed
at a council meeting.
| Descriptor | Marks |
|---|---|
Correctly substitutes the given information into the standard deviation formula for sample proportion | 1 |
Determines a possible value for the population proportion | 1 |
Determines a second possible population proportion value | 1 |
Makes a justified decision regarding the proposal | 1 |
A survey plans to draw conclusions based on a random sample of 1% of Queensland's adult population.
To be regarded as a random sample, every
adult in the population will be placed in an alphabetical list and every 100th person will be selected for the sample.
adult in the population can choose to participate until the sample size has been reached.
subgroup within the population will be represented in a similar proportion in the sample.
adult in the population will have an equal chance of being selected for the sample.
Reveal Answer
adult in the population will be placed in an alphabetical list and every 100th person will be selected for the sample.
This describes systematic sampling (selecting every th person), which is a specific technique distinct from simple random sampling.
adult in the population can choose to participate until the sample size has been reached.
This describes voluntary response sampling, which is a non-probability method that often leads to bias rather than a random sample.
subgroup within the population will be represented in a similar proportion in the sample.
This describes a representative sample or stratified sampling; while random samples aim to be representative, they are defined by the selection probability, not the final composition.
adult in the population will have an equal chance of being selected for the sample.
The fundamental definition of a simple random sample is that every member of the population has an equal probability of being selected.
An online retailer claims that 90% of all orders are shipped within 12 hours of being received. On a particular day, 121 orders were received and 102 orders were shipped within 12 hours.
State the sample proportion of orders shipped within 12 hours.
Reveal Answer
| Descriptor | Marks |
|---|---|
Correctly determines the sample proportion | 1 |
The distribution of the sample proportion of all orders that are shipped within 12 hours of being received on any day is approximately normal.
Assuming the online retailer's claim is true, find the probability that, in a random sample of 121, less than 85% of all orders are shipped within 12 hours.
Reveal Answer
Using normal distribution
Using GDC
| Descriptor | Marks |
|---|---|
Correctly identifies the mean of the population | 1 |
Determines expression for standard deviation of the normal distribution | 1 |
Determines probability of producing sample proportion or less | 1 |
Use the result from 13b) to evaluate the reasonableness of the online retailer's claim.
Reveal Answer
There is 3.34% chance of shipping less than 85% of orders when the population parameter is 0.90.
Observing a sample proportion of 0.843 (or even lower) would have occurred by chance less than 3.34% of the time if the retailer's claim is true. Therefore, we suspect that the retailer's claim is dubious.
| Descriptor | Marks |
|---|---|
Identifies meaning of the probability of producing the sample proportion | 1 |
Evaluates reasonableness of the retailer's claim | 1 |
The distribution of a certain sample proportion has a mean of 0.70 and a standard deviation of 0.02.
Determine the sample size.
525
750
1750
2500
Reveal Answer
525
Using the formula for the standard deviation of a sample proportion, , we solve to find .
750
This sample size is incorrect. If , the standard deviation would be , which does not match the given value of .
1750
This sample size is too large. Using in the standard deviation formula yields approximately , rather than the required .
2500
This sample size is incorrect. A sample size of would result in a much smaller standard deviation of .
For a certain species of bird, the proportion of birds with a crest is known to be .
Let be the random variable representing the proportion of birds with a crest in samples of size for this specific bird.
The smallest sample size for which the standard deviation of is less than 0.08 is
7
27
37
38
43
Reveal Answer
7
Setting up the inequality yields . A sample size of 7 is much too small and results in a standard deviation of about 0.185.
27
Setting up the inequality yields . A sample size of 27 results in a standard deviation of about 0.094, which is greater than 0.08.
37
Setting up the inequality yields . A sample size of 37 results in a standard deviation of about 0.0805, which is slightly greater than the required 0.08.
38
The standard deviation of a sample proportion is . Solving yields , making 38 the smallest integer sample size that satisfies the condition.
43
Although a sample size of 43 results in a standard deviation less than 0.08, it is not the smallest possible sample size to do so since also satisfies the condition.
The distribution for a sample proportion has a mean of 0.15 and a standard deviation of 0.0345.
The sample size is
10
14
107
116
Reveal Answer
10
This sample size is too small. Using the formula for the standard deviation of a sample proportion, a sample size of 10 would result in a standard deviation of approximately .
14
This is incorrect. A sample size of 14 would yield a standard deviation of approximately , which does not match the given value.
107
Using the formula for the standard deviation of a sample proportion , we can solve for : , which rounds to 107.
116
This value is incorrect. Solving the equation for results in approximately 107, not 116.
Unexplained respiratory symptoms reported by athletes are sometimes incorrectly thought to be exercise-induced asthma. A researcher wants to investigate the proportion of Australian athletes with unexplained respiratory symptoms who do have exercise-induced asthma. Using a nationwide repository of medical records, the researcher collects a random sample of 71 athletes referred by their doctor for unexplained respiratory symptoms.
Identify and explain a possible source of bias in the sampling method.
Reveal Answer
Exclusion bias: Athletes with unexplained respiratory symptoms who do not seek medical intervention, or who do not identify themselves as athletes to their doctor, will not be represented in the sample.
| Descriptor | Marks |
|---|---|
correctly explains a valid source of bias | 1 |
Ignore the potential bias in the sampling method in the remaining parts of the question.
Suppose that 25 athletes from the sample were found to have exercise-induced asthma.
Calculate a 95% confidence interval for the true proportion of athletes with unexplained respiratory symptoms who do have exercise-induced asthma.
Reveal Answer
The sample proportion
Hence the 95% confidence interval is given by
| Descriptor | Marks |
|---|---|
calculates the correct sample proportion | 1 |
calculates the correct 95% confidence interval | 1 |
Determine the margin of error of the 95% confidence interval from part (b).
Reveal Answer
The margin of error is given by
| Descriptor | Marks |
|---|---|
correctly determines the margin of error | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the sample size was increased.
Reveal Answer
The margin of error would decrease if the sample size is increased.
| Descriptor | Marks |
|---|---|
states that the margin of error would decrease | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the confidence level was increased.
Reveal Answer
The margin of error would increase if the confidence level is increased.
| Descriptor | Marks |
|---|---|
states that the margin of error would increase | 1 |
All else remaining unchanged, what would you expect to happen to the margin of error if the sample proportion of athletes with exercise-induced asthma decreased. Justify your answer.
Reveal Answer
Since the sample proportion is less than 0.5 (0.5 is the sample proportion yielding the largest margin of error), a decrease in sample proportion would move the margin of error away from being maximum. Hence the margin of error would decrease.
| Descriptor | Marks |
|---|---|
states that the margin of error would decrease | 1 |
provides a valid justification based on the sample proportion moving away from 0.5 | 1 |
Determine the minimum sample size required to guarantee a margin of error for the 95% confidence interval of at most 0.04.
Reveal Answer
We require
Since must be an integer the minimum value of the sample size is .
| Descriptor | Marks |
|---|---|
correctly substitutes values into the margin of error equation to obtain an equation for | 1 |
correctly solves for , rounding up to an integer value | 1 |
A separate large-scale study from the United States of America claims that 20% of American athletes with unexplained respiratory symptoms do have exercise-induced asthma.
Based on the 95% confidence interval calculated in part (b) on page 20, is the proportion of Australian athletes with unexplained respiratory symptoms who do have exercise-induced asthma different from the American proportion? Justify your answer.
Reveal Answer
Yes. The claimed American proportion is outside the confidence interval for Australian athletes, so it is possible to conclude that the proportions are different at the 95% level of confidence.
| Descriptor | Marks |
|---|---|
states that the proportion is different | 1 |
provides a justification based on the American proportion being outside the Australian confidence interval | 1 |
A teacher coaches their school's table tennis team.
The teacher has an adjustable ball machine that they use to help the players practise.
The speed, measured in metres per second, of the balls shot by the ball machine is a normally distributed random variable .
The teacher sets the ball machine with a mean speed of 10 metres per second and a standard deviation of 0.8 metres per second.
The teacher adjusts the height setting for the ball machine. The machine now shoots balls high above the table tennis table.
Unfortunately, with the new height setting, 8% of balls do not land on the table.
Let be the random variable representing the sample proportion of balls that do not land on the table in random samples of 25 balls.
The teacher can also adjust the spin setting on the ball machine.
The spin, measured in revolutions per second, is a continuous random variable with the probability density function
Determine , correct to three decimal places.
Reveal Answer
| Descriptor | Marks |
|---|---|
Correctly determines the probability as | 1 |
Find the value of , in metres per second, which 80% of ball speeds are below. Give your answer in metres per second, correct to one decimal place.
Reveal Answer
| Descriptor | Marks |
|---|---|
Correctly finds the value of as | 1 |
Find the mean and the standard deviation of .
Reveal Answer
,
| Descriptor | Marks |
|---|---|
Correctly calculates the mean: or | 1 |
Correctly calculates the exact standard deviation: | 1 |
Use the binomial distribution to find , correct to three decimal places.
Reveal Answer
, ,
| Descriptor | Marks |
|---|---|
Identifies the correct binomial distribution and probability statement, e.g., and | 1 |
Correctly evaluates the probability as | 1 |
Find the maximum possible spin applied by the ball machine, in revolutions per second.
Reveal Answer
| Descriptor | Marks |
|---|---|
Correctly identifies the maximum possible spin as | 1 |
Find the median spin, in revolutions per second, correct to one decimal place.
Reveal Answer
,
| Descriptor | Marks |
|---|---|
Sets up the correct equation using the integral from to , e.g., | 1 |
Calculates the correct median spin, | 1 |
Find the standard deviation of the spin, in revolutions per second, correct to one decimal place.
Reveal Answer
| Descriptor | Marks |
|---|---|
Sets up the correct integral expression for the variance or standard deviation | 1 |
Correctly evaluates the variance or intermediate integrals | 1 |
Correctly calculates the standard deviation as | 1 |
The teacher adjusts the spin setting so that the median spin becomes 30 revolutions per second. This will transform the original probability density function to a new probability density function , where .
Find the values of and for which the new median spin is 30 revolutions per second, giving your answer correct to two decimal places.
Reveal Answer
, , and
| Descriptor | Marks |
|---|---|
Sets up the correct equations using integrals to find and , e.g., and | 1 |
Calculates the correct values for and , and | 1 |
Ravi randomly sampled 200 different pet owners in Brisbane and found that 50 celebrate their pet's birthday.
Determine an approximate 95% confidence interval for the proportion of Brisbane pet owners who celebrate their pet's birthday.
Reveal Answer
Using GDC to determine confidence interval associated with
| Descriptor | Marks |
|---|---|
correctly identifies all of the information required to establish the confidence interval | 1 |
correctly determines the confidence interval | 1 |
Two of Ravi's friends also randomly sampled Brisbane pet owners. The results are shown in the table.
| Friend's name | Number sampled | Number who celebrate their pet's birthday |
|---|---|---|
| Khadija | 100 | 26 |
| Tim | 150 | 34 |
Khadija suggested a more precise estimate for the proportion of Brisbane pet owners who celebrate their pet's birthday could be obtained by combining their results.
Using all available data, determine an approximate 95% confidence interval for the proportion of Brisbane pet owners who celebrate their pet's birthday.
Reveal Answer
Combining results
Using GDC
| Descriptor | Marks |
|---|---|
correctly determines and for the combined sample | 1 |
determines confidence interval | 1 |
Use the results from Questions 14a) and 14b) to evaluate the reasonableness of Khadija's suggestion.
Reveal Answer
By combining the results, the sample size is increased and the confidence interval width is reduced.
The new sample statistic provides a better estimate for the population parameter.
| Descriptor | Marks |
|---|---|
identifies changed width of confidence interval | 1 |
evaluates the reasonableness of Khadija’s suggestion | 1 |
The proportion of all Brisbane pet owners who celebrate their pet's birthday is 0.24.
Using the normal approximation, determine the probability that in a randomly selected sample of size 200, more than 30% of pet owners celebrate their pet's birthday.
Reveal Answer
Using approximation to the normal distribution
Mean
Standard deviation
Using GDC
| Descriptor | Marks |
|---|---|
correctly determines the mean and standard deviation of the normal distribution | 1 |
determines the probability | 1 |
A cognitive ability test is developed for Australian students aged 15 years. Reported scores are normally distributed with a mean of 50 and a standard deviation of 10. Let the random variable denote the score of a randomly selected 15-year-old Australian student.
Calculate the percentage of Australian 15-year-old students you expect to obtain a score of at least 64 on the test.
Reveal Answer
We are given that . Hence
and so, we expect 8.08% of Australian 15-year-old students to obtain a score of at least 64.
| Descriptor | Marks |
|---|---|
correctly calculates the probability of a student obtaining a score of at least 64 | 1 |
converts probability to a percentage | 1 |
Calculate the minimum score a student needs to achieve to be in the top 1% of Australian 15-year-old students.
Reveal Answer
Solving for yields . Hence the minimum score required for a student to be in the top 1% is 73.27 (or 74 if reporting only integer scores).
| Descriptor | Marks |
|---|---|
writes out correct probability statement | 1 |
correctly solves for the correct minimum score | 1 |
Students who obtain scores in the range of 43 to 57 are classified as 'average'.
Calculate the probability that a randomly selected student is classified as 'average'.
Reveal Answer
| Descriptor | Marks |
|---|---|
correctly calculates probability | 1 |
A sample of 50 students is to be randomly selected.
Use the approximate normality of the distribution of sample proportions to approximate the probability that the sample proportion of students classified as 'average' is no more than 0.46.
Reveal Answer
The sample proportion of students classified as 'average' has a mean of
and standard deviation
Hence, we use the approximate distribution . The probability that is no more than 0.46 given by
| Descriptor | Marks |
|---|---|
states the correct mean and standard deviation of the distribution of sample proportions | 1 |
calculates correct probability | 1 |
It has been decided to transform the test scores using the equation
such that and .
Determine the mean and standard deviation of , rounding your answers to integer values.
Reveal Answer
Let . Given that
and
It follows that
and
where and are the mean and standard deviation of respectively. Solving these two equations yields
| Descriptor | Marks |
|---|---|
determines correct scores for and | 1 |
writes correct equations for the mean and standard deviation | 1 |
correctly solves for the mean and standard deviation, expressing answers as integers | 1 |
Hence, determine the values of and used to transform the original scores.
Reveal Answer
Given that it follows from the means and standard deviations of and that
Hence and .
| Descriptor | Marks |
|---|---|
states correct equations to solve for and | 1 |
correctly solves for and | 1 |
Mika is flipping a coin. The unbiased coin has a probability of of landing on heads and of landing on tails.
Let be the binomial random variable representing the number of times that the coin lands on heads.
Mika flips the coin five times.
The height reached by each of Mika's coin flips is given by a continuous random variable, , with the probability density function
where is the vertical height reached by the coin flip, in metres, between the coin and the floor, and , and are real constants.
Mika's sister Bella also has a coin. On each flip, Bella's coin has a probability of of landing on heads and of landing on tails, where is a constant value between 0 and 1.
Bella flips her coin 25 times in order to estimate .
Let be the random variable representing the proportion of times that Bella's coin lands on heads in her sample.
Find .
Reveal Answer
| Descriptor | Marks |
|---|---|
Calculates the correct exact probability: or | 1 |
Find .
Reveal Answer
| Descriptor | Marks |
|---|---|
Calculates the correct exact probability: or | 1 |
Find , correct to three decimal places.
Reveal Answer
Working
| Descriptor | Marks |
|---|---|
Sets up the correct conditional probability expression or substitution, e.g., or | 1 |
Answer
| Descriptor | Marks |
|---|---|
Calculates the correct answer to three decimal places: | 1 |
Find the expected value and the standard deviation for .
Reveal Answer
,
Expected Value
| Descriptor | Marks |
|---|---|
Calculates the correct expected value: or | 1 |
Standard Deviation
| Descriptor | Marks |
|---|---|
Calculates the correct exact standard deviation: | 1 |
State the value of the definite integral .
Reveal Answer
| Descriptor | Marks |
|---|---|
States the correct value of the definite integral: | 1 |
Given that and , find the values of , and .
Reveal Answer
, ,
Working
| Descriptor | Marks |
|---|---|
Sets up the correct system of definite integrals: , , and | 1 |
Demonstrates a correct method to evaluate the integrals and solve the system of equations for the constants | 1 |
Answer
| Descriptor | Marks |
|---|---|
Calculates all three correct exact values: (or ), (or ), and | 1 |
The ceiling of Mika's room is 3 m above the floor. The minimum distance between the coin and the ceiling is a continuous random variable, , with probability density function .
The function is a transformation of the function given by ,
where is the minimum distance between the coin and the ceiling, and and are real constants.
Find the values of and .
Reveal Answer
| Descriptor | Marks |
|---|---|
Finds both correct values: and | 1 |
Is the random variable discrete or continuous? Justify your answer.
Reveal Answer
Discrete, countable
| Descriptor | Marks |
|---|---|
Identifies the random variable as discrete and provides a valid justification (e.g., the number of heads is countable) | 1 |
If , find an approximate 95% confidence interval for , correct to three decimal places.
Reveal Answer
| Descriptor | Marks |
|---|---|
Calculates the correct 95% confidence interval, correct to three decimal places: | 1 |
Bella knows that she can decrease the width of a 95% confidence interval by using a larger sample of coin flips.
If , how many coin flips would be required to halve the width of the confidence interval found in part c.ii.?
Reveal Answer
| Descriptor | Marks |
|---|---|
Calculates the correct number of coin flips required: | 1 |
It is estimated that 20% of small businesses fail in the first year. A business advisory group takes a random sample of 500 new businesses that started in January 2018. An analyst employed by the group suggests the use of the binomial distribution is appropriate in this case.
The business advisory group believes that the proportion of new businesses that fail within a year can be reduced by providing financial advice. They took another random sample of 500 businesses that started in January 2019 and provided them with regular financial advice. In this random sample, at the end of the year 80 businesses had failed.
What is the probability that at most 120 of the businesses fail in the first year?
Reveal Answer
Let the random variable denote the number of new businesses that fail out of the 500. Then .
| Descriptor | Marks |
|---|---|
states the parameters of the Binomial distribution of an appropriate random variable | 1 |
calculates the probability | 1 |
What is the approximate distribution of the sample proportion of small businesses that fail by the end of the year in this sample? Justify your answer.
Reveal Answer
Sample proportion ,
That is, , as the sample size is large
| Descriptor | Marks |
|---|---|
states the distribution is normal as the sample size is large | 1 |
gives the value of the mean | 1 |
gives the value of the variance | 1 |
What is the probability that the sample proportion of businesses that fail by the end of the year is less than 0.18?
Reveal Answer
| Descriptor | Marks |
|---|---|
uses the correct value of mean and standard deviation | 1 |
obtains the correct probability | 1 |
By January 2019, 90 of the 500 new businesses had failed. Calculate a 95% confidence interval for the proportion of new businesses that fail in the first year.
Reveal Answer
Sample proportion
95% confidence interval
| Descriptor | Marks |
|---|---|
calculates the lower bound of the interval correctly | 1 |
calculates the upper bound of the interval correctly | 1 |
Calculate the sample proportion and its margin of error at the 95% confidence level.
Reveal Answer
Sample proportion
| Descriptor | Marks |
|---|---|
calculates the sample proportion correctly | 1 |
calculates correctly | 1 |
Calculate a 95% confidence interval for the proportion of businesses that failed. What do you conclude regarding the value of the financial advice provided to the new businesses?
Reveal Answer
Sample proportion
95% confidence interval
Comparing this confidence interval with the previous one , we can see that they overlap.
Therefore, it does not appear that the financial advice has reduced the proportion of businesses that fail in the first year.
| Descriptor | Marks |
|---|---|
calculates the lower bound of the interval correctly | 1 |
calculates the upper bound of the interval correctly | 1 |
conclusion refers to the confidence intervals overlapping | 1 |
states the correct conclusion | 1 |
If the sample size was reduced, what would be the effect on the confidence interval? Justify your answer.
Reveal Answer
The width of the confidence interval would be increased, as the margin of error of the sample proportion will increase, thus increasing the error.
| Descriptor | Marks |
|---|---|
claims the width of the interval would increase | 1 |
refers to the increase in margin of error or increase in error | 1 |
State two assumptions that the analyst made in recommending the use of the binomial model in this case and discuss whether they are valid.
Reveal Answer
- We assume that the businesses fail independently of each other. This is unlikely to be valid because:
(a) two similar businesses may both fail or both survive
(b) if two similar businesses in an area then one may dominate and the other fails. - We assume that the probability of a business failing is the same for all businesses. This is unlikely to be valid, as businesses of different types are expected to have different probabilities of failing.
- The probability of failure is fixed (for the year). This is unlikely to be true, and the probability will depend on changing conditions over time.
| Descriptor | Marks |
|---|---|
states one assumption | 1 |
states with reasons that assumption is unlikely to be valid | 1 |
states second assumption | 1 |
states with reasons that assumption is unlikely to be valid | 1 |
A suburban council hires a consultant to estimate the proportion of residents of the suburb who use its library.
The consultant decides to estimate a 95% confidence interval for the proportion to within an error of 0.01. What minimum sample size should be selected?
Reveal Answer
| Descriptor | Marks |
|---|---|
uses the correct z-value | 1 |
uses 0.5 in the expression for standard error | 1 |
determines the sample size (as an integer) | 1 |
If resource limitations dictate that the maximum sample size that can be managed is 500, what is the maximum margin of error in estimating a 99% confidence interval?
Reveal Answer
that is, within 5.8%
| Descriptor | Marks |
|---|---|
uses the correct z-value | 1 |
uses 0.5 in the expression for standard error | 1 |
calculates the error | 1 |
The consultant decides to select the sample by standing on the roadside outside the library at lunchtime and asking a random sample of the passers-by whether they use the library.
Identify and explain two possible sources of bias with this sampling scheme.
Reveal Answer
- The sample is at a fixed time, so only people around at that time will be sampled.
- The location is fixed, so:
(i) only people at that location will be sampled or
(ii) not everyone from the suburb will pass by that area, so this is not a random sample of the residents.
| Descriptor | Marks |
|---|---|
identifies one possible source of bias | 1 |
explains why it is a possible source of bias | 1 |
identifies another possible source of bias | 1 |
explains why it is a possible source of bias | 1 |
A book called 'Why I Love Mathematics' is having its first print run. This is scheduled to last for one week, using four printing presses. The publisher claims that, historically, 10 books have printing errors for every two hundred that are printed.
A sample of 200 books is to be chosen to determine how many contained errors. The proposed sampling procedure is to select the first 200 books printed over a 6-hour window using the newest printing press.
Identify and explain one possible source of bias in the proposed sampling procedure.
Reveal Answer
Answers could include:
- the sample only includes books from the newest printing press, which may perform differently to the other three presses
- the sample was gathered over a single narrow time period, so books printed later in the week are not included (this may involve different operators and the performance of the presses might change during their use over the week).
| Descriptor | Marks |
|---|---|
identifies a source of bias | 1 |
provides a correct explanation for that source | 1 |
Identify two changes to the sampling procedure that would reduce bias.
Reveal Answer
The sample could be randomly selected:
- from books printed across the entire duration of the print run
- from all printing presses.
| Descriptor | Marks |
|---|---|
suggests a change to improve randomisation across the entire print run | 1 |
suggests a second change to improve randomisation across the entire print run | 1 |
Assume that the sample was gathered using an improved procedure, and that the publisher's claim is correct.
Use the approximate normality of the distribution of sample proportions to determine the probability that the sample proportion of books with errors is less than 0.04.
Reveal Answer
The distribution of can be approximated as
Hence,
| Descriptor | Marks |
|---|---|
calculates the correct distribution parameters (mean and standard deviation/variance) | 1 |
calculates the correct probability | 1 |
In a different random sample of 600 books, it is found that the proportion of books containing an error is 0.1, with a margin of error of 0.024.
Determine a 95% confidence interval for the proportion of books that will have printing errors.
Reveal Answer
A 95% confidence interval is given by
| Descriptor | Marks |
|---|---|
correctly calculates confidence interval | 1 |
On the basis of the confidence interval determined in part (c), is the proportion of books with printing errors different from what was claimed by the publisher?
Reveal Answer
The proportion of books with errors claimed by the publisher was . This proportion is not within the confidence interval and so there is sufficient evidence to conclude that the claim of the publisher is incorrect at the 95% confidence level.
| Descriptor | Marks |
|---|---|
states that the claimed proportion is not within the confidence interval | 1 |
states that there is sufficient evidence at the above confidence level to conclude that the claimed proportion is incorrect | 1 |
Suggest two changes that could be made in order to decrease the margin of error of the confidence interval.
Reveal Answer
The margin of error could be decreased by
- increasing the size of the sample
- decreasing the confidence level.
| Descriptor | Marks |
|---|---|
states one possible change | 1 |
states a second possible change | 1 |
Determine the minimum sample size that would be necessary to guarantee that the margin of error of the resulting 95% confidence interval was at most 0.02.
Reveal Answer
For the worst-case scenario set . Solving for a margin of error equal to 0.02 gives
A sample size of at least 2401 would ensure the margin of error is at most 0.02.
| Descriptor | Marks |
|---|---|
sets and provides a correct expression for the sample size | 1 |
correctly solves for the minimum sample size. | 1 |