How many SCSA Mathematics Specialist questions cover Statistical inference?

AusGrader has 117 SCSA Mathematics Specialist questions on Statistical inference, all with instant AI grading and detailed marking feedback.

WACE Mathematics Specialist — Statistical inference Questions & Answers

Q19

2021

QCAA

Paper 2

7 marks

Q19

7 marks

Consider the following information.

	mean	variance
Continuous random variable $X$	$E(X) = \mu = \int_{-\infty}^{\infty} x p(x)dx$	$Var(X) = \int_{-\infty}^{\infty} (x-\mu)^2 p(x)dx$

The waiting time (minutes) until workers at a certain call centre receive their $n$ th phone call, where $n \in Z^+$ , is a random variable $T$ with probability density function

$f(t) = \begin{cases} \frac{k^n t^{n-1}}{(n-1)!} e^{-\frac{t}{3}}, & t \ge 0 \\ 0 & , \text{otherwise} \end{cases}$

where $k$ is a positive constant.

The waiting time until workers receive their 5th call is collected from a random sample of 80 workers.
Determine the probability that the mean waiting time from this sample is more than 16 minutes.

Reveal Answer

Using the property of a PDF
$\int_{-\infty}^{\infty} p(x) dx = 1$
Using $n=5$ in the given PDF
$\int_0^\infty \frac{k^5 t^4}{4!} e^{-\frac{t}{3}} dt = 1$

Solving the equation: $k = \frac{1}{3}$
Mean of distribution for waiting time until 5th call, $\mu$
$E(X) = \int_{-\infty}^{\infty} x p(x) dx$
$\mu = \int_0^\infty t \frac{\left(\frac{1}{3}\right)^5 t^4}{4!} e^{-\frac{t}{3}} dt = \int_0^\infty \frac{\left(\frac{1}{3}\right)^5 t^5}{4!} e^{-\frac{t}{3}} dt$
$= 15 \text{ minutes}$

Variance of distribution for 5th call
$Var(X) = \int_{-\infty}^{\infty} (x-\mu)^2 p(x) dx$
$= \int_0^\infty (t-15)^2 \frac{\left(\frac{1}{3}\right)^5 t^4}{4!} e^{-\frac{t}{3}} dt = 45 \text{ minutes}^2$
$\therefore \sigma = \sqrt{45} \text{ minutes}$

Consider the distribution of the sample mean of the waiting time until the 5th phone call is received, $\bar{T}$ .
As the sample size is large, the distribution of $\bar{T}$ can be considered normal.
$\mu_{\bar{T}} = 15$ and $\sigma_{\bar{T}} = \frac{\sigma}{\sqrt{n}} = \frac{\sqrt{45}}{\sqrt{80}} = 0.75$

Using normal cdf on GDC: $P(\bar{T} > 16) \approx 0.09$

Marking Criteria

Descriptor	Marks
Correctly determines equation in terms of k	1
Solves equation to determine k	1
Determines population mean	1
Determine population variance	1
Justifies that the distribution of T can be considered normal	1
Determines mean and standard deviation of the sample mean	1
Determines required probability	1

Q10

2022

QCAA

Paper 2

1 mark

Q10

1 mark

In a town, the mean number of residents per household is 3.79 people with a standard deviation of 1.47 people.
Using a random sample of 45 households from the town, determine the probability that the mean number of residents per household will be more than 4.

A

0.17

B

0.33

C

0.83

D

0.96

Reveal Answer

A

0.17

Correct Answer

First, calculate the z-score: $z = \frac{4 - 3.79}{1.47/\sqrt{45}} \approx 0.96$ . The probability $P(Z > 0.96)$ is $1 - 0.8315 \approx 0.17$ .

B

0.33

This value does not result from the standard normal distribution calculation using the Central Limit Theorem parameters provided.

C

0.83

This represents the probability that the mean is less than 4 ( $P(Z < 0.96) \approx 0.83$ ). You must subtract this from 1 to find the probability of being more than 4.

D

0.96

This value is the calculated z-score ( $z \approx 0.96$ ), not the probability associated with that z-score.

Q12

2022

SCSA

Paper 2

9 marks

Q12

The inner diameter of a cylinder in a motor car engine is critical to its performance. Let $\mu$ mm denote the population mean cylinder diameter produced by a manufacturing process. A random sample, $R_1$ , of 100 cylinder diameters is taken and the standard deviation for this sample was found to be 1 mm.

Let $\overline{X} =$ the sample mean cylinder diameter for sample $R_1$ .

Q12c

From random sample $R_1$ , a 95% confidence interval for $\mu$ is formed.

Q12a

3 marks

State the distribution for $\overline{X}$ and its parameters.

Reveal Answer

Since $n = 100 > 30$ , then $\overline{X} \sim N\left(\mu, \sigma_{\overline{X}}^2\right)$

i.e. normally distributed and centred with a mean $\mu$ and estimated standard deviation of the sample mean
$\sigma\left(\overline{X}\right) = \frac{1}{\sqrt{100}} = 0.1 \text{ mm}.$

Marking Criteria

Descriptor	Marks
states that the sample mean will be normally distributed	1
states the mean of the distribution is $\mu$	1
states the expected standard deviation is $0.1 \text{ mm}$	1

Q12b

2 marks

What is the probability that $\overline{X}$ differs from $\mu$ by more than 0.2 mm. Give your answer correct to 0.001.

Reveal Answer

$P\left(\left|\overline{X} - \mu\right| > 0.2\right) = P(|z| > 2) = 2(0.023) = 0.046$

Marking Criteria

Descriptor	Marks
forms the correct probability statement in terms of $z$	1
calculates the probability correct to $0.001$	1

Q12c

2 marks

Calculate the width of this confidence interval, correct to 0.001.

Reveal Answer

Width $w = 2(k)\left(\sigma\left(\overline{X}\right)\right) = 2(1.96)(0.1) = 0.392$

Marking Criteria

Descriptor	Marks
forms the correct expression for the width	1
calculates the width correctly	1

Q12d

2 marks

Lilian, the production manager, wishes to decrease the width of the confidence interval. She suggests:

"We can form sample $R_2$ by using the data from sample $R_1$ and then combining this data with itself to form a sample with 200 observations. Using $n = 200$ will decrease the width of the confidence interval."

State two major problems with using this idea.

Reveal Answer

Any 2 of the following:

The idea simply replicates (repeats) the data in sample $R_1$ . As such the sample $R_2$ is no longer random. Therefore the assumptions for using the normal distribution for the sample mean does not hold anymore.
Repeating the data values in sample $R_1$ will not reflect the true random variation in the data (manufacturing process). The confidence interval will therefore not be a true representation of the variation in the sample mean.
The sample mean and standard deviation will change if a new larger sample is taken. However, with Lillian's idea these will not change. This will affect both the width and location of the confidence interval.
Replicating the sample does NOT decrease the width of the confidence interval. Let the sample observations for $R_1$ be $x_1, x_2, \dots, x_{100}$ .

\begin{align*} \overline{X_2} = \frac{\sum_{i=1}^{100} 2x_i}{200} = \overline{X_1} \quad \text{similarly } s_1^2 = s_2^2 \quad \text{so } \overline{X_2} \sim N\left(\mu, 0.1^2\right). \end{align*}

Marking Criteria

Descriptor	Marks
states that the $R_2$ data will no longer be a random sample	1
states that the assumptions for using the normal distribution for the sample mean does not hold anymore (i.e. states any two of the four points outlined in the solution)	1

Q11

2024

QCAA

Paper 2

4 marks

Q11

A company claims that the mean battery life of their latest model of smartphone is 9.5 hours.
To test this claim, the battery lives of a random sample of 40 of the smartphones were measured.
A sample mean of 9.31 hours and a standard deviation of 0.52 hours were calculated from this data.

Q11a

1 mark

Determine an approximate 95% confidence interval for $\mu$ . Give your answer to at least two decimal places.

Reveal Answer

Given $n=40, \bar{x}=9.31$ and $s=0.52$

Using GDC

$CI(95\%) = (9.15, 9.47)$ hours

Marking Criteria

Descriptor	Marks
correctly calculates 95% confidence interval to at least two decimal places	1

Q11b

1 mark

Determine an approximate 99% confidence interval for $\mu$ . Give your answer to at least two decimal places.

Reveal Answer

Using GDC

$CI(99\%) = (9.10, 9.52)$ hours

Marking Criteria

Descriptor	Marks
correctly calculates 99% confidence interval to at least two decimal places	1

Q11c

2 marks

A manager comments that either confidence interval could be used to support the company’s claim.
Use your results from Questions 11a) and 11b) to evaluate the reasonableness of the manager’s comment. Justify your decision using mathematical reasoning.

Reveal Answer

The 95% confidence interval does not include the claimed mean battery life of 9.5 hours, although the 99% CI does.

So the comment is not reasonable.

Marking Criteria

Descriptor	Marks
justifies decision using mathematical reasoning	1
provides appropriate statement of reasonableness	1

Q2

2024

QCAA

Paper 2

1 mark

Q2

1 mark

Rounded to two decimal places, the z-value used in the calculation of an approximate 95% confidence interval for $\mu$ is

A

0.95

B

1.64

C

1.96

D

2.58

Reveal Answer

A

0.95

This value represents the confidence level itself (0.95), not the critical z-score derived from the standard normal distribution.

B

1.64

This z-value (approximately 1.645) is typically used for a 90% confidence interval, corresponding to a tail area of 0.05.

C

1.96

Correct Answer

For a 95% confidence interval, the significance level is $\alpha = 0.05$ . The critical value $z_{\alpha/2}$ leaves $0.025$ in the upper tail, which corresponds to $1.96$ .

D

2.58

This z-value is typically used for a 99% confidence interval, corresponding to a tail area of 0.005.

Q1

2021

QCAA

Paper 2

1 mark

Q1

1 mark

The time taken to complete orders at a pizza store is normally distributed with a mean time ( $\mu$ ) of 10 minutes.
The owner of the pizza store records the time taken to complete orders for a random sample of 20 pizzas each day over a 30-day period. From this data, an approximate 90% confidence interval for $\mu$ is calculated at the end of each day.
How many of these confidence intervals would be expected to contain $\mu$ ?

A

3

B

18

C

27

D

30

Reveal Answer

A

3

This represents $10\%$ of the 30 days ( $0.10 \times 30 = 3$ ). This is the expected number of intervals that would \textit{fail} to contain the mean, not the number that would contain it.

B

18

This represents only $60\%$ of the 30 days ( $0.60 \times 30 = 18$ ). Given a $90\%$ confidence level, the expected number of successful intervals should be higher.

C

27

Correct Answer

By definition, a $90\%$ confidence interval is expected to contain the true population parameter $90\%$ of the time in repeated sampling. Therefore, the expected number is $0.90 \times 30 = 27$ .

D

30

This assumes that every single interval will contain the mean ( $100\%$ ). While possible, the expected value is determined by the specific confidence level of $90\%$ , not $100\%$ .

Q18

2020

SCSA

Paper 2

11 marks

Q18

The mass of chocolate that is placed into each biscuit produced by the BikkiesAreUs company has been observed to be normally distributed with mean $\mu = 7.5$ grams and standard deviation $\sigma = 1.5$ grams.

Q18a

4 marks

Determine the probability, correct to 0.01, that the total amount of chocolate used for 50 biscuits is less than 365 grams.

Reveal Answer

\begin{align*} \text{Let } \overline{M} &= \text{sample mean for the mass of chocolate per biscuit for 50 biscuits (g)}\\ &= N\left(7.5, \sigma_{\overline{M}}^2\right) \text{ where } \sigma_{\overline{M}} = \frac{1.5}{\sqrt{50}} = 0.21213... \end{align*}

$\text{For a total of 365 g, the sample mean } \overline{M} = \frac{365}{50} = 7.3 \text{ grams per biscuit}$

$\text{Require } P\left(\overline{M} < 7.3\right) = 0.1729... = 0.17$

Marking Criteria

Descriptor	Marks
states that the sample mean is a normal random variable	1
states the correct parameters for the normal random variable	1
calculates the sample mean correctly for the total 365 grams	1
determines the correct probability (to 0.01)	1

Q18b

3 marks

If the probability that the mean amount of chocolate used per biscuit differs from $\mu$ by less than 0.2 grams is 98%, determine $n$ , the number of biscuits that need to be sampled.

Reveal Answer

$\sigma_{\overline{M}} = \frac{1.5}{\sqrt{n}}$

$\text{We require } P(-k < z < k) = 0.98 \text{ this gives } k = 2.326$

$\text{Hence } 2.326\left(\frac{1.5}{\sqrt{n}}\right) < 0.2 \implies \text{Solving gives } n > 304.32$

i.e. we require at least 305 biscuits to have the sample mean differ by less than 0.2 grams

Marking Criteria

Descriptor	Marks
uses the standard $z$ score that represents 98% confidence	1
forms the correct inequality/equation to solve for $n$	1
states the correct minimum integer value for $n$	1

Q18c

4 marks

A competitor company called YouBeautChokkies produces similar biscuits. A sample of 144 biscuits was taken and it was found that the standard deviation of the mass of chocolate used in each biscuit was 1.8 grams and the total amount of chocolate used in the sample of 144 biscuits was 1.09 kg.

Charlie Chokka, a representative from the YouBeautChokkies company, stated that "we are using significantly more chocolate for each biscuit than BikkiesAreUs. If you want that real chocolate taste, then buy from us!"

Perform the necessary calculations to comment on Charlie's claim.

Reveal Answer

Let $\mu_Y$ be the population mean for the mass of chocolate per biscuit for the YBC company (grams).

For the YBC total of 1090 grams, this gives $\overline{M} = 7.56944...$ grams.

$\text{The distribution for } \overline{M} \sim N\left(7.56944, \sigma_{\overline{M}}^2\right) \text{ where } \sigma_{\overline{M}} = \frac{1.8}{\sqrt{144}} = 0.15$

$\text{Confidence Interval for } \mu_Y \text{ 95\% level :}$

\begin{align*} 7.56944 - 1.96\left(\sigma_{\overline{M}}\right) &< \mu_Y < 7.56944 + 1.96\left(\sigma_{\overline{M}}\right)\\ \text{i.e. } 7.2754 &< \mu_Y < 7.8634 \end{align*}

$\text{Confidence Interval for } \mu_Y \text{ 99\% level :}$

\begin{align*} 7.56944 - 2.576\left(\sigma_{\overline{M}}\right) &< \mu_Y < 7.56944 + 2.576\left(\sigma_{\overline{M}}\right)\\ 7.1830 &< \mu_Y < 7.9558 \end{align*}

$\text{The BAU population mean } \mu = 7.5$ is WITHIN the confidence interval using $\overline{M} = 7.56944 \text{ and } \sigma = 1.8$ . i.e. the claim is NOT vindicated.

i.e. the YBC company are NOT using significantly more chocolate per biscuit than compared to BAU.

Marking Criteria

Descriptor	Marks
determines the expected variation using $n = 144$	1
determines an appropriate confidence interval for the YouBeautChokkies population mean	1
states that the BikkiesAreUs population mean 7.5 is within the confidence interval	1
concludes correctly by writing a comment about the claim	1

Q15

2023

QCAA

Paper 2

7 marks

Q15

The travel time for students attending a certain university is assumed to be normally distributed, with a population mean of 25.2 minutes and standard deviation of 4.7 minutes.

Travel times are collected from a random sample of 120 of these students and used to calculate a sample mean, $\bar{X}_1$ , in minutes.

Q15a

2 marks

Determine $P(\bar{X}_1 \leq 25)$ .

Reveal Answer

Given $\mu_{\bar{x}} = 25.2$
$\sigma_{\bar{x}_1} = \frac{\sigma}{\sqrt{n}} = \frac{4.7}{\sqrt{120}}$
$= 0.429 \text{ minutes}$

Using GDC
$P(\bar{X}_1 \le 25) = 0.32$

Marking Criteria

Descriptor	Marks
correctly calculates $\sigma_{\bar{x}}$ for the first sample	1
calculates required probability	1

Q15b

1 mark

Given $P(\bar{X}_1 > k) = 0.9$ , determine the value of $k$ .

Reveal Answer

$P(\bar{X}_1 > k) = 0.9$
Using GDC
$k = 24.65$ minutes

Marking Criteria

Descriptor	Marks
calculates $k$	1

Q15c

4 marks

Travel times are collected from a second random sample of the university's students and used to calculate a second sample mean, $\bar{X}_2$ , in minutes.

Given $P(\bar{X}_2 \leq 25) \approx 0.4$ , determine the number of students in the second sample.

Reveal Answer

$P(z \le z_1) \approx 0.4 \Rightarrow z_1 = -0.253$
$z = \frac{\bar{X}_2 - \mu}{\frac{\sigma}{\sqrt{n}}}$
$-0.253 = \frac{25 - 25.2}{\frac{4.7}{\sqrt{n}}}$

Using GDC
$n \approx 35.3$

The sample size is 35.

Marking Criteria

Descriptor	Marks
correctly calculates the z-value based on given probability	1
determines an equation in terms of the sample size (n)	1
determines an approximate value of n	1
evaluates the reasonableness of the solution by rounding n to an integer value	1

Q9

2022

QCAA

Paper 1

1 mark

Q9

1 mark

A random variable $X$ is normally distributed with a mean of 36 and a standard deviation of 4.
The respective mean and standard deviation of the distribution of $\bar{X}$ from repeated random samples of size 9 are

A

4 and $\frac{4}{9}$

B

4 and $\frac{4}{3}$

C

36 and $\frac{4}{9}$

D

36 and $\frac{4}{3}$

Reveal Answer

A

4 and $\frac{4}{9}$

This is incorrect because the mean of the sampling distribution should equal the population mean (36), not the population standard deviation (4). Additionally, the standard deviation is calculated incorrectly as $\frac{\sigma}{n}$ .

B

4 and $\frac{4}{3}$

This is incorrect because the mean of the sampling distribution is equal to the population mean (36), not 4. However, the standard deviation value of $\frac{4}{3}$ is calculated correctly.

C

36 and $\frac{4}{9}$

This is incorrect because the standard deviation of the sample mean (standard error) is calculated as $\frac{\sigma}{n}$ instead of the correct formula $\frac{\sigma}{\sqrt{n}}$ .

D

36 and $\frac{4}{3}$

Correct Answer

The mean of the sampling distribution $\mu_{\bar{X}}$ equals the population mean (36), and the standard deviation $\sigma_{\bar{X}}$ is calculated as $\frac{\sigma}{\sqrt{n}} = \frac{4}{\sqrt{9}} = \frac{4}{3}$ .

Q17

2021

SCSA

Paper 2

12 marks

Q17

A researcher is interested in estimating the population mean $\mu$ (dollars) that Perth residents had spent via online shopping in December 2020. A random sample of size $n$ gave a sample mean of ＄400, a sample standard deviation $s$ and a 95% confidence interval of width ＄200.

Q17e

Four different confidence intervals (A, B, C and D) are obtained for the mean amount spent via online shopping by Perth residents in December 2020.

Confidence interval	Sample size	Sample standard deviation	Confidence level
A	$n$	$s$	95%
B	$n$	$s$	99%
C	$2n$	$s$	95%
D	$n$	$0.8s$	95%

Q17f

For each of the following, state the confidence interval that has the smaller width. Justify your answers.

Q17a

1 mark

State the 95% confidence interval obtained.

Reveal Answer

$95\% \text{ CI: } 400 - \frac{200}{2} \le \mu \le 400 + \frac{200}{2}$

$\text{i.e. } 300 \le \mu \le 500$

Marking Criteria

Descriptor	Marks
states the upper and lower limits of the interval correctly	1

Q17b

2 marks

Calculate the standard deviation of the sample mean, correct to ＄0.01.

Reveal Answer

Margin of error $= 100$

i.e. $100 = 1.96 \times \sigma\left(\overline{X}\right) \implies \therefore \sigma\left(\overline{X}\right) = 51.02$

Marking Criteria

Descriptor	Marks
forms the equation relating the margin of error and the standard deviation correctly	1
determines the standard deviation correctly to 0.01	1

Q17c

2 marks

In terms of $n$ , what sample size would yield a 95% confidence interval of width ＄50? Show your reasoning.

Reveal Answer

The interval width is reduced by a factor of 4, so the sample size needs to increase by a factor of $4^2 = 16$

i.e. a sample size of $16n$ is required.

Marking Criteria

Descriptor	Marks
uses an interval width equal to one-quarter the original	1
states the new sample size in terms of $n$	1

Q17d

3 marks

What is the probability that another sample of size $2n$ would produce a sample mean that differs from $\mu$ by more than ＄50?

Reveal Answer

$\overline{X} \sim N\left(\mu, \frac{s^2}{2n}\right) \implies \text{i.e. } \overline{X} \sim N\left(\mu, \frac{51.02^2}{2}\right) \implies \therefore \sigma\left(\overline{X}\right) = 36.0768...$

\begin{align*} \text{Require }P\left(\left|\overline{X} - \mu\right| > 50\right) &= P\left(|z| > \frac{50}{36.0768}\right)\\ &= P\left(|z| > 1.3859\right)\\ &= 2(0.083)\\ &= 0.166 \end{align*}

Marking Criteria

Descriptor	Marks
determines the standard deviation for the sample size $2n$ correctly	1
forms the correct probability statement	1
calculates the correct probability	1

Q17e

2 marks

Which of the confidence intervals (A, B, C or D) contains $\mu$ , the population mean expenditure for online shopping in December 2020? Justify your answer.

Reveal Answer

Since the true value of $\mu$ is unknown, we CANNOT determine which interval contains the true mean. This is due to the inherent nature of random sampling.

Marking Criteria

Descriptor	Marks
states we cannot determine which interval contains $\mu$	1
states that either $\mu$ is unknown OR refers to the nature of random sampling	1

Q17f (i)

1 mark

... A and B.

Reveal Answer

Confidence interval A will have the smaller width since the level of confidence 95% is less than that of B 99%.

Marking Criteria

Descriptor	Marks
justifies why A will have the smaller width	1

Q17f (ii)

1 mark

... (ii) C and D.

Reveal Answer

Need to compare the standard deviation of the sample means:

$C: \quad \sigma\left(\overline{X}\right) = \frac{s}{\sqrt{2n}} = 0.707\left(\frac{s}{\sqrt{n}}\right)$
$D: \quad \sigma\left(\overline{X}\right) = \frac{0.8s}{\sqrt{n}} = 0.8\left(\frac{s}{\sqrt{n}}\right)$

Hence confidence interval C will have the smaller width.

Marking Criteria

Descriptor	Marks
justifies why C will have the smaller width by correctly comparing the respective standard deviations of the sample mean	1

Q18

2021

VCAA

Paper 2

1 mark

Q18

1 mark

A scientist investigates the distribution of the masses of fish in a particular river. A 95% confidence interval for the mean mass of a fish, in grams, calculated from a random sample of 100 fish is (70.2, 75.8).

The sample mean divided by the population standard deviation is closest to

A

1.3

B

2.6

C

5.1

D

10.2

E

13.0

Reveal Answer

A

1.3

This is incorrect. The sample mean is 73 and the population standard deviation is approximately 14.29, which does not yield a ratio of 1.3.

B

2.6

This is incorrect. This value is half of the correct ratio, which might result from incorrectly using the full interval width (5.6) instead of the margin of error (2.8) to calculate the standard deviation.

C

5.1

Correct Answer

This is correct. The sample mean is the midpoint of the interval, $\bar{x} = 73$ . The margin of error is 2.8, so $2.8 = 1.96 \times \frac{\sigma}{\sqrt{100}}$ , giving $\sigma \approx 14.29$ . The ratio is $\frac{73}{14.29} \approx 5.1$ .

D

10.2

This is incorrect. This is double the correct ratio, likely resulting from forgetting to divide the interval width by 2 when calculating the margin of error, which would incorrectly halve the calculated standard deviation.

E

13.0

This is incorrect. This value does not represent the ratio of the sample mean (73) to the population standard deviation ( $\approx 14.29$ ).

Q9

2020

QCAA

Paper 1

1 mark

Q9

1 mark

The scores on a test are assumed to be normally distributed.
Researchers use the results from a random sample of scores to calculate a confidence interval for the population mean. However, a shorter confidence interval width is required so the researchers decide to use a second sample for their calculations.
Assuming that the standard deviations for both samples are the same, the researchers can ensure that a shorter confidence interval width is produced by

A

decreasing the sample size and decreasing the confidence level.

B

decreasing the sample size and increasing the confidence level.

C

increasing the sample size and decreasing the confidence level.

D

increasing the sample size and increasing the confidence level.

Reveal Answer

A

decreasing the sample size and decreasing the confidence level.

Decreasing the sample size increases the standard error ( $\frac{\sigma}{\sqrt{n}}$ ), which widens the interval and counteracts the narrowing effect of a lower confidence level.

B

decreasing the sample size and increasing the confidence level.

Both decreasing the sample size and increasing the confidence level contribute to a wider confidence interval, not a shorter one.

C

increasing the sample size and decreasing the confidence level.

Correct Answer

A confidence interval width is determined by $2 \times z^* \times \frac{\sigma}{\sqrt{n}}$ . Increasing the sample size ( $n$ ) reduces the standard error, and decreasing the confidence level reduces the critical value ( $z^*$ ), both of which shorten the interval.

D

increasing the sample size and increasing the confidence level.

Increasing the confidence level requires a larger critical value, which widens the interval and opposes the narrowing effect of the increased sample size.

Q1

2024

QCAA

Paper 1

1 mark

Q1

1 mark

Repeated random samples will be used to calculate a large number of 90% confidence intervals for a population mean $\mu$ .

Which statement best describes the possible outcomes?

A

Approximately 90% of the intervals will contain $\mu$ .

B

More than 90% of the intervals will contain $\mu$ .

C

Less than 90% of the intervals will contain $\mu$ .

D

Exactly 90% of the intervals will contain $\mu$ .

Reveal Answer

A

Approximately 90% of the intervals will contain $\mu$ .

Correct Answer

This is the definition of a confidence level; in repeated sampling, the proportion of intervals capturing the true population mean $\mu$ will approach the stated confidence level (90%).

B

More than 90% of the intervals will contain $\mu$ .

A 90% confidence level indicates that the method is designed to capture the parameter 90% of the time, not more than that.

C

Less than 90% of the intervals will contain $\mu$ .

A 90% confidence level indicates that the method is designed to capture the parameter 90% of the time, not less than that.

D

Exactly 90% of the intervals will contain $\mu$ .

Due to sampling variability, the actual proportion of intervals capturing $\mu$ in a finite number of samples will likely be close to 90%, but rarely exactly 90%.

Q18

2020

QCAA

Paper 2

6 marks

Q18

6 marks

The mass of a certain species of kangaroo is known to be normally distributed with a mean mass of $\mu$ kg and standard deviation of $\sigma$ kg.
When one of the kangaroos is randomly selected, the probability that its mass is greater than 83.2 kg is 0.145.
When a sample of 12 kangaroos is randomly selected, the probability that the sample mean mass is less than 74.1 kg is 0.079.
A 90% approximate confidence interval for $\mu$ is calculated using a random sample of $n$ of the kangaroos that has a sample mean mass of 79.1 kg and a sample standard deviation equal to $\sigma$ .
Determine the possible range of values that $n$ could have been, given that the confidence interval did not contain $\mu$ .

Reveal Answer

Sample 1: $n = 1$
$P(X > 83.2) = 0.145$
$P\left(z > \frac{83.2-\mu}{\sigma}\right) = 0.145$
$\frac{83.2 - \mu}{\sigma} = 1.058$
$\mu = 83.2 - 1.058\sigma \quad ... (1)$

Sample 2: $n = 12$
$P(\bar{X} < 74.1) = 0.079$
$P\left(z < \frac{74.1-\mu}{\frac{\sigma}{\sqrt{12}}}\right) = 0.079$
$\frac{74.1 - \mu}{\frac{\sigma}{\sqrt{12}}} = -1.412$
$\mu = 74.1 + \frac{1.412\sigma}{\sqrt{12}} \quad ... (2)$

Using graph facility of GDC to solve (1) and (2)
$\mu = 76.63 \text{ kg}, \sigma = 6.21 \text{ kg}$

Sample 3: Consider the 90% CI
Since $\bar{x} = 79.1, \mu = 76.63$ can only lie in an interval below the lower bound of CI.
Determining $n$ where the lower bound of CI $= \mu$
$\bar{x} - z\frac{s}{\sqrt{n}} = 76.63$
$79.1 - 1.64 \times \frac{6.21}{\sqrt{n}} = 76.63$
Using solve facility of GDC, $n \approx 17.1$
As $\mu$ must lie in an interval below the lower bound of CI, the range of values is $n \ge 18$ where $n \in Z$ .

Marking Criteria

Descriptor	Marks
correctly uses the sample of 1 to determine an equation in terms of μ and σ	1
correctly uses the sample of 12 to determine an equation in terms of μ and σ	1
solves simultaneous equations to determine the values of μ and σ	1
determines solution of n	1
evaluates the reasonableness of the solution to the equation to determine suitable integer values of n	1
shows logical organisation communicating key steps	1

Q15

2023

SCSA

Paper 2

9 marks

Q15

The WeLuvYas Bank extends personal loans to approved customers. A random sample of $n$ personal loans is taken. A 99% confidence interval for the population mean loan $\mu$ (in thousands of dollars) based on this sample is $10.2 < \mu < 25.4$ .

Q15a

2 marks

What is the mean personal loan $\overline{x}$ for this sample?

Reveal Answer

$\overline{x} = \frac{10.2+25.4}{2} = 17.8$

Hence the mean personal loan was ＄17 800.

Marking Criteria

Descriptor	Marks
calculates the midpoint of the confidence interval correctly	1
states the personal loan amount in dollars	1

Q15b

2 marks

Calculate the standard deviation of the sample mean.

Reveal Answer

Half-width $w = 17.8 - 10.2 = 7.6$
$\therefore 7.6 = 2.5758 \times \sigma(\overline{X})$

i.e. $\sigma(\overline{X}) = 2.95$ (2 d.p.)$
i.e. standard deviation is ＄2950

Marking Criteria

Descriptor	Marks
forms the expression for half-width of interval in terms of the standard deviation	1
calculates the standard deviation correctly	1

Q15c

2 marks

Ali exclaims excitedly 'everyone here at WeLuvYas is 99% certain that the true population mean is within the interval $10.2 < \mu < 25.4$ '.

State two reasons why Ali is not correct.

Reveal Answer

Ali is not correct. It can be said that:

A single confidence interval either contains $\mu$ or it doesn't.
The value of $\mu$ is unknown so we do not know if any given CI contains $\mu$ .
If we repeatedly take samples of size $n$ then we will find that approximately 99% of these intervals will contain the true value of $\mu$ .
The value of $n$ may be less than 30, meaning that the distribution of the sample mean may not be distributed, hence the 99% confidence interval may not be valid.

Marking Criteria

Descriptor	Marks
states one reason	1
states a second reason	1

Q15d

3 marks

A data analyst discovers that the sample size was actually $2n$ . In addition to this, the sample mean was actually ＄2000 more than that originally determined.

Re-calculate the 99% confidence interval for the population mean on the basis of the updated information.

Reveal Answer

The confidence interval changes is two ways.

The whole interval is translated upwards by 2.
The standard error $\sigma(\overline{X})$ and consequently the width of the interval is scaled by a factor of $\frac{1}{\sqrt{2}}$ :

\begin{align*} w &= 2.5758 \times \frac{s}{\sqrt{2n}} \\ &= \left(2.5758 \times \frac{s}{\sqrt{n}}\right) \times \frac{1}{\sqrt{2}}\\ &= (7.6) \times \frac{1}{\sqrt{2}} \\ &= 5.37 \end{align*}

\begin{align*} \text{New confidence interval:}\quad 17.8+2 - 5.37 < &\mu < 17.8+2 + 5.37\\ \text{i.e.}\quad 14.43 < &\mu < 25.17 \end{align*}

Marking Criteria

Descriptor	Marks
indicates the midpoint of the confidence interval is increased by 2	1
calculates the new standard deviation correctly	1
calculates the new confidence interval correctly	1

SCSA Mathematics Specialist Statistical inference

Frequently Asked Questions

Ready to practise SCSA Mathematics Specialist?

SCSA Mathematics Specialist Statistical inference

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Sample Answer

Frequently Asked Questions

Ready to practise SCSA Mathematics Specialist?