SCSA Mathematics Specialist Statistical inference

15 sample questions with marking guides and sample answers · Avg. score: 47.3%

Q19
2021
QCAA
Paper 2
7 marks
Q19
7 marks

Consider the following information.

 meanvariance
Continuous random variable XXE(X)=μ=xp(x)dxE(X) = \mu = \int_{-\infty}^{\infty} x p(x)dxVar(X)=(xμ)2p(x)dxVar(X) = \int_{-\infty}^{\infty} (x-\mu)^2 p(x)dx

The waiting time (minutes) until workers at a certain call centre receive their nnth phone call, where nZ+n \in Z^+, is a random variable TT with probability density function

f(t)={kntn1(n1)!et3,t00,otherwisef(t) = \begin{cases} \frac{k^n t^{n-1}}{(n-1)!} e^{-\frac{t}{3}}, & t \ge 0 \\ 0 & , \text{otherwise} \end{cases}

where kk is a positive constant.

The waiting time until workers receive their 5th call is collected from a random sample of 80 workers.
Determine the probability that the mean waiting time from this sample is more than 16 minutes.

Reveal Answer

Using the property of a PDF
p(x)dx=1\int_{-\infty}^{\infty} p(x) dx = 1
Using n=5n=5 in the given PDF
0k5t44!et3dt=1\int_0^\infty \frac{k^5 t^4}{4!} e^{-\frac{t}{3}} dt = 1

Solving the equation: k=13k = \frac{1}{3}
Mean of distribution for waiting time until 5th call, μ\mu
E(X)=xp(x)dxE(X) = \int_{-\infty}^{\infty} x p(x) dx
μ=0t(13)5t44!et3dt=0(13)5t54!et3dt\mu = \int_0^\infty t \frac{\left(\frac{1}{3}\right)^5 t^4}{4!} e^{-\frac{t}{3}} dt = \int_0^\infty \frac{\left(\frac{1}{3}\right)^5 t^5}{4!} e^{-\frac{t}{3}} dt
=15 minutes= 15 \text{ minutes}

Variance of distribution for 5th call
Var(X)=(xμ)2p(x)dxVar(X) = \int_{-\infty}^{\infty} (x-\mu)^2 p(x) dx
=0(t15)2(13)5t44!et3dt=45 minutes2= \int_0^\infty (t-15)^2 \frac{\left(\frac{1}{3}\right)^5 t^4}{4!} e^{-\frac{t}{3}} dt = 45 \text{ minutes}^2
σ=45 minutes\therefore \sigma = \sqrt{45} \text{ minutes}

Consider the distribution of the sample mean of the waiting time until the 5th phone call is received, Tˉ\bar{T}.
As the sample size is large, the distribution of Tˉ\bar{T} can be considered normal.
μTˉ=15\mu_{\bar{T}} = 15 and σTˉ=σn=4580=0.75\sigma_{\bar{T}} = \frac{\sigma}{\sqrt{n}} = \frac{\sqrt{45}}{\sqrt{80}} = 0.75

Using normal cdf on GDC: P(Tˉ>16)0.09P(\bar{T} > 16) \approx 0.09

Marking Criteria
DescriptorMarks

Correctly determines equation in terms of k

1

Solves equation to determine k

1

Determines population mean

1

Determine population variance

1

Justifies that the distribution of T can be considered normal

1

Determines mean and standard deviation of the sample mean

1

Determines required probability

1
Q10
2022
QCAA
Paper 2
1 mark
Q10
1 mark

In a town, the mean number of residents per household is 3.79 people with a standard deviation of 1.47 people.
Using a random sample of 45 households from the town, determine the probability that the mean number of residents per household will be more than 4.

A

0.17

B

0.33

C

0.83

D

0.96

Reveal Answer
A

0.17

Correct Answer

First, calculate the z-score: z=43.791.47/450.96z = \frac{4 - 3.79}{1.47/\sqrt{45}} \approx 0.96. The probability P(Z>0.96)P(Z > 0.96) is 10.83150.171 - 0.8315 \approx 0.17.

B

0.33

This value does not result from the standard normal distribution calculation using the Central Limit Theorem parameters provided.

C

0.83

This represents the probability that the mean is less than 4 (P(Z<0.96)0.83P(Z < 0.96) \approx 0.83). You must subtract this from 1 to find the probability of being more than 4.

D

0.96

This value is the calculated z-score (z0.96z \approx 0.96), not the probability associated with that z-score.

Q12
2022
SCSA
Paper 2
9 marks
Q12

The inner diameter of a cylinder in a motor car engine is critical to its performance. Let μ\mu mm denote the population mean cylinder diameter produced by a manufacturing process. A random sample, R1R_1, of 100 cylinder diameters is taken and the standard deviation for this sample was found to be 1 mm.

Let X=\overline{X} = the sample mean cylinder diameter for sample R1R_1.

Q12c

From random sample R1R_1, a 95% confidence interval for μ\mu is formed.

Q12a
3 marks

State the distribution for X\overline{X} and its parameters.

Reveal Answer

Since n=100>30n = 100 > 30, then XN(μ,σX2)\overline{X} \sim N\left(\mu, \sigma_{\overline{X}}^2\right)

i.e. normally distributed and centred with a mean μ\mu and estimated standard deviation of the sample mean
σ(X)=1100=0.1 mm.\sigma\left(\overline{X}\right) = \frac{1}{\sqrt{100}} = 0.1 \text{ mm}.

Marking Criteria
DescriptorMarks

states that the sample mean will be normally distributed

1

states the mean of the distribution is μ\mu

1

states the expected standard deviation is 0.1 mm0.1 \text{ mm}

1
Q12b
2 marks

What is the probability that X\overline{X} differs from μ\mu by more than 0.2 mm. Give your answer correct to 0.001.

Reveal Answer

P(Xμ>0.2)=P(z>2)=2(0.023)=0.046P\left(\left|\overline{X} - \mu\right| > 0.2\right) = P(|z| > 2) = 2(0.023) = 0.046

Marking Criteria
DescriptorMarks

forms the correct probability statement in terms of zz

1

calculates the probability correct to 0.0010.001

1
Q12c
2 marks

Calculate the width of this confidence interval, correct to 0.001.

Reveal Answer

Width w=2(k)(σ(X))=2(1.96)(0.1)=0.392w = 2(k)\left(\sigma\left(\overline{X}\right)\right) = 2(1.96)(0.1) = 0.392

Marking Criteria
DescriptorMarks

forms the correct expression for the width

1

calculates the width correctly

1
Q12d
2 marks

Lilian, the production manager, wishes to decrease the width of the confidence interval. She suggests:

"We can form sample R2R_2 by using the data from sample R1R_1 and then combining this data with itself to form a sample with 200 observations. Using n=200n = 200 will decrease the width of the confidence interval."

State two major problems with using this idea.

Reveal Answer

Any 2 of the following:

  1. The idea simply replicates (repeats) the data in sample R1R_1. As such the sample R2R_2 is no longer random. Therefore the assumptions for using the normal distribution for the sample mean does not hold anymore.
  2. Repeating the data values in sample R1R_1 will not reflect the true random variation in the data (manufacturing process). The confidence interval will therefore not be a true representation of the variation in the sample mean.
  3. The sample mean and standard deviation will change if a new larger sample is taken. However, with Lillian's idea these will not change. This will affect both the width and location of the confidence interval.
  4. Replicating the sample does NOT decrease the width of the confidence interval. Let the sample observations for R1R_1 be x1,x2,,x100x_1, x_2, \dots, x_{100}.
X2=i=11002xi200=X1similarly s12=s22so X2N(μ,0.12).\begin{align*} \overline{X_2} = \frac{\sum_{i=1}^{100} 2x_i}{200} = \overline{X_1} \quad \text{similarly } s_1^2 = s_2^2 \quad \text{so } \overline{X_2} \sim N\left(\mu, 0.1^2\right). \end{align*}
Marking Criteria
DescriptorMarks

states that the R2R_2 data will no longer be a random sample

1

states that the assumptions for using the normal distribution for the sample mean does not hold anymore (i.e. states any two of the four points outlined in the solution)

1
Q11
2024
QCAA
Paper 2
4 marks
Q11

A company claims that the mean battery life of their latest model of smartphone is 9.5 hours.
To test this claim, the battery lives of a random sample of 40 of the smartphones were measured.
A sample mean of 9.31 hours and a standard deviation of 0.52 hours were calculated from this data.

Q11a
1 mark

Determine an approximate 95% confidence interval for μ\mu. Give your answer to at least two decimal places.

Reveal Answer

Given n=40,xˉ=9.31n=40, \bar{x}=9.31 and s=0.52s=0.52

Using GDC

CI(95%)=(9.15,9.47)CI(95\%) = (9.15, 9.47) hours

Marking Criteria
DescriptorMarks

correctly calculates 95% confidence interval to at least two decimal places

1
Q11b
1 mark

Determine an approximate 99% confidence interval for μ\mu. Give your answer to at least two decimal places.

Reveal Answer

Using GDC

CI(99%)=(9.10,9.52)CI(99\%) = (9.10, 9.52) hours

Marking Criteria
DescriptorMarks

correctly calculates 99% confidence interval to at least two decimal places

1
Q11c
2 marks

A manager comments that either confidence interval could be used to support the company’s claim.
Use your results from Questions 11a) and 11b) to evaluate the reasonableness of the manager’s comment. Justify your decision using mathematical reasoning.

Reveal Answer

The 95% confidence interval does not include the claimed mean battery life of 9.5 hours, although the 99% CI does.

So the comment is not reasonable.

Marking Criteria
DescriptorMarks

justifies decision using mathematical reasoning

1

provides appropriate statement of reasonableness

1
Q2
2024
QCAA
Paper 2
1 mark
Q2
1 mark

Rounded to two decimal places, the z-value used in the calculation of an approximate 95% confidence interval for μ\mu is

A

0.95

B

1.64

C

1.96

D

2.58

Reveal Answer
A

0.95

This value represents the confidence level itself (0.95), not the critical z-score derived from the standard normal distribution.

B

1.64

This z-value (approximately 1.645) is typically used for a 90% confidence interval, corresponding to a tail area of 0.05.

C

1.96

Correct Answer

For a 95% confidence interval, the significance level is α=0.05\alpha = 0.05. The critical value zα/2z_{\alpha/2} leaves 0.0250.025 in the upper tail, which corresponds to 1.961.96.

D

2.58

This z-value is typically used for a 99% confidence interval, corresponding to a tail area of 0.005.

Q1
2021
QCAA
Paper 2
1 mark
Q1
1 mark

The time taken to complete orders at a pizza store is normally distributed with a mean time (μ\mu) of 10 minutes.
The owner of the pizza store records the time taken to complete orders for a random sample of 20 pizzas each day over a 30-day period. From this data, an approximate 90% confidence interval for μ\mu is calculated at the end of each day.
How many of these confidence intervals would be expected to contain μ\mu?

A

3

B

18

C

27

D

30

Reveal Answer
A

3

This represents 10%10\% of the 30 days (0.10×30=30.10 \times 30 = 3). This is the expected number of intervals that would \textit{fail} to contain the mean, not the number that would contain it.

B

18

This represents only 60%60\% of the 30 days (0.60×30=180.60 \times 30 = 18). Given a 90%90\% confidence level, the expected number of successful intervals should be higher.

C

27

Correct Answer

By definition, a 90%90\% confidence interval is expected to contain the true population parameter 90%90\% of the time in repeated sampling. Therefore, the expected number is 0.90×30=270.90 \times 30 = 27.

D

30

This assumes that every single interval will contain the mean (100%100\%). While possible, the expected value is determined by the specific confidence level of 90%90\%, not 100%100\%.

Q18
2020
SCSA
Paper 2
11 marks
Q18

The mass of chocolate that is placed into each biscuit produced by the BikkiesAreUs company has been observed to be normally distributed with mean μ=7.5\mu = 7.5 grams and standard deviation σ=1.5\sigma = 1.5 grams.

Q18a
4 marks

Determine the probability, correct to 0.01, that the total amount of chocolate used for 50 biscuits is less than 365 grams.

Reveal Answer

Let M=sample mean for the mass of chocolate per biscuit for 50 biscuits (g)=N(7.5,σM2) where σM=1.550=0.21213...\begin{align*} \text{Let } \overline{M} &= \text{sample mean for the mass of chocolate per biscuit for 50 biscuits (g)}\\ &= N\left(7.5, \sigma_{\overline{M}}^2\right) \text{ where } \sigma_{\overline{M}} = \frac{1.5}{\sqrt{50}} = 0.21213... \end{align*}

For a total of 365 g, the sample mean M=36550=7.3 grams per biscuit\text{For a total of 365 g, the sample mean } \overline{M} = \frac{365}{50} = 7.3 \text{ grams per biscuit}

Require P(M<7.3)=0.1729...=0.17\text{Require } P\left(\overline{M} < 7.3\right) = 0.1729... = 0.17

Marking Criteria
DescriptorMarks

states that the sample mean is a normal random variable

1

states the correct parameters for the normal random variable

1

calculates the sample mean correctly for the total 365 grams

1

determines the correct probability (to 0.01)

1
Q18b
3 marks

If the probability that the mean amount of chocolate used per biscuit differs from μ\mu by less than 0.2 grams is 98%, determine nn, the number of biscuits that need to be sampled.

Reveal Answer

σM=1.5n\sigma_{\overline{M}} = \frac{1.5}{\sqrt{n}}

We require P(k<z<k)=0.98 this gives k=2.326\text{We require } P(-k < z < k) = 0.98 \text{ this gives } k = 2.326

Hence 2.326(1.5n)<0.2    Solving gives n>304.32\text{Hence } 2.326\left(\frac{1.5}{\sqrt{n}}\right) < 0.2 \implies \text{Solving gives } n > 304.32

i.e. we require at least 305 biscuits to have the sample mean differ by less than 0.2 grams

Marking Criteria
DescriptorMarks

uses the standard zz score that represents 98% confidence

1

forms the correct inequality/equation to solve for nn

1

states the correct minimum integer value for nn

1
Q18c
4 marks

A competitor company called YouBeautChokkies produces similar biscuits. A sample of 144 biscuits was taken and it was found that the standard deviation of the mass of chocolate used in each biscuit was 1.8 grams and the total amount of chocolate used in the sample of 144 biscuits was 1.09 kg.

Charlie Chokka, a representative from the YouBeautChokkies company, stated that "we are using significantly more chocolate for each biscuit than BikkiesAreUs. If you want that real chocolate taste, then buy from us!"

Perform the necessary calculations to comment on Charlie's claim.

Reveal Answer

Let μY\mu_Y be the population mean for the mass of chocolate per biscuit for the YBC company (grams).

For the YBC total of 1090 grams, this gives M=7.56944...\overline{M} = 7.56944... grams.

The distribution for MN(7.56944,σM2) where σM=1.8144=0.15\text{The distribution for } \overline{M} \sim N\left(7.56944, \sigma_{\overline{M}}^2\right) \text{ where } \sigma_{\overline{M}} = \frac{1.8}{\sqrt{144}} = 0.15

Confidence Interval for μY 95% level :\text{Confidence Interval for } \mu_Y \text{ 95\% level :}

7.569441.96(σM)<μY<7.56944+1.96(σM)i.e. 7.2754<μY<7.8634\begin{align*} 7.56944 - 1.96\left(\sigma_{\overline{M}}\right) &< \mu_Y < 7.56944 + 1.96\left(\sigma_{\overline{M}}\right)\\ \text{i.e. } 7.2754 &< \mu_Y < 7.8634 \end{align*}

Confidence Interval for μY 99% level :\text{Confidence Interval for } \mu_Y \text{ 99\% level :}

7.569442.576(σM)<μY<7.56944+2.576(σM)7.1830<μY<7.9558\begin{align*} 7.56944 - 2.576\left(\sigma_{\overline{M}}\right) &< \mu_Y < 7.56944 + 2.576\left(\sigma_{\overline{M}}\right)\\ 7.1830 &< \mu_Y < 7.9558 \end{align*}

The BAU population mean μ=7.5\text{The BAU population mean } \mu = 7.5 is WITHIN the confidence interval using M=7.56944 and σ=1.8\overline{M} = 7.56944 \text{ and } \sigma = 1.8. i.e. the claim is NOT vindicated.

i.e. the YBC company are NOT using significantly more chocolate per biscuit than compared to BAU.

Marking Criteria
DescriptorMarks

determines the expected variation using n=144n = 144

1

determines an appropriate confidence interval for the YouBeautChokkies population mean

1

states that the BikkiesAreUs population mean 7.5 is within the confidence interval

1

concludes correctly by writing a comment about the claim

1
Q15
2023
QCAA
Paper 2
7 marks
Q15

The travel time for students attending a certain university is assumed to be normally distributed, with a population mean of 25.2 minutes and standard deviation of 4.7 minutes.

Travel times are collected from a random sample of 120 of these students and used to calculate a sample mean, Xˉ1\bar{X}_1, in minutes.

Q15a
2 marks

Determine P(Xˉ125)P(\bar{X}_1 \leq 25).

Reveal Answer

Given μxˉ=25.2\mu_{\bar{x}} = 25.2
σxˉ1=σn=4.7120\sigma_{\bar{x}_1} = \frac{\sigma}{\sqrt{n}} = \frac{4.7}{\sqrt{120}}
=0.429 minutes= 0.429 \text{ minutes}

Using GDC
P(Xˉ125)=0.32P(\bar{X}_1 \le 25) = 0.32

Marking Criteria
DescriptorMarks

correctly calculates σxˉ\sigma_{\bar{x}} for the first sample

1

calculates required probability

1
Q15b
1 mark

Given P(Xˉ1>k)=0.9P(\bar{X}_1 > k) = 0.9, determine the value of kk.

Reveal Answer

P(Xˉ1>k)=0.9P(\bar{X}_1 > k) = 0.9
Using GDC
k=24.65k = 24.65 minutes

Marking Criteria
DescriptorMarks

calculates kk

1
Q15c
4 marks

Travel times are collected from a second random sample of the university's students and used to calculate a second sample mean, Xˉ2\bar{X}_2, in minutes.

Given P(Xˉ225)0.4P(\bar{X}_2 \leq 25) \approx 0.4, determine the number of students in the second sample.

Reveal Answer

P(zz1)0.4z1=0.253P(z \le z_1) \approx 0.4 \Rightarrow z_1 = -0.253
z=Xˉ2μσnz = \frac{\bar{X}_2 - \mu}{\frac{\sigma}{\sqrt{n}}}
0.253=2525.24.7n-0.253 = \frac{25 - 25.2}{\frac{4.7}{\sqrt{n}}}

Using GDC
n35.3n \approx 35.3

The sample size is 35.

Marking Criteria
DescriptorMarks

correctly calculates the z-value based on given probability

1

determines an equation in terms of the sample size (n)

1

determines an approximate value of n

1

evaluates the reasonableness of the solution by rounding n to an integer value

1
Q9
2022
QCAA
Paper 1
1 mark
Q9
1 mark

A random variable XX is normally distributed with a mean of 36 and a standard deviation of 4.
The respective mean and standard deviation of the distribution of Xˉ\bar{X} from repeated random samples of size 9 are

A

4 and 49\frac{4}{9}

B

4 and 43\frac{4}{3}

C

36 and 49\frac{4}{9}

D

36 and 43\frac{4}{3}

Reveal Answer
A

4 and 49\frac{4}{9}

This is incorrect because the mean of the sampling distribution should equal the population mean (36), not the population standard deviation (4). Additionally, the standard deviation is calculated incorrectly as σn\frac{\sigma}{n}.

B

4 and 43\frac{4}{3}

This is incorrect because the mean of the sampling distribution is equal to the population mean (36), not 4. However, the standard deviation value of 43\frac{4}{3} is calculated correctly.

C

36 and 49\frac{4}{9}

This is incorrect because the standard deviation of the sample mean (standard error) is calculated as σn\frac{\sigma}{n} instead of the correct formula σn\frac{\sigma}{\sqrt{n}}.

D

36 and 43\frac{4}{3}

Correct Answer

The mean of the sampling distribution μXˉ\mu_{\bar{X}} equals the population mean (36), and the standard deviation σXˉ\sigma_{\bar{X}} is calculated as σn=49=43\frac{\sigma}{\sqrt{n}} = \frac{4}{\sqrt{9}} = \frac{4}{3}.

Q17
2021
SCSA
Paper 2
12 marks
Q17

A researcher is interested in estimating the population mean μ\mu (dollars) that Perth residents had spent via online shopping in December 2020. A random sample of size nn gave a sample mean of $400, a sample standard deviation ss and a 95% confidence interval of width $200.

Q17e

Four different confidence intervals (A, B, C and D) are obtained for the mean amount spent via online shopping by Perth residents in December 2020.

Confidence intervalSample sizeSample standard deviationConfidence level
Annss95%
Bnnss99%
C2n2nss95%
Dnn0.8s0.8s95%
Q17f

For each of the following, state the confidence interval that has the smaller width. Justify your answers.

Q17a
1 mark

State the 95% confidence interval obtained.

Reveal Answer

95% CI: 4002002μ400+200295\% \text{ CI: } 400 - \frac{200}{2} \le \mu \le 400 + \frac{200}{2}

i.e. 300μ500\text{i.e. } 300 \le \mu \le 500

Marking Criteria
DescriptorMarks

states the upper and lower limits of the interval correctly

1
Q17b
2 marks

Calculate the standard deviation of the sample mean, correct to $0.01.

Reveal Answer

Margin of error =100= 100

i.e. 100=1.96×σ(X)    σ(X)=51.02100 = 1.96 \times \sigma\left(\overline{X}\right) \implies \therefore \sigma\left(\overline{X}\right) = 51.02

Marking Criteria
DescriptorMarks

forms the equation relating the margin of error and the standard deviation correctly

1

determines the standard deviation correctly to 0.01

1
Q17c
2 marks

In terms of nn, what sample size would yield a 95% confidence interval of width $50? Show your reasoning.

Reveal Answer

The interval width is reduced by a factor of 4, so the sample size needs to increase by a factor of 42=164^2 = 16

i.e. a sample size of 16n16n is required.

Marking Criteria
DescriptorMarks

uses an interval width equal to one-quarter the original

1

states the new sample size in terms of nn

1
Q17d
3 marks

What is the probability that another sample of size 2n2n would produce a sample mean that differs from μ\mu by more than $50?

Reveal Answer

XN(μ,s22n)    i.e. XN(μ,51.0222)    σ(X)=36.0768...\overline{X} \sim N\left(\mu, \frac{s^2}{2n}\right) \implies \text{i.e. } \overline{X} \sim N\left(\mu, \frac{51.02^2}{2}\right) \implies \therefore \sigma\left(\overline{X}\right) = 36.0768...

Require P(Xμ>50)=P(z>5036.0768)=P(z>1.3859)=2(0.083)=0.166\begin{align*} \text{Require }P\left(\left|\overline{X} - \mu\right| > 50\right) &= P\left(|z| > \frac{50}{36.0768}\right)\\ &= P\left(|z| > 1.3859\right)\\ &= 2(0.083)\\ &= 0.166 \end{align*}
Marking Criteria
DescriptorMarks

determines the standard deviation for the sample size 2n2n correctly

1

forms the correct probability statement

1

calculates the correct probability

1
Q17e
2 marks

Which of the confidence intervals (A, B, C or D) contains μ\mu, the population mean expenditure for online shopping in December 2020? Justify your answer.

Reveal Answer

Since the true value of μ\mu is unknown, we CANNOT determine which interval contains the true mean. This is due to the inherent nature of random sampling.

Marking Criteria
DescriptorMarks

states we cannot determine which interval contains μ\mu

1

states that either μ\mu is unknown OR refers to the nature of random sampling

1
Q17f (i)
1 mark

... A and B.

Reveal Answer

Confidence interval A will have the smaller width since the level of confidence 95% is less than that of B 99%.

Marking Criteria
DescriptorMarks

justifies why A will have the smaller width

1
Q17f (ii)
1 mark

... (ii) C and D.

Reveal Answer

Need to compare the standard deviation of the sample means:

C:σ(X)=s2n=0.707(sn)C: \quad \sigma\left(\overline{X}\right) = \frac{s}{\sqrt{2n}} = 0.707\left(\frac{s}{\sqrt{n}}\right)
D:σ(X)=0.8sn=0.8(sn)D: \quad \sigma\left(\overline{X}\right) = \frac{0.8s}{\sqrt{n}} = 0.8\left(\frac{s}{\sqrt{n}}\right)

Hence confidence interval C will have the smaller width.

Marking Criteria
DescriptorMarks

justifies why C will have the smaller width by correctly comparing the respective standard deviations of the sample mean

1
Q18
2021
VCAA
Paper 2
1 mark
Q18
1 mark

A scientist investigates the distribution of the masses of fish in a particular river. A 95% confidence interval for the mean mass of a fish, in grams, calculated from a random sample of 100 fish is (70.2, 75.8).

The sample mean divided by the population standard deviation is closest to

A

1.3

B

2.6

C

5.1

D

10.2

E

13.0

Reveal Answer
A

1.3

This is incorrect. The sample mean is 73 and the population standard deviation is approximately 14.29, which does not yield a ratio of 1.3.

B

2.6

This is incorrect. This value is half of the correct ratio, which might result from incorrectly using the full interval width (5.6) instead of the margin of error (2.8) to calculate the standard deviation.

C

5.1

Correct Answer

This is correct. The sample mean is the midpoint of the interval, xˉ=73\bar{x} = 73. The margin of error is 2.8, so 2.8=1.96×σ1002.8 = 1.96 \times \frac{\sigma}{\sqrt{100}}, giving σ14.29\sigma \approx 14.29. The ratio is 7314.295.1\frac{73}{14.29} \approx 5.1.

D

10.2

This is incorrect. This is double the correct ratio, likely resulting from forgetting to divide the interval width by 2 when calculating the margin of error, which would incorrectly halve the calculated standard deviation.

E

13.0

This is incorrect. This value does not represent the ratio of the sample mean (73) to the population standard deviation (14.29\approx 14.29).

Q9
2020
QCAA
Paper 1
1 mark
Q9
1 mark

The scores on a test are assumed to be normally distributed.
Researchers use the results from a random sample of scores to calculate a confidence interval for the population mean. However, a shorter confidence interval width is required so the researchers decide to use a second sample for their calculations.
Assuming that the standard deviations for both samples are the same, the researchers can ensure that a shorter confidence interval width is produced by

A

decreasing the sample size and decreasing the confidence level.

B

decreasing the sample size and increasing the confidence level.

C

increasing the sample size and decreasing the confidence level.

D

increasing the sample size and increasing the confidence level.

Reveal Answer
A

decreasing the sample size and decreasing the confidence level.

Decreasing the sample size increases the standard error (σn\frac{\sigma}{\sqrt{n}}), which widens the interval and counteracts the narrowing effect of a lower confidence level.

B

decreasing the sample size and increasing the confidence level.

Both decreasing the sample size and increasing the confidence level contribute to a wider confidence interval, not a shorter one.

C

increasing the sample size and decreasing the confidence level.

Correct Answer

A confidence interval width is determined by 2×z×σn2 \times z^* \times \frac{\sigma}{\sqrt{n}}. Increasing the sample size (nn) reduces the standard error, and decreasing the confidence level reduces the critical value (zz^*), both of which shorten the interval.

D

increasing the sample size and increasing the confidence level.

Increasing the confidence level requires a larger critical value, which widens the interval and opposes the narrowing effect of the increased sample size.

Q1
2024
QCAA
Paper 1
1 mark
Q1
1 mark

Repeated random samples will be used to calculate a large number of 90% confidence intervals for a population mean μ\mu.

Which statement best describes the possible outcomes?

A

Approximately 90% of the intervals will contain μ\mu.

B

More than 90% of the intervals will contain μ\mu.

C

Less than 90% of the intervals will contain μ\mu.

D

Exactly 90% of the intervals will contain μ\mu.

Reveal Answer
A

Approximately 90% of the intervals will contain μ\mu.

Correct Answer

This is the definition of a confidence level; in repeated sampling, the proportion of intervals capturing the true population mean μ\mu will approach the stated confidence level (90%).

B

More than 90% of the intervals will contain μ\mu.

A 90% confidence level indicates that the method is designed to capture the parameter 90% of the time, not more than that.

C

Less than 90% of the intervals will contain μ\mu.

A 90% confidence level indicates that the method is designed to capture the parameter 90% of the time, not less than that.

D

Exactly 90% of the intervals will contain μ\mu.

Due to sampling variability, the actual proportion of intervals capturing μ\mu in a finite number of samples will likely be close to 90%, but rarely exactly 90%.

Q18
2020
QCAA
Paper 2
6 marks
Q18
6 marks

The mass of a certain species of kangaroo is known to be normally distributed with a mean mass of μ\mu kg and standard deviation of σ\sigma kg.
When one of the kangaroos is randomly selected, the probability that its mass is greater than 83.2 kg is 0.145.
When a sample of 12 kangaroos is randomly selected, the probability that the sample mean mass is less than 74.1 kg is 0.079.
A 90% approximate confidence interval for μ\mu is calculated using a random sample of nn of the kangaroos that has a sample mean mass of 79.1 kg and a sample standard deviation equal to σ\sigma.
Determine the possible range of values that nn could have been, given that the confidence interval did not contain μ\mu.

Reveal Answer

Sample 1: n=1n = 1
P(X>83.2)=0.145P(X > 83.2) = 0.145
P(z>83.2μσ)=0.145P\left(z > \frac{83.2-\mu}{\sigma}\right) = 0.145
83.2μσ=1.058\frac{83.2 - \mu}{\sigma} = 1.058
μ=83.21.058σ...(1)\mu = 83.2 - 1.058\sigma \quad ... (1)

Sample 2: n=12n = 12
P(Xˉ<74.1)=0.079P(\bar{X} < 74.1) = 0.079
P(z<74.1μσ12)=0.079P\left(z < \frac{74.1-\mu}{\frac{\sigma}{\sqrt{12}}}\right) = 0.079
74.1μσ12=1.412\frac{74.1 - \mu}{\frac{\sigma}{\sqrt{12}}} = -1.412
μ=74.1+1.412σ12...(2)\mu = 74.1 + \frac{1.412\sigma}{\sqrt{12}} \quad ... (2)

Using graph facility of GDC to solve (1) and (2)
μ=76.63 kg,σ=6.21 kg\mu = 76.63 \text{ kg}, \sigma = 6.21 \text{ kg}

Sample 3: Consider the 90% CI
Since xˉ=79.1,μ=76.63\bar{x} = 79.1, \mu = 76.63 can only lie in an interval below the lower bound of CI.
Determining nn where the lower bound of CI =μ= \mu
xˉzsn=76.63\bar{x} - z\frac{s}{\sqrt{n}} = 76.63
79.11.64×6.21n=76.6379.1 - 1.64 \times \frac{6.21}{\sqrt{n}} = 76.63
Using solve facility of GDC, n17.1n \approx 17.1
As μ\mu must lie in an interval below the lower bound of CI, the range of values is n18n \ge 18 where nZn \in Z.

Marking Criteria
DescriptorMarks

correctly uses the sample of 1 to determine an equation in terms of μ and σ

1

correctly uses the sample of 12 to determine an equation in terms of μ and σ

1

solves simultaneous equations to determine the values of μ and σ

1

determines solution of n

1

evaluates the reasonableness of the solution to the equation to determine suitable integer values of n

1

shows logical organisation communicating key steps

1
Q15
2023
SCSA
Paper 2
9 marks
Q15

The WeLuvYas Bank extends personal loans to approved customers. A random sample of nn personal loans is taken. A 99% confidence interval for the population mean loan μ\mu (in thousands of dollars) based on this sample is 10.2<μ<25.410.2 < \mu < 25.4.

Q15a
2 marks

What is the mean personal loan x\overline{x} for this sample?

Reveal Answer

x=10.2+25.42=17.8\overline{x} = \frac{10.2+25.4}{2} = 17.8

Hence the mean personal loan was $17 800.

Marking Criteria
DescriptorMarks

calculates the midpoint of the confidence interval correctly

1

states the personal loan amount in dollars

1
Q15b
2 marks

Calculate the standard deviation of the sample mean.

Reveal Answer

Half-width w=17.810.2=7.6w = 17.8 - 10.2 = 7.6
7.6=2.5758×σ(X)\therefore 7.6 = 2.5758 \times \sigma(\overline{X})

i.e. σ(X)=2.95\sigma(\overline{X}) = 2.95 (2 d.p.)$
i.e. standard deviation is $2950

Marking Criteria
DescriptorMarks

forms the expression for half-width of interval in terms of the standard deviation

1

calculates the standard deviation correctly

1
Q15c
2 marks

Ali exclaims excitedly 'everyone here at WeLuvYas is 99% certain that the true population mean is within the interval 10.2<μ<25.410.2 < \mu < 25.4'.

State two reasons why Ali is not correct.

Reveal Answer

Ali is not correct. It can be said that:

  1. A single confidence interval either contains μ\mu or it doesn't.
  2. The value of μ\mu is unknown so we do not know if any given CI contains μ\mu.
  3. If we repeatedly take samples of size nn then we will find that approximately 99% of these intervals will contain the true value of μ\mu.
  4. The value of nn may be less than 30, meaning that the distribution of the sample mean may not be distributed, hence the 99% confidence interval may not be valid.
Marking Criteria
DescriptorMarks

states one reason

1

states a second reason

1
Q15d
3 marks

A data analyst discovers that the sample size was actually 2n2n. In addition to this, the sample mean was actually $2000 more than that originally determined.

Re-calculate the 99% confidence interval for the population mean on the basis of the updated information.

Reveal Answer

The confidence interval changes is two ways.

  1. The whole interval is translated upwards by 2.
  2. The standard error σ(X)\sigma(\overline{X}) and consequently the width of the interval is scaled by a factor of 12\frac{1}{\sqrt{2}}:
w=2.5758×s2n=(2.5758×sn)×12=(7.6)×12=5.37\begin{align*} w &= 2.5758 \times \frac{s}{\sqrt{2n}} \\ &= \left(2.5758 \times \frac{s}{\sqrt{n}}\right) \times \frac{1}{\sqrt{2}}\\ &= (7.6) \times \frac{1}{\sqrt{2}} \\ &= 5.37 \end{align*} New confidence interval:17.8+25.37<μ<17.8+2+5.37i.e.14.43<μ<25.17\begin{align*} \text{New confidence interval:}\quad 17.8+2 - 5.37 < &\mu < 17.8+2 + 5.37\\ \text{i.e.}\quad 14.43 < &\mu < 25.17 \end{align*}
Marking Criteria
DescriptorMarks

indicates the midpoint of the confidence interval is increased by 2

1

calculates the new standard deviation correctly

1

calculates the new confidence interval correctly

1

Frequently Asked Questions

How many SCSA Mathematics Specialist questions cover Statistical inference?
AusGrader has 117 SCSA Mathematics Specialist questions on Statistical inference, all with instant AI grading and detailed marking feedback.

Ready to practise SCSA Mathematics Specialist?

Get instant AI feedback on past exam questions, aligned to the syllabus

Start Practising Free