VCAA Mathematical Methods Data analysis, probability and statistics

15 sample questions with marking guides and sample answers · Avg. score: 68.8%

Q3
2022
VCAA
Paper 2
14 marks
Q3

Mika is flipping a coin. The unbiased coin has a probability of 12\frac{1}{2} of landing on heads and 12\frac{1}{2} of landing on tails.

Let XX be the binomial random variable representing the number of times that the coin lands on heads.

Mika flips the coin five times.

Q3b

The height reached by each of Mika's coin flips is given by a continuous random variable, HH, with the probability density function

f(h)={ah2+bh+c1.5h30elsewheref(h) = \begin{cases} ah^2 + bh + c & 1.5 \leq h \leq 3 \\ 0 & \text{elsewhere} \end{cases}

where hh is the vertical height reached by the coin flip, in metres, between the coin and the floor, and aa, bb and cc are real constants.

Q3c

Mika's sister Bella also has a coin. On each flip, Bella's coin has a probability of pp of landing on heads and (1p)(1 - p) of landing on tails, where pp is a constant value between 0 and 1.

Bella flips her coin 25 times in order to estimate pp.

Let P^\hat{P} be the random variable representing the proportion of times that Bella's coin lands on heads in her sample.

Q3a (i)
1 mark

Find Pr(X=5)\Pr(X = 5).

Reveal Answer

132=0.03125\frac{1}{32} = 0.03125

Marking Criteria
DescriptorMarks

Calculates the correct exact probability: 132\frac{1}{32} or 0.031250.03125

1
Q3a (ii)
1 mark

Find Pr(X2)\Pr(X \geq 2).

Reveal Answer

1316=0.8125\frac{13}{16} = 0.8125

Marking Criteria
DescriptorMarks

Calculates the correct exact probability: 1316\frac{13}{16} or 0.81250.8125

1
Q3a (iii)
2 marks

Find Pr(X2X<5)\Pr(X \geq 2 \mid X < 5), correct to three decimal places.

Reveal Answer

Pr(X2X<5)=Pr(2X4)Pr(X4)=0.781250.96875=0.806\text{Pr}(X \ge 2 | X < 5) = \frac{\text{Pr}(2 \le X \le 4)}{\text{Pr}(X \le 4)} = \frac{0.78125}{0.96875} = 0.806

Marking Criteria

Working

DescriptorMarks

Sets up the correct conditional probability expression or substitution, e.g., Pr(2X4)Pr(X4)\frac{\text{Pr}(2 \le X \le 4)}{\text{Pr}(X \le 4)} or 0.781250.96875\frac{0.78125}{0.96875}

1

Answer

DescriptorMarks

Calculates the correct answer to three decimal places: 0.8060.806

1
Q3a (iv)
2 marks

Find the expected value and the standard deviation for XX.

Reveal Answer

E(X)=52=2.5\text{E}(X) = \frac{5}{2} = 2.5, sd(X)=52\text{sd}(X) = \frac{\sqrt{5}}{2}

Marking Criteria

Expected Value

DescriptorMarks

Calculates the correct expected value: E(X)=2.5\text{E}(X) = 2.5 or 52\frac{5}{2}

1

Standard Deviation

DescriptorMarks

Calculates the correct exact standard deviation: sd(X)=52\text{sd}(X) = \frac{\sqrt{5}}{2}

1
Q3b (i)
1 mark

State the value of the definite integral 1.53f(h)dh\int_{1.5}^3 f(h) dh.

Reveal Answer

11

Marking Criteria
DescriptorMarks

States the correct value of the definite integral: 11

1
Q3b (ii)
3 marks

Given that Pr(H2)=0.35\Pr(H \leq 2) = 0.35 and Pr(H2.5)=0.25\Pr(H \geq 2.5) = 0.25, find the values of aa, bb and cc.

Reveal Answer

a=0.8=45a = -0.8 = -\frac{4}{5}, b=3.4=175b = 3.4 = \frac{17}{5}, c=2.783˙=16760c = -2.78\dot{3} = -\frac{167}{60}

Marking Criteria

Working

DescriptorMarks

Sets up the correct system of definite integrals: 1.53f(h)dh=1\int_{1.5}^3 f(h) dh = 1, 1.52f(h)dh=0.35\int_{1.5}^2 f(h) dh = 0.35, and 2.53f(h)dh=0.25\int_{2.5}^3 f(h) dh = 0.25

1

Demonstrates a correct method to evaluate the integrals and solve the system of equations for the constants

1

Answer

DescriptorMarks

Calculates all three correct exact values: a=0.8a = -0.8 (or 45-\frac{4}{5}), b=3.4b = 3.4 (or 175\frac{17}{5}), and c=16760c = -\frac{167}{60}

1
Q3b (iii)
1 mark

The ceiling of Mika's room is 3 m above the floor. The minimum distance between the coin and the ceiling is a continuous random variable, DD, with probability density function gg.

The function gg is a transformation of the function ff given by g(d)=f(rd+s)g(d) = f(rd + s),
where dd is the minimum distance between the coin and the ceiling, and rr and ss are real constants.

Find the values of rr and ss.

Reveal Answer

r=1,s=3r = -1, s = 3

Marking Criteria
DescriptorMarks

Finds both correct values: r=1r = -1 and s=3s = 3

1
Q3c (i)
1 mark

Is the random variable P^\hat{P} discrete or continuous? Justify your answer.

Reveal Answer

Discrete, countable

Marking Criteria
DescriptorMarks

Identifies the random variable as discrete and provides a valid justification (e.g., the number of heads is countable)

1
Q3c (ii)
1 mark

If p^=0.4\hat{p} = 0.4, find an approximate 95% confidence interval for pp, correct to three decimal places.

Reveal Answer

(0.208,0.592)(0.208, 0.592)

Marking Criteria
DescriptorMarks

Calculates the correct 95% confidence interval, correct to three decimal places: (0.208,0.592)(0.208, 0.592)

1
Q3c (iii)
1 mark

Bella knows that she can decrease the width of a 95% confidence interval by using a larger sample of coin flips.

If p^=0.4\hat{p} = 0.4, how many coin flips would be required to halve the width of the confidence interval found in part c.ii.?

Reveal Answer

100100

Marking Criteria
DescriptorMarks

Calculates the correct number of coin flips required: 100100

1
Q17
2025
QCAA
Paper 1
6 marks
Q17
6 marks

In a computer fishing game, a player repeatedly casts their hook into either a blue pond or a red pond. Each fish they catch scores points.

In the blue pond, the probability of catching a fish on a cast is 23\frac{2}{3} and each fish caught scores 10 points.

In the red pond, the probability of catching a fish on a cast is 13\frac{1}{3} and each fish caught scores 15 points.

A player has three casts left and needs to score at least 30 points to win. All remaining casts must be in the same pond.

It is claimed that the probability of winning if casting in the blue pond is 127\frac{1}{27} more than the probability of winning if casting in the red pond. Evaluate the reasonableness of the claim.

Reveal Answer

Let XX be a binomial random variable representing the number of casts that caught a fish.
pp corresponds to a probability of a catch (success).

Blue pond:
XB(3,23)X \sim B\left(3, \frac{2}{3}\right)
Red pond:
XB(3,13)X \sim B\left(3, \frac{1}{3}\right)

In the blue pond, 30 points will require three successes in three casts:

P(X=3)=23×23×23=(23)3=827P(X = 3) = \frac{2}{3} \times \frac{2}{3} \times \frac{2}{3} = \left(\frac{2}{3}\right)^3 = \frac{8}{27}

In the red pond, 30 points will require at least two successes in three casts:
i.e. X2X \ge 2

P(X2)=P(X=2)+P(X=3)=(32)p2q1+(33)p3q0=3(13)2(23)+1(13)3(23)0=627+127=727\begin{align*} P(X \ge 2)&= P(X = 2) + P(X = 3)\\ &= \binom{3}{2} p^2 q^1 + \binom{3}{3} p^3 q^0\\ &= 3\left(\frac{1}{3}\right)^2\left(\frac{2}{3}\right) + 1\left(\frac{1}{3}\right)^3\left(\frac{2}{3}\right)^0\\ &= \frac{6}{27} + \frac{1}{27}\\ &= \frac{7}{27} \end{align*}

In the blue pond, the required probability is 827\frac{8}{27}
In red pond, the required probability is 727\frac{7}{27}

827727=127\frac{8}{27} - \frac{7}{27} = \frac{1}{27}

The claim is correct. Probability of winning if casting in the blue pond is 127\frac{1}{27} more than the probability of winning using the red pond.

Marking Criteria
DescriptorMarks

correctly identifies a method to determine at least 30 points in three casts for the blue pond

1

determines the probability of scoring at least 30 points in the blue pond

1

correctly identifies a method to determine at least 30 points in three casts for the red pond

1

substitutes appropriate values into the chosen method for the red pond

1

determines the probability of scoring at least 30 points in the red pond

1

evaluates the claim

1
Q10
2021
QCAA
Paper 1
1 mark
Q10
1 mark

Handspans of teenagers are approximately normally distributed, with a mean of 15 cm and a standard deviation of 2 cm.

Which of the following groups is expected to be the largest?

A

teenagers with handspans that are between 7 cm and 11 cm

B

teenagers with handspans that are between 11 cm and 15 cm

C

teenagers with handspans that are between 13 cm and 17 cm

D

teenagers with handspans that are between 17 cm and 21 cm

Reveal Answer
A

teenagers with handspans that are between 7 cm and 11 cm

This range falls between 2 and 4 standard deviations below the mean (z=4z = -4 to z=2z = -2). This represents the far left tail of the curve, which contains a negligible percentage of the population.

B

teenagers with handspans that are between 11 cm and 15 cm

This range covers the area from 2 standard deviations below the mean up to the mean (z=2z = -2 to z=0z = 0). While substantial, it captures less area than an interval of the same width centered directly on the peak of the distribution.

C

teenagers with handspans that are between 13 cm and 17 cm

Correct Answer

This range corresponds to exactly one standard deviation below and above the mean (15±215 \pm 2 cm). Because the normal distribution is symmetric and peaks at the mean, the interval centered on the mean contains the largest proportion of data (approximately 68%).

D

teenagers with handspans that are between 17 cm and 21 cm

This range falls between 1 and 3 standard deviations above the mean (z=1z = 1 to z=3z = 3). This represents the tapering right tail of the distribution, which contains significantly fewer teenagers than the central range.

Q20
2020
QCAA
Paper 2
5 marks
Q20
5 marks

Assuming the approximate normality of sample proportions (p^1\hat{p}_1 and p^2\hat{p}_2) and based on two independent samples, the approximate confidence interval for the difference of two proportions is given by

(p^1p^2zp^1(1p^1)n1+p^2(1p^2)n2,p^1p^2+zp^1(1p^1)n1+p^2(1p^2)n2)\left( \hat{p}_1 - \hat{p}_2 - z \sqrt{\frac{\hat{p}_1 (1 - \hat{p}_1)}{n_1} + \frac{\hat{p}_2 (1 - \hat{p}_2)}{n_2}} , \hat{p}_1 - \hat{p}_2 + z \sqrt{\frac{\hat{p}_1 (1 - \hat{p}_1)}{n_1} + \frac{\hat{p}_2 (1 - \hat{p}_2)}{n_2}} \right)

If the approximate confidence interval for the difference between two proportions does not contain 0, this provides evidence that the two proportions are not equal.

The data in the table shows the observed frequencies of two drink preferences for independent samples of people who live in Town A and Town B.

TownTeaCoffeeTotal
A111105216
B150107257

Using the approximate 99% confidence interval for the difference of two proportions, determine if there is evidence to conclude that drink preference is associated with the town where the person lives.

Reveal Answer

p1=p_1 = proportion of Town A who prefer to drink tea
p2=p_2 = proportion of Town B who prefer to drink tea

The sample proportions are:
p^1=111216\hat{p}_1 = \frac{111}{216}
p^2=150257\hat{p}_2 = \frac{150}{257}

Using the 99% confidence interval for the difference of two proportions
(1112161502572.576111216(1111216)216+150257(1150257)257,111216150257+2.576111216(1111216)216+150257(1150257)257)\left(\frac{111}{216} - \frac{150}{257} - 2.576\sqrt{\frac{\frac{111}{216}(1-\frac{111}{216})}{216} + \frac{\frac{150}{257}(1-\frac{150}{257})}{257}}, \frac{111}{216} - \frac{150}{257} + 2.576\sqrt{\frac{\frac{111}{216}(1-\frac{111}{216})}{216} + \frac{\frac{150}{257}(1-\frac{150}{257})}{257}}\right)

=(0.188,0.048)= (-0.188, 0.048)

This interval contains zero; therefore, there is no evidence in the data to say that the two proportions are different, i.e. preference to drink tea does not depend on where the person lives.

Marking Criteria
DescriptorMarks

correctly determines the sample proportions

1

establishes confidence interval for the difference of two proportions

1

determines 99% confidence interval

1

interprets 99% confidence interval to determine equality of proportions

1

shows logical organisation communicating key steps

1
Q19
2024
QCAA
Paper 2
6 marks
Q19
6 marks

The normal distribution probability density function is

p(x)=1σ2πe12(xμσ)2p(x) = \frac{1}{\sigma\sqrt{2\pi}}e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}

, with the parameters mean, μ\mu, and standard deviation, σ\sigma.

The speeds of electric scooter (e-scooter) riders on a particular section of a bike path are approximately normally distributed with a mean of 18 km/h. It is known that p(10)=0.0135p(10) = 0.0135.

The speed limit for e-scooters on this section of bike path is 23 km/h.

A speed camera is set up and records the speeds of 75 e-scooter riders. Every rider travelling faster than the speed limit is given a $143 fine. Before setting up the speed camera, the following suggestion was made.

The total of the fines expected to be issued will be more than $1500.

Evaluate the reasonableness of this suggestion.

Reveal Answer

Given that p(10)=0.0135p(10)=0.0135

0.0135=1σ2πe12(1018σ)20.0135=\dfrac{1}{\sigma\sqrt{2\pi}}e^{-\frac{1}{2}\left(\frac{10-18}{\sigma}\right)^2}

0.0135=1σ2πe12(8σ)20.0135=\dfrac{1}{\sigma\sqrt{2\pi}}e^{-\frac{1}{2}\left(\frac{-8}{\sigma}\right)^2}

Using a GDC:

Two solutions are obtained for the std deviation

σ=4.0002\sigma=4.0002 and 28.402028.4020

Reject 28.4020 as it is not a possible standard deviation
because for example three standard deviations less than
the mean would produce a negative speed or three above
would result in an impossible speed on an e-scooter
(>100 km/h>100\ \text{km/h}).

Use a GDC to determine P(X23)P(X\ge 23) with N(18,4.00)N\sim(18,4.00),

P(X23)=0.10566P(X\ge 23)=0.10566

The number of riders is:

0.10566×75=7.92450.10566\times 75=7.9245

The total fines obtained:

7.9245×143=1133.207.9245\times 143=1133.20

The expected total fines is about $1133, which is less
than the suggested $1500, so the suggestion is not
reasonable.

Marking Criteria
DescriptorMarks

Correctly substitutes the known information, x=10x=10 and p(10)=0.0135p(10)=0.0135, into the given normal distribution formula

1

Determines a possible value for the standard deviation

1

Determines the proportion of riders above 23 km/h

1

Determines the number of riders above 23 km/h

1

Determines the expected total fines

1

Provides appropriate statement of reasonableness

1
Q8
2023
VCAA
Paper 1
6 marks
Q8

Suppose that the queuing time, TT (in minutes), at a customer service desk has a probability density function given by

f(t)={kt(16t2)0t40elsewheref(t) = \begin{cases} kt(16 - t^2) & 0 \leq t \leq 4 \\ 0 & \text{elsewhere} \end{cases}

for some kRk \in R.

Q8a
1 mark

Show that k=164k = \frac{1}{64}.

Reveal Answer

04kt(16t2)dt=1k[8t2t44]04=1k(8×1616×164)=164k=1\begin{align*} \int_0^4 kt(16 - t^2) dt &= 1\\ k \left[ 8t^2 - \frac{t^4}{4} \right]_0^4 &= 1\\ k \left( 8 \times 16 - \frac{16 \times 16}{4} \right) &= 1\\ 64k &= 1 \end{align*}

k=164\therefore k = \frac{1}{64}

Marking Criteria
DescriptorMarks

Correctly forms an integral equation equal to 1, antidifferentiates, and solves to show k=164k = \frac{1}{64}

1
Q8b
2 marks

Find E(T)\text{E}(T).

Reveal Answer

E(T)=16404(16t2t4)dtE(T) = \frac{1}{64} \int_0^4 \left(16t^2 - t^4\right) dt

=164[16t33t55]04= \frac{1}{64} \left[ \frac{16t^3}{3} - \frac{t^5}{5} \right]_0^4

=164(10243102450)= \frac{1}{64} \left( \frac{1024}{3} - \frac{1024}{5} - 0 \right)

=164×204815= \frac{1}{64} \times \frac{2048}{15}

=6430=3215=2215= \frac{64}{30} = \frac{32}{15} = 2\frac{2}{15}

Marking Criteria
DescriptorMarks

Sets up the correct integral for the expected value, e.g., E(T)=16404(16t2t4)dt\text{E}(T) = \frac{1}{64} \int_0^4 \left(16t^2 - t^4\right) dt

1

Evaluates the integral correctly to find the final answer of 3215\frac{32}{15} or 22152\frac{2}{15}

1
Q8c
3 marks

What is the probability that a person has to queue for more than two minutes, given that they have already queued for one minute?

Reveal Answer

Pr(2<T<4T>1)=Pr(2<T<4)Pr(T>1)=16424(16tt3)dt16414(16tt3)dt=24(16tt3)dt14(16tt3)dt=[8t2t44]24[8t2t44]14=(64(324))(64(814))=36(2254)=144225=1625=0.64\begin{align*} \text{Pr}(2 < T < 4 | T > 1) &= \frac{\text{Pr}(2 < T < 4)}{\text{Pr}(T > 1)}\\ &= \frac{\frac{1}{64} \int_2^4 (16t - t^3) dt}{\frac{1}{64} \int_1^4 (16t - t^3) dt} = \frac{\int_2^4 (16t - t^3) dt}{\int_1^4 (16t - t^3) dt}\\ &= \frac{\left[ 8t^2 - \frac{t^4}{4} \right]_2^4}{\left[ 8t^2 - \frac{t^4}{4} \right]_1^4}\\ &= \frac{(64 - (32 - 4))}{\left( 64 - \left( 8 - \frac{1}{4} \right) \right)}\\ &= \frac{36}{\left( \frac{225}{4} \right)}\\ &= \frac{144}{225} = \frac{16}{25} = 0.64 \end{align*}
Marking Criteria
DescriptorMarks

Correctly formulates the conditional probability expression, such as Pr(2<T<4)Pr(T>1)\frac{\text{Pr}(2 < T < 4)}{\text{Pr}(T > 1)}

1

Correctly substitutes the integrals into the numerator and denominator as a single fraction

1

Correctly evaluates the expression to find the final probability of 1625\frac{16}{25} or 0.640.64

1
Q3
2025
QCAA
Paper 1
1 mark
Q3
1 mark

A council wants to survey residents about a new dog park. Which sampling method would best minimise bias in the survey?

A

Questioning every third resident entering a supermarket near an existing dog park.

B

Collecting responses from residents who clicked a survey link on the website.

C

Asking residents visiting a dog park on a randomly selected day.

D

Selecting residents using a random number generator.

Reveal Answer
A

Questioning every third resident entering a supermarket near an existing dog park.

Surveying near an existing dog park introduces location bias, as people in that area might have stronger opinions about dog parks than the general population.

B

Collecting responses from residents who clicked a survey link on the website.

This relies on voluntary response sampling, which introduces self-selection bias because only residents with strong opinions are likely to participate.

C

Asking residents visiting a dog park on a randomly selected day.

Surveying at a dog park introduces selection bias by overrepresenting dog owners and excluding residents who do not currently use dog parks.

D

Selecting residents using a random number generator.

Correct Answer

Using a random number generator creates a simple random sample, giving every resident an equal chance of being chosen and effectively minimizing bias.

Q9
2024
SCSA
Paper 2
8 marks
Q9

A mathematics teacher uses a coin flip activity to demonstrate confidence intervals to their class. They flip a fair coin 50 times in front of the class and observe 30 heads and 20 tails.

Q9a
2 marks

Calculate a 90% confidence interval for the proportion of heads obtained when the coin is flipped.

Reveal Answer

The sample proportion of heads is given by

p^=3050=0.6\hat{p} = \frac{30}{50} = 0.6

Hence, the 90% confidence interval is

0.61.6450.6×0.450p0.6+1.6450.6×0.4500.6 - 1.645\sqrt{\frac{0.6 \times 0.4}{50}} \le p \le 0.6 + 1.645\sqrt{\frac{0.6 \times 0.4}{50}}

0.4860p0.71400.4860 \le p \le 0.7140

Marking Criteria
DescriptorMarks

correctly calculates sample proportion of heads

1

correctly calculates confidence interval

1
Q9b
2 marks

As a homework exercise, the teacher asks all 20 students in the class to repeat the coin activity and calculate their own individual 90% confidence interval for the proportion of heads. Let XX be a random variable that denotes the number of students whose confidence interval contains the true proportion of heads.

State the distribution for XX.

Reveal Answer

XBin(20,0.9)X \sim \text{Bin}(20, 0.9)

Marking Criteria
DescriptorMarks

states that the distribution is binomial

1

states correct distribution parameters

1
Q9c
2 marks

Determine the expected value and variance of XX.

Reveal Answer

The expected value of XX is given by

E(X)=20×0.9E(X) = 20 \times 0.9

=18= 18

The variance of XX is given by

Var(X)=20×0.9×0.1Var(X) = 20 \times 0.9 \times 0.1

=1.8= 1.8

Marking Criteria
DescriptorMarks

correctly calculates expected value

1

correctly calculates variance

1
Q9d
2 marks

Calculate the probability that the confidence intervals of three students do not contain the true proportion.

Reveal Answer

If three confidence intervals did not contain the true proportion, then 17 did contain the true proportion.

P(X=17)=0.1901\text{P}(X = 17) = 0.1901

Marking Criteria
DescriptorMarks

identifies that they are considering 17 confidence intervals containing the true proportion or defines the distribution for the complementary event

1

calculates the correct probability

1
Q14
2024
VCAA
Paper 2
1 mark
Q14
1 mark

Let hh be the probability density function for a continuous random variable XX, where

h(x)={x6+k3x<0x2+k0x10elsewhereh(x) = \begin{cases} \frac{x}{6} + k & -3 \le x < 0 \\ -\frac{x}{2} + k & 0 \le x \le 1 \\ 0 & \text{elsewhere} \end{cases}

and kk is a positive real number.

The value of Pr(X<0.5)\Pr(X < 0.5) is

A

12\frac{1}{2}

B

1516\frac{15}{16}

C

316\frac{3}{16}

D

4948\frac{49}{48}

Reveal Answer
A

12\frac{1}{2}

This is the value of the constant kk, not the probability Pr(X<0.5)\Pr(X < 0.5). The constant k=12k = \frac{1}{2} is found by setting the total area under the PDF to 1.

B

1516\frac{15}{16}

Correct Answer

First, find k=12k = \frac{1}{2} by setting the integral of h(x)h(x) from 3-3 to 11 equal to 11. Then, calculate Pr(X<0.5)\Pr(X < 0.5) by evaluating 30.5h(x)dx=30(x6+12)dx+00.5(x2+12)dx=34+316=1516\int_{-3}^{0.5} h(x) dx = \int_{-3}^{0} (\frac{x}{6} + \frac{1}{2}) dx + \int_{0}^{0.5} (-\frac{x}{2} + \frac{1}{2}) dx = \frac{3}{4} + \frac{3}{16} = \frac{15}{16}.

C

316\frac{3}{16}

This is only the probability Pr(0X<0.5)\Pr(0 \le X < 0.5). It incorrectly omits the probability Pr(3X<0)=34\Pr(-3 \le X < 0) = \frac{3}{4} which must be included since the lower bound of the domain is 3-3.

D

4948\frac{49}{48}

This is an incorrect calculation. A probability value cannot exceed 1, so 4948\frac{49}{48} is impossible for any valid probability density function.

Q6
2023
VCAA
Paper 1
4 marks
Q6

Let P^\hat{P} be the random variable that represents the sample proportion of households in a given suburb that have solar panels installed.

From a sample of randomly selected households in a given suburb, an approximate 95% confidence interval for the proportion pp of households having solar panels installed was determined to be (0.04,0.16)(0.04, 0.16).

Q6b

Use z=2z = 2 to approximate the 95% confidence interval.

Q6a
1 mark

Find the value of p^\hat{p} that was used to obtain this approximate 95% confidence interval.

Reveal Answer

(0.04+0.16)÷2=0.1(0.04 + 0.16) \div 2 = 0.1

p^=0.1\hat{p} = 0.1

Marking Criteria
DescriptorMarks

Calculates the correct value of p^\hat{p}

1
Q6b
2 marks

Find the size of the sample from which this 95% confidence interval was obtained.

Reveal Answer

0.06=2×0.1×0.9n0.06 = 2 \times \sqrt{\frac{0.1 \times 0.9}{n}}

0.03=0.1×0.9n0.03 = \sqrt{\frac{0.1 \times 0.9}{n}}

(0.03)2=0.1×0.9n(0.03)^2 = \frac{0.1 \times 0.9}{n}

(0.03)2=9100n(0.03)^2 = \frac{9}{100n}

n=100n = 100

Marking Criteria
DescriptorMarks

Sets up a correct equation involving nn using the margin of error or confidence interval bounds

1

Calculates the correct sample size, n=100n = 100

1
Q6c
1 mark

A larger sample of households is selected, with a sample size four times the original sample.

The sample proportion of households having solar panels installed is found to be the same.

By what factor will the increased sample size affect the width of the confidence interval?

Reveal Answer

Confidence interval width is halved (reduced or decreased by a factor of 2; altered by a factor of 12\frac{1}{2}).

Marking Criteria
DescriptorMarks

States the correct factor by which the width is affected (e.g., 12\frac{1}{2}, halved, or decreased by a factor of 2)

1
Q4
2024
SCSA
Paper 1
6 marks
Q4a
3 marks

The uniformly distributed continuous random variable XX has an expected value of 6 and a maximum value of 9. Determine the variance of XX.

Reveal Answer

The expected value of a uniformly distributed continuous random variable XX is midway between the maximum and minimum values, so the probability density function is

f(x)={16,3x90,otherwise f(x)=\begin{cases} \dfrac{1}{6}, & 3\le x\le 9 \\ 0, & \text{otherwise} \end{cases}

The variance of XX is given by

Var(X)=39(x6)2f(x)dx=1639(x6)2dx=16[(x6)33]39=16(9(9))=3\begin{aligned} \operatorname{Var}(X)&=\int_{3}^{9} (x-6)^2 f(x)\,dx\\ &=\dfrac{1}{6}\int_{3}^{9} (x-6)^2\,dx\\ &=\dfrac{1}{6}\left[\dfrac{(x-6)^3}{3}\right]_{3}^{9}\\ &=\dfrac{1}{6}\left(9-(-9)\right)\\ &=3 \end{aligned}

Marking Criteria
DescriptorMarks

determines the correct value and domain of the probability density function

1

writes a correct integral expression for the variance

1

correctly calculates the variance

1
Q4b
3 marks

The binomially distributed discrete random variable WW has a mean of 12\frac{1}{2} and a variance of 512\frac{5}{12}. Evaluate P(W=1)\text{P}(W = 1).

Reveal Answer

From the question np=12np=\dfrac{1}{2} and np(1p)=512np(1-p)=\dfrac{5}{12}. It follows that

12(1p)=5121p=56p=16\begin{aligned} \dfrac{1}{2}(1-p)&=\dfrac{5}{12}\\ \Rightarrow 1-p&=\dfrac{5}{6}\\ \Rightarrow p&=\dfrac{1}{6} \end{aligned}

And

n6=12n=3\begin{aligned} \dfrac{n}{6}&=\dfrac{1}{2}\\ \Rightarrow n&=3 \end{aligned}

Hence,

P(W=1)=(31)(16)1(56)2=3×16×2536=2572\begin{aligned} P(W=1)&=\binom{3}{1}\left(\dfrac{1}{6}\right)^1\left(\dfrac{5}{6}\right)^2\\ &=3\times\dfrac{1}{6}\times\dfrac{25}{36}\\ &=\dfrac{25}{72} \end{aligned}

Marking Criteria
DescriptorMarks

correctly states two equations relating nn and pp

1

correctly solves for nn and pp

1

correctly calculates the probability

1
Q18
2022
QCAA
Paper 1
4 marks
Q18
4 marks

A percentile is a measure in statistics showing the value below which a given percentage of observations occur.

The continuous random variable XX has the probability density function
f(x)={2x2,1x20,otherwisef(x) = \begin{cases} 2x-2, & 1 \le x \le 2 \\ 0 , & \text{otherwise} \end{cases}

Determine the 36th percentile of XX.

Reveal Answer

1a2x2dx=0.36\int_1^a 2x - 2 dx = 0.36
x22x1a=0.36x^2 - 2x \Big|_1^a = 0.36
(a22a)(12)=0.36(a^2 - 2a) - (1 - 2) = 0.36
a22a+1=0.36a^2 - 2a + 1 = 0.36
a22a+0.64=0a^2 - 2a + 0.64 = 0
a=2±44×1×0.642\therefore a = \frac{2\pm\sqrt{4-4\times1\times0.64}}{2}
a=2±1.442\therefore a = \frac{2 \pm \sqrt{1.44}}{2}
a=2±1.22\therefore a = \frac{2 \pm 1.2}{2}
a=1.6\therefore a = 1.6 or 0.40.4
Given 1x21 \le x \le 2
a=1.6\therefore a = 1.6

Marking Criteria
DescriptorMarks

correctly determines the definite integral

1

determines the quadratic equation

1

determines values of a

1

evaluates the reasonableness of solutions

1
Q15
2024
QCAA
Paper 1
4 marks
Q15

A survey was conducted to understand whether people support a new policy.
Using a z-score of 2, the approximate confidence interval for the population proportion of people who support the policy was calculated as (310,710)\left(\frac{3}{10}, \frac{7}{10}\right).

Q15a
1 mark

Determine the margin of error.

Reveal Answer

The confidence interval corresponds to (p^E,p^+E)(\hat{p}-E, \hat{p}+E), where E is the margin of error about p^\hat{p}.
p^+E=710\hat{p} + E = \frac{7}{10}
p^E=310\hat{p} - E = \frac{3}{10}
subtracting:
2E=710310=410\therefore 2E = \frac{7}{10} - \frac{3}{10} = \frac{4}{10}
E=210=15\therefore E = \frac{2}{10} = \frac{1}{5}

Marking Criteria
DescriptorMarks

Correctly determines the margin of error

1
Q15b
3 marks

Determine the number of people surveyed.

Reveal Answer

p^+E=710\hat{p} + E = \frac{7}{10}
p^+210=710\hat{p} + \frac{2}{10} = \frac{7}{10}
p^=510=12\hat{p} = \frac{5}{10} = \frac{1}{2}
Upper CI value = p^+zp^(1p^)n\hat{p} + z\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}
710=12+212(112)n\frac{7}{10} = \frac{1}{2} + 2\sqrt{\frac{\frac{1}{2}\left(1-\frac{1}{2}\right)}{n}}
15=212(112)n\frac{1}{5} = 2\sqrt{\frac{\frac{1}{2}\left(1-\frac{1}{2}\right)}{n}}
15=214n\frac{1}{5} = 2\sqrt{\frac{\frac{1}{4}}{n}}
15=214n\frac{1}{5} = 2\frac{\sqrt{\frac{1}{4}}}{\sqrt{n}}
15=212n\frac{1}{5} = 2\frac{\frac{1}{2}}{\sqrt{n}}
15=1n\frac{1}{5} = \frac{1}{\sqrt{n}}
110=14n\frac{1}{10} = \sqrt{\frac{1}{4n}}
1100=14n\frac{1}{100} = \frac{1}{4n}
100=4n100 = 4n
n=25n = 25
25 people were surveyed.

Marking Criteria
DescriptorMarks

Determines the value of the sample proportion p^\hat{p}

1

Substitutes p^\hat{p} and z-score into the confidence interval formula

1

Determines the number of people surveyed

1
Q5
2020
VCAA
Paper 1
4 marks
Q5

For a certain population the probability of a person being born with the specific gene SPGE1 is 35\frac{3}{5}.

The probability of a person having this gene is independent of any other person in the population having this gene.

Q5a
2 marks

In a randomly selected group of four people, what is the probability that three or more people have the SPGE1 gene?

Reveal Answer

XBi(4,35)X \sim \text{Bi}\left(4, \frac{3}{5}\right)

Pr(X3)\Pr(X \geq 3)

=Pr(X=3)+Pr(X=4)= \Pr(X = 3) + \Pr(X = 4)

=(43)(35)3(25)+(35)4= \binom{4}{3}\left(\frac{3}{5}\right)^3\left(\frac{2}{5}\right) + \left(\frac{3}{5}\right)^4

=297625= \frac{297}{625}

Marking Criteria
DescriptorMarks

Formulates the correct probability expression for Pr(X3)\Pr(X \geq 3) by summing the probabilities of exactly 3 and exactly 4 successes using the binomial distribution

1

Calculates the correct final probability of 297625\frac{297}{625}

1
Q5b
2 marks

In a randomly selected group of four people, what is the probability that exactly two people have the SPGE1 gene, given that at least one of those people has the SPGE1 gene? Express your answer in the form a3b4c4\frac{a^3}{b^4 - c^4}, where a,b,cZ+a, b, c \in Z^+.

Reveal Answer

Pr(X=2X1)=Pr(X=2)Pr(X1)Pr(X1)\Pr(X = 2 | X \geq 1) = \frac{\Pr(X = 2) \cap \Pr(X \geq 1)}{\Pr(X \geq 1)}

=Pr(X=2)1Pr(X=0)= \frac{\Pr(X = 2)}{1 - \Pr(X = 0)}

=6×(35)2×(25)2(55)4(25)4= \frac{6 \times \left(\frac{3}{5}\right)^2 \times \left(\frac{2}{5}\right)^2}{\left(\frac{5}{5}\right)^4 - \left(\frac{2}{5}\right)^4}

=635424= \frac{6^3}{5^4 - 2^4}

Marking Criteria
DescriptorMarks

Evaluates Pr(X=2)\Pr(X = 2) and Pr(X1)\Pr(X \geq 1) or correctly substitutes into the conditional probability formula

1

Provides the correct final answer in the required form, 635424\frac{6^3}{5^4 - 2^4}

1
Q11
2023
SCSA
Paper 2
13 marks
Q11

Mrs Euler is having her car serviced at BIMDAS Mechanics. She drops her vehicle off at 8 am and is told that her car will be ready for collection at some time between 1 pm and 5 pm that day.

Let the random variable BB denote the time after noon (12 pm) at which a vehicle is ready for collection at BIMDAS Mechanics. The probability density function for BB is shown in the graph below.

The probability of a vehicle being ready for collection between 2 pm and 3 pm is 0.1.

Q11d

Mr Euler is also having his car serviced, but by Addition Autos. He drops his vehicle off at 8 am and is told that his car will be ready for collection at some time between 1 pm and 5 pm that day.

Let the random variable AA denote the time after noon (12 pm) that a vehicle is ready for collection at Addition Autos. The cumulative distribution function for AA is given by

P(Aa)={0,a<110aa2916,1a51,a>5P(A \le a) = \begin{cases} 0, & a < 1 \\ \frac{10a - a^2 - 9}{16}, & 1 \le a \le 5 \\ 1, & a > 5 \end{cases}

Q11a
2 marks

Determine the value of kk.

Reveal Answer

The area under the curve must be equal to 1. Hence
4×0.1+0.5×2×(k0.1)=14 \times 0.1 + 0.5 \times 2 \times (k - 0.1) = 1
k+0.3=1\Rightarrow k + 0.3 = 1
k=0.7\Rightarrow k = 0.7

Marking Criteria
DescriptorMarks

states that the area under the curve must equal 1

1

obtains correct value of kk

1
Q11b
2 marks

An incomplete expression for the probability density function of BB is given below. Fill in the boxes to complete the missing parts of the expression.

f(b)={0.1,[box][box],3b50,otherwisef(b) = \begin{cases} 0.1, & \text{[box]} \\ \text{[box]}, & 3 \le b \le 5 \\ 0, & \text{otherwise} \end{cases}

Reveal Answer

The probability density function for BB is given by
f(b)={0.1,1b<30.3b0.8,3b50,otherwisef(b) = \begin{cases} 0.1, & \bold{1 \le b < 3} \\ \bold{0.3b - 0.8}, & 3 \le b \le 5 \\ 0, & \text{otherwise} \end{cases}

Marking Criteria
DescriptorMarks

correctly completes the interval

1

correctly completes the linear function

1
Q11c
3 marks

Determine the expected time that Mrs Euler's vehicle will be ready for collection at BIMDAS Mechanics.

Reveal Answer

E(B)=130.1b db+35b(0.3b0.8) dbE(B) = \int_{1}^{3} 0.1b \ db + \int_{3}^{5} b(0.3b - 0.8) \ db
=3.8= 3.8

Therefore, the expected pickup time is 3:48 pm.

Marking Criteria
DescriptorMarks

states a correct integral expression for the expected value of BB

1

determines the correct expected value of BB

1

states the expected value as a time

1
Q11d (i)
1 mark

Determine the probability that Mr Euler's vehicle will be ready to collect

by 3 pm.

Reveal Answer

P(A3)=10(3)32916P(A \le 3) = \frac{10(3) - 3^2 - 9}{16}
=0.75= 0.75

Marking Criteria
DescriptorMarks

calculates correct probability

1
Q11d (ii)
2 marks

between 3 pm and 4 pm.

Reveal Answer

P(3A4)=P(A4)P(A3)P(3 \le A \le 4) = P(A \le 4) - P(A \le 3)
=10(4)4291610(3)32916= \frac{10(4) - 4^2 - 9}{16} - \frac{10(3) - 3^2 - 9}{16}
=15161216= \frac{15}{16} - \frac{12}{16}
=316= \frac{3}{16}
=0.1875= 0.1875

Marking Criteria
DescriptorMarks

expresses the probability as the difference P(A4)P(A3)P(A \le 4) - P(A \le 3)

1

calculates correct probability

1
Q11e
3 marks

Determine the expected time at which Mr Euler's vehicle will be ready for collection at Addition Autos.

Reveal Answer

The probability density function is given by
p(a)=dda(10aa2916)p(a) = \frac{d}{da}\left( \frac{10a - a^2 - 9}{16} \right)
=58a8= \frac{5}{8} - \frac{a}{8}

for 1a51 \le a \le 5 (0 otherwise). Hence the expected value is given by
E(A)=15a(58a8)daE(A) = \int_{1}^{5} a\left( \frac{5}{8} - \frac{a}{8} \right) da
=73(=213)= \frac{7}{3} \left( = 2\frac{1}{3} \right)

Therefore, the expected pickup time is 2:20 pm.

Marking Criteria
DescriptorMarks

determines correct expression for the probability density function for 1a51 \le a \le 5

1

determines the correct expected value for AA

1

states the expected value as a time

1

Frequently Asked Questions

How many VCAA Mathematical Methods questions cover Data analysis, probability and statistics?
AusGrader has 406 VCAA Mathematical Methods questions on Data analysis, probability and statistics, all with instant AI grading and detailed marking feedback.

Ready to practise VCAA Mathematical Methods?

Get instant AI feedback on past exam questions, aligned to the syllabus

Start Practising Free