16.3.60 one-way ANOVA test

Researchers analyzed IQ data on preterm children at age 7-8 years. The mothers of the children in the study had chosen whether to provide their infants with breast milk within 72 hours of delivery. The researchers used the following designations. Group I: mothers declined to provide breast milk; Group IIa: mothers had chosen but were unable to provide breast milk; and Group IIb: mothers had chosen and were able to provide breast milk. The summary statistics on IQ are at the right. At the 1% significance level, do the data provide sufficient evidence to conclude that a difference exists in mean IQ at age 7-8 years for preterm children among the three groups? Critical values are given at the right.

Notation in one-way ANOVA:

k = number of populations
n = total number of observations
\(\bar x\) = mean of all n observations
\(n_j\) = size of sample from Population j
\(\bar{x_j}\) = mean of sample from Population j
\(s_j^2\) = variance of sample from Population j
\(T_j\) = sum of sample data from Population j

Defining formulas from sums of squares in one-way ANOVA:

SST = \(\sum (x_i - \bar x)^2\)
SSTR = \(\sum n_j(\bar{x_j} - \bar{x})^2\)
SSE = \(\sum (n_j-1)s_j^2\)

One-way ANOVA identity: SST = SSTR + SSE

Computing formulas from sums of squares in one-way ANOVA:

SST = \(\sum x_i^2 - (\sum x_i)^2/n\)
SSTR = \(\sum (T_j^2/n_j) - (\sum x_i)^2/n\)
SSE = SST - SSTR

The way they define \(\sum (T_j^2/n_j)\) is different from the one for x

Mean squares in one-way ANOVA:

MSTR = \(\frac{SSTR}{k-1}\)
MSE = \(\frac{SSE}{n-k}\)
SSE = SST - SSTR

Test statistic for one-way ANOVAA (independent samples, normal populations, and equal population standard deviations):

F = \(\frac{MSTR}{MSE}\)

with df = (k - 1, n - k)

Confidence interval for \(\mu_i - \mu_j\) in the Tukey multiple-comparison method (independent samples, normal populations, and equal population sstandard deviations):

\((\bar{x_i} - \bar{x_j}) \pm \frac{q_{\alpha}}{\sqrt{2}}.s\sqrt{\frac{1}{n_i} + \frac{1}{n_j}}\)

where s = \(\sqrt{MSE}\) and \(q_{\alpha}\) is obtained for a q-curve with parameters k and n - k

Test statistic for a Kruskal-Wallis test (independent samples, same-shape populations, all sample sizes 5 or greater):

\(K=\frac{SSTR}{SST/(n-1)}\) or
\(K=\frac{12}{n(n+1)}\sum_{j=1}^{k} \frac{R_j^2}{n_j} - 3(n+1)\)

where SSTR and SST are computed for the ranks of the data, and \(R_j\) denotes the sum of the ranks for the sample data from Population j. K has approximately a chi-square distribution with df = k -1

(a) Find the null and alternative hypotheses. Choose the correct answer below.

Since the question asks that “a difference exists in mean IQ at age 7-8 years for preterm children among the three groups,” the correct hypothesis is.

\(H_0: \mu_I = \mu_{IIa} = \mu_{IIb}\)

\(H_a:\) Not all the means are equal.

First we need to get the data from the question.

nj <- c(88, 17, 195)
x <- c(98.6, 97.3, 101.3)
s <- c(11.5, 20.6, 13.1)

Names of variables

n = total number of observations: n

We find SSTR, SSE, SST by using defining formula

n = sum(nj)
k = length(x)

Find SSTR using the formula SSTR = \(\sum n_j(\bar{x_j} - \bar{x})^2\)

SSTR = sum(nj*(x- sum(nj*x)/sum(nj))^2)
SSTR

## [1] 602.2155

round(SSTR, 2)

## [1] 602.22

Find SSE using the formula SSE = \(\sum (n_j-1)s_j^2\)

SSE = sum((nj-1)*s^2)
SSE

## [1] 51587.85

round(SSE, 2)

## [1] 51587.85

Find SST, we use formula SST = SSE + SSTR

SST = SSE + SSTR
SST

## [1] 52190.07

round(SST, 2)

## [1] 52190.07

Find df treatment

k = length(x)
k-1

## [1] 2

Find SS treatment

SSTR

## [1] 602.2155

Find MS treatment

MSTR = SSTR/(k-1)
MSTR

## [1] 301.1077

Find Error df

n - k

## [1] 297

Find Error SS

SSE

## [1] 51587.85

Find Error MS

MSE = SSE / (n - k)
MSE

## [1] 173.6965

Find F-statistic treatment

MSTR / MSE

## [1] 1.733528

round(MSTR / MSE, 2)

## [1] 1.73

Find df total

n - 1

## [1] 299

Find SS total

SST

## [1] 52190.07

Find the critical value.

Since \(\alpha = .01\)

alpha = 0.01
qf(1-alpha, k-1, n-k)

## [1] 4.67732

Round to decimal places

round(qf(1-alpha, k-1, n-k), 2)

## [1] 4.68

Since our test statistic = 1.73 < our critical value \(F_{\alpha}\)=4.68 and it is a right-tailed test, we do not have enough evidence to reject the hypothesis.

Hope that helps!