ML Wiki

Latest revision as of 18:40, 23 November 2015

Two-Sample t-test

This type of $t$-test is used when we want to compare the means of two different samples

suppose that we have two samples $a$ and $b$ of sizes $n_a$ and $n_b$ resp.
we're interested in inferring something about $\mu_a - \mu_b$
Point Estimate in this case is $\bar{X}_a - \bar{X}_b$
Standard Error is $\text{SE}_{\bar{X}_a - \bar{X}_b} = \sqrt{\text{SE}_a + \text{SE}_b } = \sqrt{ s^2_a / n_a + s^2_b / n_b}$
- because $\text{SE}^2_{\bar{X}_a - \bar{X}_b} = \text{var}[\bar{X}_a - \bar{X}_b] = \text{var}[x_a] + \text{var}[x_b] = \text{SE}^2_a + \text{SE}^2_b$

The test is of the following form

$H_0: \mu_a = \mu_b$, or $H_0: \mu_a - \mu_b = 0$
$H_A: \mu_a \neq \mu_b$ or $H_A: \mu_a - \mu_b \neq 0$ (two-sided, can also be $<$ or $>$)

So, test statistics:

$T = \cfrac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{s_1^2 / n_1 + s_2^2 / n_2}}$
$T \approx t_{\text{df}}$
$\text{df}$ depends on a few things, discussed below

Welch-Satterthwaite Approximation

What is $\text{df}$ there?

Welch-Satterthwaite Approximation for df is
$\text{df} = \cfrac{( s_1^2 / n_1 + s_2^2 / n_2 )^2 }{ \frac{(s_1^2 / n_1)^2 }{n_1 - 1} + \frac{(s_2^2 / n_2)^2 }{n_2 - 1} }$

This can be a non-integer value, but that's fine

Pooled Variance Estimation

Can we "pool" the samples?
Yes, but only under assumption that $\sigma_1^2 = \sigma_2^2$ (in other words, we assume that the variances are equal)

We can replace $s_1^2$ and $s_2^2$ by the pooled variance:

$s^2 = \cfrac{(n_1 - 1) s_1^2 + (n_2 - 1) s_2^2 }{ (n_1 - 1) + (n_2 - 1)}$
and $\text{df} = (n_1 - 1) + (n_2 - 1) = n_1 + n_2 - 2$

Example

Example 1

males: $n_1 = 281 $
females: $n_2 = 199$
$\bar{X}_1 = -12.9, s_1^2 = 181.5$
$\bar{X}_2 = -17.1, s_2^2 = 231.5$
$\bar{X}_1 - \bar{X}_2 = -12.9 + 17.1 = 4.2$

We then calculate

$\text{df} = 200.09$
so $T_{0.025, 200.09} = 1.97$

We have the following test

$H_0: \mu_1 = \mu_2, H_A: \mu_1 \neq \mu_2$
and $\bar{X}_1 - \bar{X}_2 = 4.2$

$p$-value:

$P(| \bar{X}_1 - \bar{X}_2 | \geqslant 4.2 ) = $
$P \left( \left| \cfrac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{s_1^2 / n_1 + s_2^2 / n_2}} \right| \geqslant \cfrac{4.2}{\sqrt{s_1^2 / n_1 + s_2^2 / n_2}} \right) \approx $
$P\left( |t_\text{df} | \geqslant \cfrac{4.2}{\sqrt{181.5 / 281 + 231 / 119}} \right) = 0.0097$

pretty small, so we reject the $H_0$.

Example 2

Life expectancy in E.Asia and Pacific vs S.Asia

EA&P: $n_1: 30, \bar{X}_1 = 73.1, s_1^2 = 38.7$
SA: $n_2: 8, \bar{X} = 67.0, s_2^2 = 72.5$
$\bar{X}_1 - \bar{X}_2 = 73.1 - 67.0 = 6.1$

We then calculate

$\text{df} = 9.09$ by Welch-Satterthwaite Approximation
$T_{0.025, 0.09} = 2.26$

Our test:

$H_0: \mu_0 = \mu_1, H_A: \mu_0 \neq \mu_1$

$p$-value:

$P(| \bar{X}_1 - \bar{X}_2 | \geqslant 6.1 ) = $
$P \left( \left| \cfrac{(\bar{X}_1 - \bar{X}_2) - (\mu_1 - \mu_2)}{\sqrt{s_1^2 / n_1 + s_2^2 / n_2}} \right| \geqslant \cfrac{6.1}{\sqrt{s_1^2 / n_1 + s_2^2 / n_2}} \right) \approx $
$P ( |t_\text{df} | \geqslant 1.90 ) \approx 0.09$

Not so small - we can't reject the $H_0$, it might be true that $\mu_0 = \mu_1$

R code

R (Means)

male = skeletons[sex == '1', 6]
female = skeletons[sex == '2', 6]

# critical value
qt(0.025, df=200.9, lower.tail=F)

or

t.test(male, female, mu=0, conf.level=0.95, alternative='two.sided')

Sources