Kekkei - Advancing Financial Science For Everyone

A Toolkit of Statistical Tests

Now that you understand the hypothesis testing framework, let's learn the most commonly used tests. Each test is designed for a specific type of question and data structure.

Z-Test (Large Sample, σ Known)

The simplest test. Used when you know the population standard deviation (rare in practice, but foundational).

z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}

Z-Test Example

A factory claims light bulbs last μ₀ = 1000 hours. You test 64 bulbs: x̄ = 980, σ = 80 (known).

H₀: μ = 1000, H₁: μ ≠ 1000, α = 0.05

z = (980 - 1000) / (80/√64) = -20 / 10 = -2.0

p-value = 2 × P(Z < -2.0) = 2 × 0.0228 = 0.0456 < 0.05

Reject H₀. Evidence suggests bulbs don't last 1000 hours.

One-Sample t-Test

The workhorse of hypothesis testing. Used when σ is unknown (almost always) and you want to test if a population mean equals a specific value.

t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}} \quad \text{with } df = n - 1

One-Sample t-Test

A nutritionist claims a diet results in weight loss of μ₀ = 5 kg. You measure 25 patients: x̄ = 4.2 kg, s = 2.1 kg.

H₀: μ = 5, H₁: μ ≠ 5, α = 0.05

t = (4.2 - 5) / (2.1/√25) = -0.8 / 0.42 = -1.90

With df = 24, the critical t-values are ±2.064. |-1.90| < 2.064, so p-value > 0.05.

Fail to reject H₀. Insufficient evidence that the true weight loss differs from 5 kg.

Two-Sample t-Test (Independent Samples)

Compares the means of two independent groups. "Is there a difference between Group A and Group B?"

t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}}

Comparing Two Groups

Does a new study method improve test scores?

Control group: n₁ = 30, x̄₁ = 72, s₁ = 10 New method: n₂ = 30, x̄₂ = 78, s₂ = 12

H₀: μ₁ = μ₂, H₁: μ₁ ≠ μ₂

SE = √(100/30 + 144/30) = √(3.33 + 4.80) = √8.13 = 2.85

t = (72 - 78) / 2.85 = -2.11

With df ≈ 56, this gives p ≈ 0.040 < 0.05.

Reject H₀. Evidence suggests the new method produces different (higher) scores.

Assumptions: Both groups are independent, approximately normally distributed (or n is large enough for CLT), and ideally have similar variances. Welch's t-test (default in most software) relaxes the equal variance assumption.

Paired t-Test

Used when the two measurements are connected — typically before/after measurements on the same subjects.

t = \frac{\bar{d}}{s_d / \sqrt{n}} \quad \text{where } d_i = x_{i,\text{after}} - x_{i,\text{before}}

Before-After Study

Blood pressure before and after medication for 10 patients. Differences (before - after): 5, 8, 3, 12, 7, -2, 9, 6, 4, 8

d̄ = 6.0, s_d = 3.77, n = 10

H₀: μ_d = 0 (no change), H₁: μ_d > 0 (blood pressure decreases)

t = 6.0 / (3.77/√10) = 6.0 / 1.19 = 5.04

With df = 9, p < 0.001. Reject H₀. Strong evidence the medication reduces blood pressure.

Why paired is better: By computing differences, each subject serves as their own control. This eliminates person-to-person variability, making it easier to detect the treatment effect.

Chi-Square Test of Independence

For categorical data: tests whether two categorical variables are related.

\chi^2 = \sum \frac{(O_{ij} - E_{ij})^2}{E_{ij}}

Where O = observed count and E = expected count under independence.

Chi-Square Test

Is there a relationship between gender and preference for tea vs coffee?

| | Tea | Coffee | Total | |---|---|---|---| | Male | 30 | 70 | 100 | | Female | 50 | 50 | 100 | | Total | 80 | 120 | 200 |

Under independence, expected count = (row total × col total) / grand total. E(Male, Tea) = 100 × 80 / 200 = 40

χ² = (30-40)²/40 + (70-60)²/60 + (50-40)²/40 + (50-60)²/60 = 2.5 + 1.67 + 2.5 + 1.67 = 8.33

With df = (2-1)(2-1) = 1, the critical value at α = 0.05 is 3.841. 8.33 > 3.841 → Reject H₀. Gender and drink preference are related.

Choosing the Right Test

Test Selection Guide

Question	Data Type	Test
Is the mean equal to a value?	One continuous, σ known	Z-test
Is the mean equal to a value?	One continuous, σ unknown	One-sample t-test
Do two groups have different means?	Two independent groups	Two-sample t-test
Is there a before/after change?	Paired measurements	Paired t-test
Are two categories related?	Two categorical variables	Chi-square test

Decision tree: Start with your question, then identify your data type. The question + data type almost always uniquely determine the right test. If in doubt, most software will guide you.

Test your knowledge

🧠 Knowledge Check

1 / 3

Common Statistical Tests

A Toolkit of Statistical Tests

Z-Test (Large Sample, σ Known)

One-Sample t-Test

Two-Sample t-Test (Independent Samples)

Paired t-Test

Chi-Square Test of Independence

Choosing the Right Test

Test your knowledge

You measure 20 patients' blood pressure before and after a drug. Which test is appropriate?

A Toolkit of Statistical TestsFocusStart Focus Mode

Z-Test (Large Sample, σ Known)FocusStart Focus Mode

One-Sample t-TestFocusStart Focus Mode

Two-Sample t-Test (Independent Samples)FocusStart Focus Mode

Paired t-TestFocusStart Focus Mode

Chi-Square Test of IndependenceFocusStart Focus Mode

Choosing the Right TestFocusStart Focus Mode

Test your knowledgeFocusStart Focus Mode

You measure 20 patients' blood pressure before and after a drug. Which test is appropriate?

A Toolkit of Statistical Tests

Z-Test (Large Sample, σ Known)

One-Sample t-Test

Two-Sample t-Test (Independent Samples)

Paired t-Test

Chi-Square Test of Independence

Choosing the Right Test

Test your knowledge