Probability Axioms and Basic Properties

Master Kolmogorov's axioms and derive fundamental properties including complement rule and addition rule.

22 min read
Beginner

Introduction

Probability theory rests on three simple axioms proposed by Andrey Kolmogorov in 1933. These axioms provide the mathematical foundation for all probabilistic reasoning, from coin flips to quantum mechanics.

Learning Objectives:

  • Understand Kolmogorov's axioms of probability
  • Derive fundamental properties from axioms
  • Apply complement rule, union bound, and inclusion-exclusion
  • Verify properties with Python simulations

Kolmogorov's Axioms

A probability function PP assigns a real number to each event in a sample space Ω\Omega, satisfying three axioms:

Axiom 1 (Non-negativity): For any event AA, P(A)0P(A) \geq 0

Axiom 2 (Normalization): The probability of the entire sample space is 1, P(Ω)=1P(\Omega) = 1

Axiom 3 (Additivity): For mutually exclusive events A1,A2,A3,...A_1, A_2, A_3, ..., P(A1A2A3)=P(A1)+P(A2)+P(A3)+P(A_1 \cup A_2 \cup A_3 \cup \cdots) = P(A_1) + P(A_2) + P(A_3) + \cdots

Note: "Mutually exclusive" means AiAj=A_i \cap A_j = \emptyset for iji \neq j.

Why These Axioms?

  • Axiom 1: Probabilities can't be negative (you can't have "negative chance")
  • Axiom 2: Something must happen (certainty = 100%)
  • Axiom 3: If events can't occur together, their probabilities add

From just these three axioms, we can derive all properties of probability!

Derived Properties

Property 1: Probability of Empty Set

P()=0P(\emptyset) = 0

Proof: Ω\Omega and \emptyset are disjoint, and Ω=Ω\Omega \cup \emptyset = \Omega. By Axiom 3: P(Ω)=P(Ω)=P(Ω)+P()P(\Omega) = P(\Omega \cup \emptyset) = P(\Omega) + P(\emptyset) 1=1+P()1 = 1 + P(\emptyset) P()=0P(\emptyset) = 0

Property 2: Complement Rule

P(Ac)=1P(A)P(A^c) = 1 - P(A)

Proof: AA and AcA^c are disjoint and AAc=ΩA \cup A^c = \Omega: P(Ω)=P(AAc)=P(A)+P(Ac)P(\Omega) = P(A \cup A^c) = P(A) + P(A^c) 1=P(A)+P(Ac)1 = P(A) + P(A^c) P(Ac)=1P(A)P(A^c) = 1 - P(A)

python
import random

def simulate_complement_rule(n_trials=10000):
    """Verify complement rule: P(A) + P(A^c) = 1"""
    # Event A: die roll is even
    omega = [1, 2, 3, 4, 5, 6]
    A = {2, 4, 6}
    A_complement = {1, 3, 5}

    count_A = 0
    count_A_complement = 0

    for _ in range(n_trials):
        roll = random.choice(omega)
        if roll in A:
            count_A += 1
        if roll in A_complement:
            count_A_complement += 1

    p_A = count_A / n_trials
    p_A_complement = count_A_complement / n_trials

    print(f"Simulated P(A) = {p_A:.4f}")
    print(f"Simulated P(A^c) = {p_A_complement:.4f}")
    print(f"Sum: P(A) + P(A^c) = {p_A + p_A_complement:.4f}")
    print(f"\nTheoretical: P(A) = 3/6 = 0.5000")
    print(f"Theoretical: P(A^c) = 3/6 = 0.5000")

simulate_complement_rule()

Addition Rule

Property: Addition Rule (for two events)

For any events AA and BB:

P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)

Why subtract? Without it, we'd count ABA \cap B twice!

python
import random

def simulate_addition_rule(n_trials=10000):
    """Verify: P(A ∪ B) = P(A) + P(B) - P(A ∩ B)"""
    omega = [1, 2, 3, 4, 5, 6]
    A = {2, 4, 6}  # even
    B = {2, 3, 5}  # prime

    count_A, count_B, count_union, count_inter = 0, 0, 0, 0

    for _ in range(n_trials):
        roll = random.choice(omega)
        if roll in A: count_A += 1
        if roll in B: count_B += 1
        if roll in A or roll in B: count_union += 1
        if roll in A and roll in B: count_inter += 1

    p_A = count_A / n_trials
    p_B = count_B / n_trials
    p_union = count_union / n_trials
    p_inter = count_inter / n_trials

    print(f"P(A ∪ B) simulated: {p_union:.4f}")
    print(f"P(A) + P(B) - P(A ∩ B): {p_A + p_B - p_inter:.4f}")

simulate_addition_rule()

Key Takeaways

  1. Three axioms define all of probability
  2. Complement rule: P(Ac)=1P(A)P(A^c) = 1 - P(A)
  3. Addition rule: P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)
  4. Bounds: 0P(A)10 \leq P(A) \leq 1 for all events

Next Lesson: Apply axioms to finite sample spaces with counting!

Property
Formula
Complement$P(A^c) = 1 - P(A)$
Addition$P(A \cup B) = P(A) + P(B) - P(A \cap B)$
Bounds$0 \leq P(A) \leq 1$