Probability & Statistics
Probability and statistics are the foundation of machine learning. ML models learn patterns from data, quantify uncertainty, and make probabilistic predictions.
Why it matters: Every ML prediction is probabilistic. Understanding distributions, variance, and statistical inference is essential for building robust models.
Core Concepts
Random Variable
A variable whose value is determined by chance.
X ∈ {1, 2, 3, 4, 5, 6}
Discrete or continuous values
Probability Distribution
Describes how probabilities are distributed over values.
f(x) for continuous
Sum/integral equals 1
Expected Value (Mean)
Average value weighted by probability.
μ = mean
Center of distribution
Variance & Std Dev
Measure of spread around the mean.
σ = √Var(X)
Quantifies uncertainty
Common Distributions
Normal (Gaussian)
ContinuousBell curve. Most common in nature and ML.
Used in: Gaussian processes, noise modeling, initialization
Bernoulli
DiscreteBinary outcome: success (1) or failure (0).
Used in: Binary classification, coin flips
Uniform
BothAll outcomes equally likely.
Used in: Random initialization, sampling
Bayes' Theorem
The foundation of probabilistic reasoning. Update beliefs based on evidence.
Statistical Concepts
AI Applications
Key Takeaway: ML is fundamentally probabilistic. Models learn probability distributions, make probabilistic predictions, and quantify uncertainty.