Mathematical framework for representing uncertain statements.
1) Inherent stochasticity
2) Incomplete observability
3) Incomplete modeling
1) Frequentist Probability - rates at which events occur, where the experiment is replicable
2) Bayesian Probability - qualitative levels of certainty, which can be represented as degree of belief
A variable that can take on different values randomly.
Random variables may be discrete or continuous.
A discrete random variable has a countable number of states.It is not always numerical.
A continuous random variable has real numerical value.
A description of how likely a random variable or a set of random variables is to take on each of its possible states.
Probability Mass Function (PMF)
A probability distribution over discrete variables
It is denoted as P(x), P(X=x)
Probability Density Function (PDF)
A probability distribution over continuous variables
It is denoted as p(x)
Joint Probability Distribution
A probability distribution over many variables
It is denoted as P(X=x, Y=y), P(x,y)
Marginal Probability Distribution
A probability distribution over a subset of variables
For discrete random variables x and y, we can find P(x) with sum rule
-> for all x in X, P(X=x) equals sum of P(x,y) for all y
For continuous variables x and y, we can find p(x,y) with integration
-> for all x in X, p(x) equals integral of p(x,y) respect to y
A probability of Y given X=x
It is denoted as P(Y=y|X=x)
P(Y=y|X=x) equals P(Y=y,X=x)/P(X=x)
Chain Rule of Conditional Probabilities
This is derived from P(Y=y|X=x) = P(Y=y,X=x)/P(X=x)
EX) P(a,b,c) = P(a|b,c) P(b,c)
P(b,c) = P(b|c) P(c)
-> P(a,b,c) = P(a|b,c) P(b|c) P(c)
Bayes' Rule
P(y) can be computed with sum rule.
Independence and Conditional Independence
Expectation
The expectation, also called expected value, of some function f(x) with respect to a probability distribution P(X) is the mean value that f takes on when x is drawn from P.
It is denoted as E(X)
Variance
A measure of how much the values of a function of a random variable x vary as we sample different values of x from its probability distribution
Standard Deviation
The square root of the variance
Covariance
A measure of how much variables tend to deviate from their expected values and direction of linear relationship between two continuous vairables.
Correlation
A measure of the strength and direction of a linear relationship between two continuous variables.