A. Basic Concepts and Terminology
Two Examples:
1. Toss a coin once. Possible outcomes are H (heads) or T (tails). Outcome space = {H, T}
2. Toss a single six-sided die. Observe number of dots on face that
comes up. Possible outcomes are 1, 2, 3, 4, 5, 6. Outcome space = {1, 2,
3, 4, 5, 6}
The event A = {2,4,6} is the event rolling an even number --
a 2 or a 4 or a 6
Note: Skip Empirical Distributions for nowA. Review of Set Theory
Since we are concerned with events as subsets of the outcome space, we must understand the basics of set theory.
See page 19 of your text. Make sure you understand all of the event language, set language, and notation, and Venn Diagrams (Outcome space or universal set, event, intersection, union, empty set, partition, complement, inclusions (subset), disjoint sets.)Venn Diagrams: Used to denote sets.
B. Rules of Probability
Probability Distribution: A function P defined on the subsets of the outcome space which satisfies the following 3 rules:
- P(B) >= 0
- For a partition of a set B the probability of B is the sum of the probabilities of the sets in the partition.
- The probability of the outcome space is 1 (certain)
Complement Rule:
P(the complement of A) = 1 - P(A)(Since A and A-complement form a partition of outcome space)
Difference Rule: Note: B-A = B intersected with A-complement
- If A is a subset of B, then P(B-A) = P(B)-P(A)
- If A is not a subset of B, then use P(B-A) = P(B) - P(B intersect A-complement)
Inclusion-Exclusion Principle
- P(A union B) = P(A) + P(B) - P(AB)
- See the Venn Diagrams on page 22
Look at Example 1 page 23 (Rich and Famous) Look at Example 2 page 23 (Numbered tickets in a box) Look at Example 3 page 24 (Shape = "Rectangular-faced die" with "flat" sides numbered 1 and 6 Examples 2 and 3 represent same probability distribution.
Histograms for probability distributions: Area of bar over i is proportional to probability P(i)
(From Example 4) Consider the random experiment: Draw a ticket, replace it, and then draw again (called Sampling with Replacement). We show that the probability of drawing the ticket number i on the first draw and j on the second draw, P(i,j), is the product of the probabilities, P(i)P(j). Later we will learn that that means these two events of drawing the first ticket, and drawing the second ticket, are independent events.
A Few Important Named Distributions (more later)
- Bernoulli(p) distribution. Two outcomes (say 0 and 1). Often call one success and the other failure. Can think of as a biased coin toss where heads comes up with probability p
possible outcome
0 1 probability
p 1-p
- Uniform Distribution on a finite set. Each outcome equally likely. Example if tossing a fair die.
- Uniform(a,b) Distribution. Point picked at random on the interval (a,b) (Where a < b are real numbers). Each point is assumed to be equally likely to be selected. P(picking a point in the interval (x,y) where a <= x <= y <= b is (y-x)/(b-a). Many random number generators in computer languages or on calculators simulate the Uniform (0,1) distribution.
- Uniform distribution on (0,1) x (0,1) in the plane.
Section 1.4 Conditional Probability and Independence
A = event two or more heads occur in 3 tosses = {hhh, hht, hth, thh}
H = event that heads occurs on first toss = {hhh, hht, hth, htt}
P(A|H) = probability that A occurs given that H occurs = 3/4 since
out of 4 outcomes in H , 3 are in A.
Similarly, P(A|T) = 1/4
1. Set up the tree representing representing the stages in a two
(or more) stage experiment.
2. Label the branches with the probabilities. (First stage
-- P(A), second P(A|B) )
3. To calculate the joint probability represented by any path,
multiply the probabilities along that path ( P(AB) = P(A|B)P(B) )
4. Two calculate the probability of a particular event, select
the paths that are part of that event and add their probabilities.
P(B|A) = P(A|B)P(B) / P(A)
Sometimes the denominator on the right hand side, P(A),
will be calculated using the averaging formula:
2. From the tree diagram calculate
P(D|+) = P(D +) / P( + ) = .95 * .01 /( .95 * .01 + .99 * .02) =
approx 32%
P(D|+) = P(D +) / P( + ) = .95 * .30 /( .95 * .30 + .70 * .02)
= approx 95%
P(all 3 on time) = P(C1C2C3) = P(C1)P(C2|C1)P(C3|C1C2) = .7 * .8 * .9 = .504
Example 4: Probability of a flush. (all cards same suit)
5 - stage experiment. Pick the first card, pick the second card, ..., pick the fifth card.
P(Spade flush) = (First Spade) * (Second Space | First Spade) *
(Third Spade | First and Second Spade) *
(Fourth Spade | First, Second, and Third Spade) *
(Fifth Spade | First, Second, Thirds, and Fourth Spade)
= (13/52)*(12/51)*(11/50)*(10/49)*(9/48)
P(Flush) = P(Spade flush) + P(Heart fllush) + P(Diamond Flush) + P(Club
Flush)
= 4 * P(Spade flush) = approx. .00198
Consider first student. Pick another student -- Probability his/her birthday doesn't conflict with first is (364/365). Pick third student -- He/she doesn't conflict with first two with probability P(363/365). Continuing:
P(none of n students have the same birthday) = (364/365)(363/365)...(365- (n-1))/365))
If n = 23, This probability is approximately 50.6 % !