M241 Probability
Chapter 2:  Repeated Trials and Sampling

Introduction

Appendix 1 -- Basic Counting

2.1 The Binomial Distribution

2.2 Normal Approximation Method

2.4. Poisson Approximation

2.5 Random Sampling

Introduction

Model:  Repeated trials each of which result in success (event happening) or failure (event not happening.)
Each trial is called a Bernoulli (p) trial  where p is probability of success and q = 1 - p is the probability of failure. (see Section 1.3, page 27)
Examples are:

Appendix 1  Basic counting Principles

(2.1)  The Binomial Distribution

What is the probability of getting exactly k successes in n Bernoulli (p) trials?
for k = 0, 1, 2, ... n
This distribution is called the Binomial (n,p)  distribution.
For n = 4, a tree diagram can be used to determine the answer.
P(0 successes in 4 trials) = q4
P(1 success in 4 trials) = 4pq3
P(2 successes in 4 trials) = 6 p2q2
P(3 successes in 4 trials) = 4 p3q
P(4 successes in 4 trials) = p4
What would the next row in this tree look like?  (The row representing k successes in 5 trials)
In general we write
Look at Example 1:
Facts (we will not show):

( 2.2) Normal Approximation:  Method

Application: Working with Confidence Intervals:

Typical Problem:

Conduct a sequence of Bernoulli trials, where you don't know the true probability p of success.

Estimate the true probability p with p-est = #successes/#trials

Give a confidence interval: range about your estimate in which you have a level of confidence that the true value of p lies within.

The level of confidence is the probability that you are correct - i.e. that the true value of p lies within the confidence interval you specified!

Method of finding confidence interval and confidence level:
 


 
 

Section 2.4 Poisson Approximation


 
 

Section 2.5 Random Sampling

Sampling without Replacement:

Model: Population of size N. Choose n elements one a time at random, replacing each in the population after it is drawn. Assume each element is either Good ( and the number of good elements is G) or Bad (and the number of bad elements is B) ( and B + G = N)

this then is just the model we have already studied (where Good just is success and Bad is failure).

The number of Good elements picked in the sample of size n is represented by the Binomial Distribution.

Here the probability of success p = (G/N) and the probability of failure q = (B/N) -- Nothing new here.

Sampling Without Replacement: : Population of size N. Choose n elements one a time at random, but do not replace it. Assume each element is either Good ( and the number of good elements is G) or Bad (and the number of bad elements is B) ( and B + G = N)

The following formula hold:
 

    b) Use Hypergeometric formula with g = 1, b =2
c) P(at least one red) = 1 – P(no reds) = 1- P(3 blacks) = (26/52)(25/51)(24/50)   Exercise 4. Sampling without replacement
  • Exact expression for probability would be the sum of hypergeometric probabilities for g varying from 45 to 100.
  • Approximated well by Binomial Distribution, since N is large -- which is well approximated by the Normal Distribution (since p = .40 is close to 1/2) Mean is np = 40; Standard Deviation = square root (npq) = square root (240).

  •  

     
     
     
     
     
     
     
     
     
     
     
     

    We want the "right tail" of the Normal Distribution: