If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

## AP®︎/College Statistics

### Course: AP®︎/College Statistics>Unit 9

Lesson 4: Sampling distributions for sample proportions

# Normal conditions for sampling distributions of sample proportions

Conditions for roughly normal sampling distribution of sample proportions.

## Want to join the conversation?

• What is the intuition for the rule np>=10 , n(1-p) >=10 ? • So as n increases, even very low or very high values of p start to produce normal sampling distributions?
Are normal distributions impossible if n < 10? • As for your first question, you might be interested in this video by jbstatistics on YT: https://www.youtube.com/watch?v=fuGwbG9_W1c.
The part ~ graphs the sample distribution of sample proportion with p=0.04 and increasing n, and the part ~ does the same thing except with p=0.96. That might answer your question.

As for the second question:
First, I think the 10 is more of an arbitrary value than an actual rule. That's because it's quite subjective whether a graph with lower (np) and/or n(1-p) is normal or not (i.e., the video I linked for your first question gives 15 instead of 10 as in this video.)
Secondly though, as you can see in the video on youtube, if your np or n(1-p) takes on small values, a part of your otherwise quite normal histogram gets cut off at zero. Higher np or n(1-p) values kind of makes the distribution skinnier and therefore prevents the cutoff.

I know your question was posted six months earlier, but hopefully this answers your question if you are still confused with it.
• In the first example, how could we tell which way it was going to be skewed? • what does np represent? • Why do the conditions need to be n*p > 10 and n * q > 10?

I thought that n > 30 is what it needs to be in order for any sample distribution of sample statistic to be normally distributed according the central limit theorem.

So,

The sample distribution of the sample means will be normally distributed if n > 30.

The sample distribution of the sample proportions will be normally distributed if n > 30?

What am I missing?
(1 vote) • The conditions n*p > 10 and n*q > 10 ensure that p is not too close to 0 or 1.

For any given value of n, if p is too close to 0 or 1, then the distribution of the number of successes in a binomial distribution with n trials and success probability p would be significantly asymmetric about its mean (and so significantly non-normal).
• for the first problem, 'a shipment of 50 tangerines everyday' is it means the 'population'? if yes , then how can she sampled 50 tangerines out of 50 population? • If we already know the true population proportion, why are we interested in calculating a sample proportion? We are using the "true" population proportion to validate the normal distribution of the sample, but then why not just work with the (known to be accurate) population data? Why bother with sampling at all in this case?
(1 vote) • For these examples, we know that the sample size is 50 and 125 respectively, but what about the number of samples (each of which consists of 50 or 125 in this case) taken? Obviously, the more the number of samples, the smoother the curve, so it's implicitly assumed that they take many samples (of 50 or 125)?
(1 vote) • How do you see if the sampling distribution will turn out to be uniform?
(1 vote) • Can the mean and standard deviation for the sample distribution of sample proportions still be determined even if the sample distribution is not a normal distribution (ex: skewed left or skewed right)?
(1 vote) 