ExplanationSampling Means We have said that a sample is a set of measurements picked with equal probability from a larger population and that a consequence of this fact is that two samples are unlikely to be the same. We also saw that statistics allows us to infer things about the population from a single sample. Level 1 of this topic introduced you to simple random sampling, sampling error and the idea of inferential statistics. At this level, we will expand on those topics a little. Simple Random Sampling When we say that samples from a population are picked at random, we mean that we have tried to ensure that every member of the population has an equal chance of making it into the sample. Think about rolling dice. If you didn't have an equal chance of rolling each number, you would claim that the dice was not properly random. We do not mean that the values we get are random or collected in a haphazard kind of way. Infact, taking a random sample often requires careful planning. Sampling Error If two samples from the same population are unlikely to contain the same values, then they are unlikely to have the same mean (or any other descriptive statistic). It follows that they are also unlikely to have the same mean as the population from which they were taken (they can't both be right!). Any difference between a sample mean and the population mean is known as the sampling error of the mean. More generally, any difference between sample statistics and population statistics is known as sampling error. Inferential Statistics When we report the mean or standard deviation of a sample (or any other statistic about it) we are stating a verifiable fact about the sample. Such facts are known as descriptive statistics as they describe something about the sample. If we want to use a sample to make a statement about the population, we cannot be as confident. We must use inferential statistics to infer what is most likely to be true about the population. You will see in the section on confidence intervals how we report a range into which we are confident that the population mean falls, rather than just reporting a value for a population mean. |