Jump to content

Margin of error

From Wikipedia, the free encyclopedia
Probability densitiesof polls of different sizes, each color-coded to its 95%confidence interval(below), margin of error (left), and sample size (right). Each interval reflects the range within which one may have 95% confidence that thetruepercentage may be found, given a reported percentage of 50%. Themargin of erroris half the confidence interval (also, theradiusof the interval). The larger the sample, the smaller the margin of error. Also, the further from 50% the reported percentage, the smaller the margin of error.

Themargin of erroris a statistic expressing the amount of randomsampling errorin the results of asurvey.The larger the margin of error, the less confidence one should have that a poll result would reflect the result of a census of the entirepopulation.The margin of error will be positive whenever a population is incompletely sampled and the outcome measure has positivevariance,which is to say, whenever the measurevaries.

The termmargin of erroris often used in non-survey contexts to indicateobservational errorin reporting measured quantities.

Concept[edit]

Consider a simpleyes/nopollas a sample ofrespondents drawn from a populationreporting the percentageofyesresponses. We would like to know how closeis to the true result of a survey of the entire population,without having to conduct one. If, hypothetically, we were to conduct pollover subsequent samples ofrespondents (newly drawn from), we would expect those subsequent resultsto be normally distributed about,the true but unknown percentage of the population. Themargin of errordescribes the distance within which a specified percentage of these results is expected to vary from.

According to the68-95-99.7 rule,we would expect that 95% of the resultswill fall withinabouttwostandard deviations() either side of the true mean.This interval is called theconfidence interval,and theradius(half the interval) is called themargin of error,corresponding to a 95%confidence level.

Generally, at a confidence level,a sample sizedof a population having expected standard deviationhas a margin of error

wheredenotes thequantile(also, commonly, az-score), andis thestandard error.

Standard deviation and standard error[edit]

We would expect the average of normally distributed valuesto have a standard deviation which somehow varies with.The smaller,the wider the margin. This is called the standard error.

For the single result from our survey, weassumethat,and thatallsubsequent resultstogether would have a variance.

Note thatcorresponds to the variance of aBernoulli distribution.

Maximum margin of error at different confidence levels[edit]

For a confidencelevel,there is a corresponding confidenceintervalabout the mean,that is, the intervalwithin which values ofshould fall with probability.Precise values ofare given by thequantile function of the normal distribution(which the 68-95-99.7 rule approximates).

Note thatis undefined for,that is,is undefined, as is.

0.84 0.994457883210 0.9995 3.290526731492
0.95 1.644853626951 0.99995 3.890591886413
0.975 1.959963984540 0.999995 4.417173413469
0.99 2.326347874041 0.9999995 4.891638475699
0.995 2.575829303549 0.99999995 5.326723886384
0.9975 2.807033768344 0.999999995 5.730728868236
0.9985 2.967737925342 0.9999999995 6.109410204869
Log-log graphs ofvs sample sizenand confidence levelγ.The arrows show that the maximum margin error for a sample size of 1000 is ±3.1% at 95% confidence level, and ±4.1% at 99%.
The inset parabolaillustrates the relationship betweenatandat.In the example,MOE95(0.71) ≈0.9 × ±3.1%≈ ±2.8%.

Sinceat,we can arbitrarily set,calculate,,andto obtain themaximummargin of error forat a given confidence leveland sample size,even before having actual results. With

Also, usefully, for any reported

Specific margins of error[edit]

If a poll has multiple percentage results (for example, a poll measuring a single multiple-choice preference), the result closest to 50% will have the highest margin of error. Typically, it is this number that is reported as the margin of error for the entire poll. Imagine pollreportsas

(as in the figure above)

As a given percentage approaches the extremes of 0% or 100%, its margin of error approaches ±0%.

Comparing percentages[edit]

Imagine multiple-choice pollreportsas.As described above, the margin of error reported for the poll would typically be,asis closest to 50%. The popular notion ofstatistical tieorstatistical dead heat,however, concerns itself not with the accuracy of the individual results, but with that of therankingof the results. Which is in first?

If, hypothetically, we were to conduct pollover subsequent samples ofrespondents (newly drawn from), and report result,we could use thestandard error of differenceto understand howis expected to fall about.For this, we need to apply thesum of variancesto obtain a new variance,,

whereis thecovarianceofand.

Thus (after simplifying),

Note that this assumes thatis close to constant, that is, respondents choosing either A or B would almost never choose C (makingandclose toperfectly negatively correlated). With three or more choices in closer contention, choosing a correct formula forbecomes more complicated.

Effect of finite population size[edit]

The formulae above for the margin of error assume that there is an infinitely large population and thus do not depend on the size of population,but only on the sample size.According tosampling theory,this assumption is reasonable when thesampling fractionis small. The margin of error for a particular sampling method is essentially the same regardless of whether the population of interest is the size of a school, city, state, or country, as long as the samplingfractionis small.

In cases where the sampling fraction is larger (in practice, greater than 5%), analysts might adjust the margin of error using afinite population correctionto account for the added precision gained by sampling a much larger percentage of the population. FPC can be calculated using the formula[1]

...and so, if pollwere conducted over 24% of, say, an electorate of 300,000 voters,

Intuitively, for appropriately large,

In the former case,is so small as to require no correction. In the latter case, the poll effectively becomes a census and sampling error becomes moot.

See also[edit]

References[edit]

  1. ^Isserlis, L. (1918)."On the value of a mean as calculated from a sample".Journal of the Royal Statistical Society.81(1). Blackwell Publishing: 75–81.doi:10.2307/2340569.JSTOR2340569.(Equation 1)

Sources[edit]

  • Sudman, Seymour and Bradburn, Norman (1982).Asking Questions: A Practical Guide to Questionnaire Design.San Francisco: Jossey Bass.ISBN0-87589-546-8
  • Wonnacott, T.H.; R.J. Wonnacott (1990).Introductory Statistics(5th ed.). Wiley.ISBN0-471-61518-8.

External links[edit]