Statistical population#Mean

{{Short description|Complete set of items that share at least one property in common}}

{{For|the number of people|Population}}

In statistics, a population is a set of similar items or events which is of interest for some question or experiment.{{Cite journal |last=Haberman |first=Shelby J. |date=1996 |title=Advanced Statistics |url=https://link.springer.com/book/10.1007/978-1-4757-4417-0 |journal=Springer Series in Statistics |language=en |doi=10.1007/978-1-4757-4417-0 |isbn=978-1-4419-2850-4 |issn=0172-7397}}{{Cite web|title=Glossary of statistical terms: Population|website=Statistics.com|url=http://www.statistics.com/glossary&term_id=812|access-date=22 February 2016}} A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hypothetical and potentially infinite group of objects conceived as a generalization from experience (e.g. the set of all possible hands in a game of poker).{{MathWorld|Population}}

A population with finitely many values N in the supportDrew, J. H., Evans, D. L., Glen, A. G., Leemis, L. M. (n.d.). Computational Probability: Algorithms and Applications in the Mathematical Sciences. Deutschland: Springer International Publishing. Page 141 https://www.google.de/books/edition/Computational_Probability/YFG7DQAAQBAJ?hl=de&gbpv=1&dq=%22population%22%20%22support%22%20of%20a%20random%20variable&pg=PA141 of the population distribution is a finite population with population size N. A population with infinitely many values in the support is called infinite population.

A common aim of statistical analysis is to produce information about some chosen population.{{cite book | last1 = Yates | first1 = Daniel S. | last2 = Moore | first2 = David S | last3 = Starnes | first3 = Daren S. | year = 2003 | title = The Practice of Statistics | edition = 2nd | publisher = Freeman | location = New York | url = http://bcs.whfreeman.com/yates2e/ | isbn = 978-0-7167-4773-4 | url-status = dead | archive-url = https://web.archive.org/web/20050209001108/HTTP://bcs.whfreeman.com/yates2e/ | archive-date = 2005-02-09 }}

In statistical inference, a subset of the population (a statistical sample) is chosen to represent the population in a statistical analysis.{{Cite web|title=Glossary of statistical terms: Sample|website=Statistics.com|url=http://www.statistics.com/glossary&term_id=281|access-date=22 February 2016}} Moreover, the statistical sample must be unbiased and accurately model the population. The ratio of the size of this statistical sample to the size of the population is called a sampling fraction. It is then possible to estimate the population parameters using the appropriate sample statistics.{{Cite book |last1=Levy |first1=Paul S. |url=https://books.google.com/books?id=XU9ZmLe5k1IC |title=Sampling of Populations: Methods and Applications |last2=Lemeshow |first2=Stanley |date=2013-06-07 |publisher=John Wiley & Sons |isbn=978-1-118-62731-0 |language=en}}

For finite populations, sampling from the population typically removes the sampled value from the population due to drawing samples without replacement. This introduces a violation of the typical independent and identically distribution assumption so that sampling from finite populations requires "finite population corrections" (which can be derived from the hypergeometric distribution). As a rough rule of thumb,Hahn, G. J., Meeker, W. Q. (2011). Statistical Intervals: A Guide for Practitioners. Deutschland: Wiley. Page 19. https://www.google.de/books/edition/Statistical_Intervals/ADGuRxqt5z4C?hl=de&gbpv=1&dq=infinite%20population&pg=PA19 if the sampling fraction is below 10% of the population size, then finite population corrections can approximately be neglected.

Mean

The population mean, or population expected value, is a measure of the central tendency either of a probability distribution or of a random variable characterized by that distribution.{{cite book|last=Feller|first=William|title=Introduction to Probability Theory and its Applications, Vol I|year=1950|publisher=Wiley|isbn=0471257087|pages=221}} In a discrete probability distribution of a random variable X, the mean is equal to the sum over every possible value weighted by the probability of that value; that is, it is computed by taking the product of each possible value x of X and its probability p(x), and then adding all these products together, giving \mu = \sum x \cdot p(x).....Elementary Statistics by Robert R. Johnson and Patricia J. Kuby, [https://books.google.com/books?id=DWCAh7jWO98C&pg=PA279 p. 279]{{Cite web|last=Weisstein|first=Eric W.|title=Population Mean|url=https://mathworld.wolfram.com/PopulationMean.html|access-date=2020-08-21|website=mathworld.wolfram.com|language=en}} An analogous formula applies to the case of a continuous probability distribution. Not every probability distribution has a defined mean (see the Cauchy distribution for an example). Moreover, the mean can be infinite for some distributions.

For a finite population, the population mean of a property is equal to the arithmetic mean of the given property, while considering every member of the population. For example, the population mean height is equal to the sum of the heights of every individual—divided by the total number of individuals. The sample mean may differ from the population mean, especially for small samples. The law of large numbers states that the larger the size of the sample, the more likely it is that the sample mean will be close to the population mean.Schaum's Outline of Theory and Problems of Probability by Seymour Lipschutz and Marc Lipson, [https://books.google.com/books?id=ZKdqlw2ZnAMC&pg=PA141 p. 141]

See also

References

{{Reflist}}