statistical graphics

{{short description|Images used to represent statistical data visually}}

{{Data Visualization}}

Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.

Overview

Whereas statistics and data analysis procedures generally yield their output in numeric or tabular form, graphical techniques allow such results to be displayed in some sort of pictorial form. They include plots such as scatter plots, histograms, probability plots, spaghetti plots, residual plots, box plots, block plots and biplots.{{cite web |date=2003–2010 |title=The Role of Graphics |work=NIST/SEMATECH e-Handbook of Statistical Methods |url=http://www.itl.nist.gov/div898/handbook/eda/section1/eda15.htm |access-date=May 5, 2011}}

Exploratory data analysis (EDA) relies heavily on such techniques. They can also provide insight into a data set to help with testing assumptions, model selection and regression model validation, estimator selection, relationship identification, factor effect determination, and outlier detection. In addition, the choice of appropriate statistical graphics can provide a convincing means of communicating the underlying message that is present in the data to others.

Graphical statistical methods have four objectives:{{cite book |last=Jacoby |first=William G. |year=1997 |title=Statistical Graphics for Univariate and Bivariate Data: Statistical Graphics |pages=2–4}}

  • The exploration of the content of a data set
  • The use to find structure in data
  • Checking assumptions in statistical models
  • Communicate the results of an analysis.

If one is not using statistical graphics, then one is forfeiting insight into one or more aspects of the underlying structure of the data.

History

Statistical graphics have been central to the development of science and date to the earliest attempts to analyse data. Many familiar forms, including bivariate plots, statistical maps, bar charts, and coordinate paper were used in the 18th century. Statistical graphics developed through attention to four problems:James R. Beniger and Dorothy L. Robyn (1978). "Quantitative graphics in statistics: A brief history". In: The American Statistician. 32: pp. 1–11.

  • Spatial organization in the 17th and 18th century
  • Discrete comparison in the 18th and early 19th century
  • Continuous distribution in the 19th century and
  • Multivariate distribution and correlation in the late 19th and 20th century.

Since the 1970s statistical graphics have been re-emerging as an important analytic tool with the revitalisation of computer graphics and related technologies.

Examples

Image:Playfair TimeSeries-2.png's trade-balance time-series chart, published in his Commercial and Political Atlas, 1786]]

File:Snow-cholera-map-1.jpg's Cholera map in dot style, 1854]]

Famous graphics were designed by:

  • William Playfair who produced what could be called the first line, bar, pie, and area charts. For example, in 1786 he published the well known diagram that depicts the evolution of England's imports and exports,{{cite book |last=Tufte |first=Edward |author-link=Edward Tufte |year=1983 |title=The Visual Display of Quantitative Information |url=https://archive.org/details/visualdisplayofq0000tuft |url-access=registration |publisher=Graphics Press |location=Cheshire, Connecticut |isbn=0961392142}}
  • James Watt and his employee John Southern, who around 1790 invented the steam indicator, a device for plotting pressure variations within a steam engine cylinder through its stroke,{{cite book |title=Thing knowledge: a philosophy of scientific instruments |last=Baird |first=Davis |page=170 |year=2004 |publisher=University of California Press |isbn=978-0-520-23249-5 }}
  • Florence Nightingale, who used statistical graphics to persuade the British Government to improve army hygiene,{{cite web |last=Small |first=Hugh |title=Florence Nightingale's statistical diagrams |url=http://www.florence-nightingale-avenging-angel.co.uk/GraphicsPaper/Graphics.htm}}
  • John Snow who plotted deaths from cholera in London in 1854 to detect the source of the disease,{{cite web |last=Crosier |first=Scott |title=John Snow: The London Cholera Epidemic of 1854 |publisher=University of California, Santa Barbara |url=http://webprojects.oit.ncsu.edu/project/bio183de/Black/science/science_reading/8.html}} and
  • Charles Joseph Minard who designed a large portfolio of maps of which the one depicting Napoleon's campaign in Russia is the best known.{{cite web |last=Corbett |first=John |title=Charles Joseph Minard: Mapping Napoleon's March, 1861 |publisher=Center for Spatially Integrated Social Science |url=http://www.csiss.org/classics/content/58 |access-date=21 September 2014}}

See the plots page for many more examples of statistical graphics.

See also

References

; Citations

{{Reflist}}

; Attribution

{{NIST-PD}}

Further reading

  • {{cite book |last=Cleveland|first=W. S. |author-link=William S. Cleveland |year=1993 |title=Visualizing Data |publisher=Hobart Press |location=Summit, NJ, USA |isbn=0-9634884-0-6 |url=https://archive.org/details/visualizingdata00will}}
  • {{cite book |last=Cleveland|first=W. S. |author-link=William S. Cleveland |year=1994 |title=The Elements of Graphing Data |publisher=Hobart Press |location=Summit, NJ, USA |isbn=0-9634884-1-4}}
  • {{cite book |last=Lewi |first=Paul J. |author-link=Paul Lewi |year=2006 |title=Speaking of Graphics |url=http://www.datascope.be/sog.htm}}
  • {{cite book |last=Tufte |first=Edward R. |author-link=Edward Tufte |year=2001 |orig-year=1983 |title=The Visual Display of Quantitative Information |edition=2nd |publisher=Graphics Press |location=Cheshire, CT, USA |isbn=0-9613921-4-2 |url=https://archive.org/details/visualdisplayofq00tuft}}
  • {{cite book |last=Tufte |first=Edward R. |author-link=Edward Tufte |year=1992 |orig-year=1990 |title=Envisioning Information |publisher=Graphics Press |location=Cheshire, CT, USA |isbn=0-9613921-1-8 |url=https://archive.org/details/envisioninginfor0000tuft |url-access=registration}}