How to Spot Statistical Variability in a Histogram

Deborah J. Rumsey

Updated

2016-03-26 15:35:24

From the book

Statistics For Dummies

Download E-Book

Probability Workbook For Dummies

Explore Book

Download E-Book

Probability Workbook For Dummies

Explore Book

You can get a sense of variability in a statistical data set by looking at its histogram. For example, if the data are all the same, they are all placed into a single bar, and there is no variability. If an equal amount of data is in each of several groups, the histogram looks flat with the bars close to the same height; this signals a fair amount of variability.

The idea of a flat histogram indicating some variability may go against your intuition, and if it does you're not alone. If you're thinking a flat histogram means no variability, you're probably thinking about a time chart, where single numbers are plotted over time. Remember, though, that a histogram doesn't show data over time — it shows all the data at one point in time. Since the histogram is flat, that means that the data are spread out across the spectrum, hence a high variability.

Equally interesting is the idea that a histogram with a big lump in the middle and tails sloping sharply down on each side actually has less variability than a histogram that's straight across. The curves looking like hills in a histogram represent clumps of data that are close together, hence a low variability.

Variability in a histogram is higher when the taller bars are more spread out away from the mean and lower when the taller bars are close to the mean.

For the Best Actress Academy Award winners' ages shown in the above figure, you see many actresses are in the age range from 30–35, and most of the actresses are between 20–50 years in age, which is quite diverse; then you have those outliers, those few older actresses (7 of them) that spread the data out farther, increasing the data's overall variability.

The most common statistic used to measure variability in a data set is the standard deviation, which in a rough sense measures the "average" or "typical" distance that the data lie from the mean. The standard deviation for the Best Actress age data is 11.35 years. A standard deviation of 11.35 years is fairly large in the context of this problem, but the standard deviation is based on average distance from the mean, and the mean is influenced by outliers, so the standard deviation will be influenced as well.

About This Article

About the book author:

Deborah J. Rumsey, PhD, is an Auxiliary Professor and Statistics Education Specialist at The Ohio State University. She is the author of Statistics For Dummies, Statistics II For Dummies, Statistics Workbook For Dummies, and Probability For Dummies.

This article can be found in the category:

Statistics

Hot off the press

Explore Related content

Probability Workbook For Dummies

Statistics All-in-One For Dummies

Statistics Essentials For Dummies

Statistics II For Dummies

Statistics: 1001 Practice Problems For Dummies (+ Free Online Practice)

Statistics Workbook For Dummies with Online Practice

Statistics For Dummies

Probability For Dummies

Biostatistics For Dummies

Book & Article Categories

Book & Article Categories

Collections

How to Spot Statistical Variability in a Histogram

About This Article

About the book author:

This article can be found in the category:

Explore Related content

Book & Article Categories

Book & Article Categories

Collections

How to Spot Statistical Variability in a Histogram

About This Article

This article is from the book:

About the book author:

This article can be found in the category:

Explore Related content

Statistics All-in-One For Dummies Cheat Sheet

10 Steps to a Better Math Grade with Statistics

Statistics and Histograms

What is Categorical Data and How is It Summarized?

Statistics II For Dummies Cheat Sheet

SPSS For Dummies Cheat Sheet

Statistics Workbook For Dummies Cheat Sheet

Probability For Dummies Cheat Sheet

Statistics For Dummies Cheat Sheet

Statistics: 1001 Practice Problems For Dummies Cheat Sheet

Statistics Conundrums: Dealing with Survey Nonresponders

Generalizing Statistical Results to the Entire Population

Figuring Out What Probability Means

Using Probability When Hitting the Slot Machines

Statistical Standard Scores and Standard Normal Distributions — The &#147;Z-Table&#148;

Statistical T-Distribution — The “T-Table”

Discrete Probability Distributions

Principles of Probability

Continuous Probability Distributions

Statistically Figuring Sample Size

Statistical Standard Scores and Standard Normal Distributions — The Z-Table