Mean, Median, Mode, & Confidence Intervals
Alright, let’s pull our lab coats on and talk about statistics. And before you start having flashbacks to a tough math class, let me reframe this for you. These concepts—mean, median, mode, and confidence intervals—are not abstract theory. They are the diagnostic tools we use to evaluate the health and performance of our own data. Every time we run a batch of Quality Control, we generate a set of numbers. These statistical tools are how we take that pile of numbers and make it tell us a story: “Is our instrument behaving today? Is it precise? Is it accurate? And how confident are we in that assessment?”
Think of it like target practice. A single shot doesn’t tell you much. But a group of 20 shots tells you a story about your aim. That’s what we’re doing here
Center of the Target: Measures of Central Tendency
The first thing we want to know about our group of shots is, “Where did they cluster?” These are the measures of central tendency. They give us a single value that represents the “typical” or “middle” result of our data set
-
Mean (Average): The Workhorse The mean is the one we all know and love. You calculate it by adding up all your data points and dividing by the number of points. In the lab, we use the mean constantly to determine the central point of our QC data over time. It is the heart of our Levey-Jennings chart
- Strength: It uses every single piece of data, giving a complete picture
- Crucial Weakness: The mean is extremely sensitive to outliers. Imagine you have QC results of 20, 21, 20, 22, and then a single “flyer” at 45 because of a random bubble. That single high value will pull the mean significantly upward, making it a poor representation of where the instrument is truly running
-
Median: The Outlier-Resistant Hero The median is simply the middle value of a data set after you’ve arranged all the numbers in order from lowest to highest. If you have an even number of data points, it’s the average of the two middle numbers
- Strength: The median’s superpower is its resistance to outliers. In that same data set (20, 20, 21, 22, 45), the median is 21. It completely ignores the “flyer” and gives you a much better sense of the true central tendency of the data. This is why the median is sometimes a better indicator of an instrument’s performance if you have a few wacky results
- Weakness: It doesn’t use all the information in the data set, so it can be less statistically powerful than the mean if the data is clean and symmetrical
Mode: The Most Frequent The mode is the value that appears most often in a data set. A data set can have one mode, more than one mode, or no mode at all. While less common for analyzing continuous quantitative data in the lab, it can be useful in certain situations, like identifying the most common error code generated by an analyzer
Spread of the Shots: Measures of Dispersion
Knowing the center of our target is only half the story. Are all our shots clustered tightly around the bullseye (high precision), or are they scattered all over the target (low precision)? This is measured by the Standard Deviation (SD)
The Standard Deviation is the single most important statistic for Quality Control. It is a measure of the average distance of each data point from the mean
- A small SD is what we want. It means our data points are all tightly packed together, indicating our method is very precise
- A large SD is bad. It means our data is widely scattered, indicating our method is imprecise or has a lot of random error
The SD is the magic behind the lines on our Levey-Jennings charts. For data that follows a normal, “bell-shaped” curve (Normal Distribution), we know that: * ~68% of all data points will fall within ± 1 SD of the mean * ~95% of all data points will fall within ± 2 SD of the mean * ~99.7% of all data points will fall within ± 3 SD of the mean This is why our QC warning limits are set at ±2 SD and our rejection limits are at ±3 SD. It’s all based on the statistical probability of a random point falling that far from the mean
How Sure Are We?: Confidence Intervals
This is a more advanced concept, but it’s critical. When we calculate a mean from a sample of data (e.g., 20 QC runs), that mean is only an estimate of the “true” mean we would get if we could run the QC a million times. The Confidence Interval (CI) puts a range around our calculated mean and says, “We are X% confident that the true mean of the population lies within this range.”
A 95% Confidence Interval is the most common. If we calculate a 95% CI for our QC mean, we are stating that we are 95% sure that the true, long-term mean of our instrument falls somewhere between the lower and upper bound of that interval
What does this tell us?
- A narrow CI is good. It means our estimate of the mean is very precise, usually because we have a large sample size and a small standard deviation
- A wide CI is a sign of uncertainty. It means our estimate of the mean is fuzzy, likely due to a small sample size or a lot of scatter (high SD) in our data
We use this concept when validating a new instrument or a new lot of controls. We run the control many times, calculate our lab’s mean and its 95% CI, and then check if the manufacturer’s stated mean falls within our interval. If it does, we can be confident that our instrument is performing consistently with others. If it doesn’t, it’s a red flag that our system may have a bias
Key Terms
- Mean: The arithmetic average of a data set, calculated by summing the values and dividing by the number of values. It is sensitive to outliers
- Median: The middle value in a data set that has been arranged in numerical order. It is resistant to the effects of outliers
- Standard Deviation (SD): The most common measure of the dispersion or precision of a data set; it represents the average distance of each data point from the mean
- Normal Distribution: A theoretical data distribution that, when plotted, creates a symmetrical, bell-shaped curve where the mean, median, and mode are all equal
- Confidence Interval (CI): A calculated range of values that is likely to contain the true population parameter (such as the true mean) with a certain level of confidence (e.g., 95%)
- Outlier: A data point that is abnormally distant from the other values in a data set. It can significantly affect the mean but has little effect on the median
- Central Tendency: A measure that represents the typical or “central” value of a data set. The mean, median, and mode are all measures of central tendency