1.5.5: Normal Distributions and Probability Distributions (2024)

  1. Last updated
  2. Save as PDF
  • Page ID
    44829
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vectorC}[1]{\textbf{#1}}\)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}}\)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}\)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} \)

    We will see shortly that the normal distribution is the key to how probability works for our purposes. To understand exactly how, let’s first look at a simple, intuitive example using pie charts.

    Probability in Pie

    Recall that a pie chart represents how frequently a category was observed and that all slices of the pie chart add up to 100%, or 1. This means that if we randomly select an observation from the data used to create the pie chart, the probability of it taking on a specific value is exactly equal to the size of that category’s slice in the pie chart.

    1.5.5: Normal Distributions and Probability Distributions (2)

    Take, for example, the pie chart in Figure \(\PageIndex{1}\) representing the favorite sports of 100 people. If you put this pie chart on a dart board and aimed blindly (assuming you are guaranteed to hit the board), the likelihood of hitting the slice for any given sport would be equal to the size of that slice. So, the probability of hitting the baseball slice is the highest at 36%. The probability is equal to the proportion of the chart taken up by that section.

    We can also add slices together. For instance, maybe we want to know the probability to finding someone whose favorite sport is usually played on grass. The outcomes that satisfy this criteria are baseball, football, and soccer. To get the probability, we simply add their slices together to see what proportion of the area of the pie chart is in that region: \(36\% + 25\% + 20\% = 81\%\). We can also add sections together even if they do not touch. If we want to know the likelihood that someone’s favorite sport is not called football somewhere in the world (i.e. baseball and hockey), we can add those slices even though they aren’t adjacent or continuous in the chart itself: \(36\% + 20\% = 56\%\). We are able to do all of this because 1) the size of the slice corresponds to the area of the chart taken up by that slice, 2) the percentage for a specific category can be represented as a decimal (this step was skipped for ease of explanation above), and 3) the total area of the chart is equal to \(100\%\) or 1.0, which makes the size of the slices interpretable.

    Normal Distributions

    The normal distribution is the most important and most widely used distribution in statistics. It is sometimes called the “bell curve,” although the tonal qualities of such a bell would be less than pleasing. It is also called the “Gaussian curve” of Gaussian distribution after the mathematician Karl Friedrich Gauss.

    Strictly speaking, it is not correct to talk about “the normal distribution” since there are many normal distributions. Normal distributions can differ in their means and in their standard deviations. Figure \(\PageIndex{1}\) shows three normal distributions. The green (left-most) distribution has a mean of -3 and a standard deviation of 0.5, the distribution in red (the middle distribution) has a mean of 0 and a standard deviation of 1, and the distribution in black (right-most) has a mean of 2 and a standard deviation of 3. These as well as all other normal distributions are symmetric with relatively more values at the center of the distribution and relatively few in the tails. What is consistent about all normal distribution is the shape and the proportion of scores within a given distance along the x-axis. We will focus on the Standard Normal Distribution (also known as the Unit Normal Distribution), which has a mean of 0 and a standard deviation of 1 (i.e. the red distribution in Figure \(\PageIndex{1}\)).

    1.5.5: Normal Distributions and Probability Distributions (3)

    Standard Normal Distribution

    Important features of normal distributions are listed below.

    1. Normal distributions are symmetric around their mean.
    2. The mean, median, and mode of a normal distribution are equal. In Standard Normal Curves, the mean, median, and mode are all 0.
    3. The area under the normal curve is equal to 1.0 (or 100% of all scores will fall somewhere in the distribution).
    4. Normal distributions are denser in the center and less dense in the tails (bell-shaped).
    5. There are known proportions of scores between the mean and each standard deviation.
      1. One standard deviation- 68% of the area of a normal distribution is within one standard deviation of the mean (one standard deviation below the mean through one standard deviation above the mean).
      2. Two standard deviations- Approximately 95% of the area of a normal distribution is within two standard deviations of the mean (two standard deviations below the mean through two standard deviations above the mean).

    These properties enable us to use the normal distribution to understand how scores relate to one another within and across a distribution.

    Probability in Normal Distributions

    Play with this applet to learn about probability distributions (website address: http://www.rossmanchance.com/applets/OneProp/OneProp.htm?candy=1)

    Probability distributions are what make statistical analyses work. Each distribution of events from any sample can be “translated” into a probability distribution, then used to predict the probability of that event happening!

    Just like a pie chart is broken up into slices by drawing lines through it, we can also draw a line through the normal distribution to split it into sections. We know that one standard deviation below the mean to one standard deviation above the mean contains 68% of the area under the curve because we know the properties of standard normal curves.

    1.5.5: Normal Distributions and Probability Distributions (4)

    This time, let’s find the area corresponding to the extreme tails. For standard normal curves, 95% of scores should be in the middle section of Figure \(\PageIndex{3}\), and a total of 5% in the shaded areas. Since there are two shaded areas, that's about 2.5% on each side.

    1.5.5: Normal Distributions and Probability Distributions (5)

      Those important characteristics of a Standard Normal Distribution are deceptively simple, so let's put some of these ideas together to see what we get.

      Law of Large Numbers: I know through the Law of Large Numbers that if I have enough data, then my sample will begin to become more and more "normal' (have the characteristics of a normally distributed population, including the important characteristics above).

      Probability Distributions: I know that I can make predictions from probability distributions. I can use probability distributions to understand the likelihood of specific events happening.

      Standard Normal Curve: I know that there should be specific and predictable proportions (percentages) of scores between the mean and each standard deviation away from the mean.

      Together, these features will allow you to:

      • Predict the probability of events happening.
      • Estimate how many people might be affected based on sample size.
      • Test Research Hypotheses about different groups.

      This is the magic of the Standard Normal Curve!

      Non-Parametric Distributions

      One last thing...

      There's a lot of heavy lifting for the Standard Normal Distribution. But what if your sample, or even worse, the population, is not normally distributed? That's when non-parametric statistics come in.

      A parameter is a statistic that describes the population. Non-parametric statistics don’t require the population data to be normally distributed. Most of the analyses that we'll conduct compare means (a measure of central tendency) of different groups. But if the data are not normally distributed, then we can’t compare means because there is no center! Non-normal distributions may occur when there are:

      • Few people (small N)
      • Extreme scores (outliers)
      • There’s an arbitrary cut-off point on the scale. (Like if a survey asked for ages, but then just said, “17 and below”.)

      The reason that I bring this up is because sometimes you just can't run the statistics that we're going to learn about because the population's data is not normally distributed. Since the Standard Normal Curve is the basis for most types of statistical inferences, you can't use these (parametric) analyses with data that they don't fit. We'll talk about alternative analyses for some of the common (parametric) statistical analyses, but you heard it here first! Non-parametric distributions are when the population is not normally distributed.

      Okay, onward to learning more about how the Standard Normal Curve is important...

      1.5.5: Normal Distributions and Probability Distributions (2024)

      FAQs

      What does 1.5 standard deviation mean? ›

      For example, if a z-score is 1.5, it is 1.5 standard deviations away from the mean. Because 68% of your data lies within one standard deviation (if it is normally distributed), 1.5 might be considered too far from average for your comfort.

      What is normal distribution and probability distribution? ›

      What is normal distribution? A normal distribution is a type of continuous probability distribution in which most data points cluster toward the middle of the range, while the rest taper off symmetrically toward either extreme. The middle of the range is also known as the mean of the distribution.

      How do you compare probabilities of two normal distributions? ›

      Answer and Explanation:

      The two normal distributions can be compared by comparing the value parameters of normal distribution. The parameters of the normal distribution are the mean and standard deviation. The value of mean shifts the distribution to the left or to the right of actual the distribution.

      What is the area to the left of μ 1.5 σ for a normal distribution? ›

      For a normal distribution, the area to the left of the number (μ - 1.5σ) is 0.6469. This means that 64.69% of the values in a normal distribution fall to the left of the number (μ - 1.5σ).

      What is a 1.5 SD z-score? ›

      What does a 1.5 z-score mean? A z-score is defined as the number of standard deviation from the mean. A z-score of 1.5 means that this value 1.5 standard deviations above the mean. If it were -1.5, it would be below the mean.

      What math SAT score is 1.5 standard deviations above the mean? ›

      The math SAT score is 520 + 1.5(115) ≈ 692.5. The exam score of 692.5 is 1.5 standard deviations above the mean of 520.

      How to calculate probability from normal distribution? ›

      The probability of P(a < Z < b) is calculated as follows. Then express these as their respective probabilities under the standard normal distribution curve: P(Z < b) – P(Z < a) = Φ(b) – Φ(a). Therefore, P(a < Z < b) = Φ(b) – Φ(a), where a and b are positive.

      How do you calculate probability distribution? ›

      Probability Distribution Function

      It can be written as F(x) = P (X ≤ x). Furthermore, if there is a semi-closed interval given by (a, b] then the probability distribution function is given by the formula P(a < X ≤ b) = F(b) - F(a). The probability distribution function of a random variable always lies between 0 and 1.

      What is the relationship between probability and normal distribution? ›

      Since the normal distribution is a continuous distribution, the probability that X is greater than or less than a particular value can be found. A normal curve table gives the precise percentage of scores between the mean (Z-score = 0) and any other Z score.

      How do you compare distributions to normal distributions? ›

      How to Compare Data Distributions to the Normal Distribution Model. Step 1: Look at the shape of the data distribution, comparing it to the normal distribution model. Step 2: Using the empirical rule, determine if the data is normally distributed. Step 3: Conclude if the data can be compared to the Normal Distribution.

      What is the normal distribution for dummies? ›

      A normal distribution is symmetrical around the mean. Normal distribution reaches its highest point at the mean. It is bell-shaped. It has a zero point at the mean and it decreases as you move away from the mean on both sides.

      What are the two 2 types of probability distribution? ›

      There are two types of probability distributions: Discrete probability distributions. Continuous probability distributions.

      What is the z-score for the standard normal distribution? ›

      A Z score represents how many standard deviations an observation is away from the mean. The mean of the standard normal distribution is 0. Z scores above the mean are positive and Z scores below the mean are negative.

      What is the area of the normal distribution of probability? ›

      Use the standard normal distribution to find probability. The standard normal distribution is a probability distribution, so the area under the curve between two points tells you the probability of variables taking on a range of values. The total area under the curve is 1 or 100%.

      How to tell if a population is normally distributed? ›

      A normal distribution is one in which the values are evenly distributed both above and below the mean. A population has a precisely normal distribution if the mean, mode, and median are all equal. For the population of 3,4,5,5,5,6,7, the mean, mode, and median are all 5.

      What is within 1.5 standard deviations of its mean value? ›

      From the empirical rule, about 87% of values are within 1.5 SDs of the mean, so 6.5% in either tail. We estimate that the probability is a little larger than 6.5%.

      What is 0.5 standard deviation mean? ›

      So, a standard deviation of 0.5 basically means that on average the difference between mean and data points is 0.5.

      Is A standard deviation of 1.2 good? ›

      There is no “acceptable” standard deviation. It is just a measure of variation in a data set. Yes it can be greater than 1.

      What does a standard deviation of more than 1 mean? ›

      Conversely, a high standard deviation (significantly higher than 1) indicates that data points spread out over a wider range, signifying high variability.

      References

      Top Articles
      Latest Posts
      Article information

      Author: Jamar Nader

      Last Updated:

      Views: 5950

      Rating: 4.4 / 5 (55 voted)

      Reviews: 94% of readers found this page helpful

      Author information

      Name: Jamar Nader

      Birthday: 1995-02-28

      Address: Apt. 536 6162 Reichel Greens, Port Zackaryside, CT 22682-9804

      Phone: +9958384818317

      Job: IT Representative

      Hobby: Scrapbooking, Hiking, Hunting, Kite flying, Blacksmithing, Video gaming, Foraging

      Introduction: My name is Jamar Nader, I am a fine, shiny, colorful, bright, nice, perfect, curious person who loves writing and wants to share my knowledge and understanding with you.