"Love Statistics" notes (ii) Understanding variability

Source: Internet
Author: User

Variability (also known as walking or dispersion) can be seen as a measure of the difference between different values.

It may be more accurate to think of variability as the degree to which each value differs from a particular value. So which "value" do you think might be used as that particular value? Typically, this particular value is the mean value. As a result, variability becomes the number of differences between each value and the mean in the measurement data set.

The three quantities of variability are typically used to reflect the variability, dispersion, or dispersion of a set of data. These three quantities are the extreme difference, the standard deviation, and the variance.

Our initial normal idea might be to calculate the mean value of the data set, and then subtract each value with the mean. The average of these distances is then calculated. But it doesn't really work, for instance.

Array: 5,8,5,4,6,7,8,8,3,6

The mean value of this array is 6, then the mean minus each value ( -1+2-1-2+0+1+2+2-3+0) result is 0 according to our idea

In fact, most of the results of this method are zero, so we need to find a way to eliminate the minus sign, so that the result is not zero.

Calculate the extreme difference:

The extreme difference is the most general measure of variability. The extreme difference allows you to understand how the values differ from each other. The extreme difference is calculated by subtracting the minimum value from the maximum value in the data distribution.

In general, the very poor formula is as follows: R = H-l

where r is the extreme difference, H is the maximum value in the dataset, and L is the minimum value in the data set.

Calculate standard deviation:

The most commonly used variability amount is the standard deviation.

The standard deviation (abbreviated to S or SD) represents the average number of variability in a data set. The actual meaning is the average distance from the mean.

The larger the standard deviation, the greater the average distance between each data point and the mean of the data distribution.

650) this.width=650; "title=" Standard deviation "alt=" wkiol1c0fn6d67lkaabx9ty-w_e856.jpg "src=" http://s3.51cto.com/wyfs02/M01/7F/ Fe/wkiol1c0fn6d67lkaabx9ty-w_e856.jpg "/>


S is the standard deviation

∑ is Sigma, which means that all subsequent values are summed

X is the specific value

X on a horizontal line is the mean value of all data

n is the sample size

1) List each value. How values are sorted doesn't matter

2) Calculate the mean value of the data set

3) minus the mean value for each value

4) calculates the square of each difference.

5) calculates the sum of squares of deviations of all and mean values.

6) squared divided by n-1

7) Calculate square root

    • As a measure of variability, the standard deviation can tell us the average deviation of each value of the data set from the mean.

    • The standard deviation is calculated as the average distance from the mean value of the deviation. Therefore, you first need to calculate the mean value as the amount of the concentration trend. Therefore, it is not necessary to waste time on the median and the majority when calculating the standard deviation.

    • The larger the standard deviation, the wider the numerical distribution, the greater the difference between the values

    • As with the mean, standard deviation is sensitive to extreme values. When you calculate the standard deviation of a sample, if there is an extremum in the data, you need to report it in the data.

    • If the s=0, there is absolutely no variability in the data set, and the value is exactly the same, this situation rarely occurs.


Variance is the square of the standard deviation.

This article is from the "Libydwei" blog, make sure to keep this source http://libydwei.blog.51cto.com/37541/1772635

"Love Statistics" notes (ii) Understanding variability

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.