How to describe distribution in a histogram
Shape
Outliers
Center
Spread
Median: the middle number. We use this when the distribution is pretty skewed.
Mean: add up all the appeared numbers and divide the sum by the amount of numbers. We use this when the distribution is quite
Mode: the number that appears the most
Direction
Number of peaks
Inter quartile range (IQR): We use this when the distribution is quite skewed.
Standard deviation: We use this when the distribution is quite symmetric.
Range: the difference between the maximun and the minimun
Bimodel: when there are 2 peaks
Multimodel: when there are more than 2 peaks
Unimodel: when there's only 1 peak
Skewed to the left: distribution has a long tail that extents to the left
Symmetric: the distribution has an equal form on each of its side
Skewed to the right: distribution has a long tail that extents to the right
Uniform: no peaks at all, the same height througout
Outliers are individual data that seems to be isolated (far away) from most of others. We usually call them "Potential ouliters".