Please enable JavaScript.
Coggle requires JavaScript to display documents.
Statistics: collecting, analyzing, presenting, interpreting, and making…
Statistics: collecting, analyzing, presenting, interpreting, and making decision base on such analysis
Descriptive Stats: methods for organizing, displaying and describing data using tables, graphs, and summary measures; Means, Median
Ungrouped Data
-
Measures of Dispersion:
-
Standard Deviation: how spreader data is around the mean. lower std tells the data is spread relatively smaller range around the mean. for higher std is the opposite.
-
-
Grouped Data
Mean: sum of the midpoint of the class times frequency of that class divided by the number of the classes
µ = Σmf/N (m midpoint of the class, and f frequency of the class)
-
-
STD
-
-
Empirical rules: bell-shape or normal distribution
- 68% of the observation lie within one standard deviation of the mean.
- 95% of the observation lie within two std of the mean.
- 99.7 of the observation lie within three std of the mean.
Measures of Position
-
Percentiles: The summary measure that divide a ranked data into 100 equal parts. each data set has 99 percentile that divide the data into 100 equal parts.
-
percentile rank of a value: Xi = Number of values less than Xi / total number of values in the data set
Inferential Stats: Methods that use sample results to help make decisions or predictions about population.
Population: all elements (individuals), items or objects whose characteristics are being studies.
-
-
Basic Terms
Data Set: collection of observations on one or more variables. studying total revenue of supermarkets in Tokyo.
Observation or Measurement: The value for an element; revenue of Aeon, revenue of life etc.
Variable: characteristic under study that can assume different value for different element such as; supermarket.
type of variables
Quantitative Variables: can be measure numerically, data is quantitative data
Discrete Variable: countable value such as number of houses, cars etc.
Continuous Variable: can assume any numerical value over a certain interval such as length, age, time etc
Quantitative Data
Frequency Distribution: List all the classes and number of values that belong to each class. data presented in form of frequency distributions are called grouped data
Class boundary: A class boundary is given by the midpoint of the upper limit of one class and the lower limit of the next class
-
-
-
-
Single-valued class: when the value of class is discrete not interval we can construct class with single value. such vehicle owned by household. 0 vehicle class, 2 vehicle class, 5 vehicle class etc.
Cumulative Frequency Distribution: total numbers of value that fall bellow the upper boundary of each class.
cumulative relative frequency = (cumulative frequency of a class )/total observation in the data set
-
Ogive: is a curve drown for the cumulative frequency distribution by joining by the straight line with dot equal to cumulative frequency of respective classes
-
Dotplots:
Outlier or Extreme Values: Values that are very small or very large relative to the values in dataset.
qualitative or categorical variables: variable that cannot assume numerical value but can be classified such as gender, maker of computers etc.
-
Element or Member: specific subject or object such as AEON, LIFE etc
-