Please enable JavaScript.

Coggle requires JavaScript to display documents.

Statistics :chart_with_upwards_trend: Representations of Data (How do you…

- - - - .
        
        Mean: For quantitative data and this gives a true measure of the data because it uses all the data. It is affected by extreme values. (x bar, x̅)
        
        You can use your calculator to help you find all these!
        
        Go to statistic mode (6) and select 1 (as we are using 1 variable only). If the frequency column does not show up, press shift, menu and then 3. If using grouped data enter the midpoints into x (variable column). Now press OPTN and 3 for 1-variable calculations.
        
        Median, Q2: The middle value when the data values are put in order. Used for quantitative data and extreme values as it gives outliers less influence on the final result. Arrange data in order, use question (n+1)/2 and find the number in the list. If this value happens to be between two numbers, find the mean of those two numbers.
        
        Mode: The value/ class that occurs the most often. This can be both qualitative and quantitative for single modes or two (binomial) modes. The most popular value, :dress:. It is most useful when you have a large data set.
    - - Variance and Standard Deviation :strawberry::lollipop: ( for grouped data on a frequency table)
        
        Standard deviation σ is the square root of the variance. It helps us visualise how spread measurements are from the mean.
        
        Square root of sxx/n
        
        Variance σ2 is used to work out a spread of a data set using all the data given. It can help investors work out the risk of a product :money_with_wings:.
      - Range: Difference between smallest and largest values in the data set.
      - Interquartile range: The difference between values for two given percentiles.
  - - - There are 3 different ways of presenting coding.
        
        Regular coding:
        
        Mean of coded data:
        
        Standard Deviation of coded data:
      - Sometimes you will need to un-code data
        
        To find the mean of the original data when you are given the coded data:
        
        To find the standard deviation of the original data when you are given the coded data:
  - - - Use Interpolation!Note that by doing this, you are assuming that all the data values are evenly distributed with the class.
        
        For this example, we will find the median
        
        Find how many values there are (the sum of all the frequencies)
        
        Half this number to give the n'th value.
        
        Find which group (x column) the n'th value belongs too.
        
        Put all relevant values in you special, magical digram that was invented to save your life in the exam.
        
        Substitute values into equation (Q2 - LB)/ (q2 - lf) = (UB -LB)/ (uf -lf)
        
        Rearrange this equation to find Q2.
- - - - Comparing Box plots
        Compare the quartiles
        Compare the minimum and maximum
      - location: median
        spread: IQR
      - Advantages:
        :check: helps us see the spread of data easily
        :check: easy to compare stratified samples
      - Disadvantages:
        :green_cross: Original data not shown in box plot
        :green_cross: Mean and mode are not represented (easily misunderstood).
    - - Frequency density is needed to calculate the height. :straight_ruler:
      - area of bar = frequency x k, class width
      - Frequency density = frequency/ class width
    - - When comparing two or more sets of data you must always comment on
        1) The measure of location
        2) The measure of spread
        
        When comparing data that has no extreme values compare:
        :slightly_smiling_face: mean
        :slightly_smiling_face: standard deviation
        
        When comparing data that has extreme values compare:
        :slightly_smiling_face: interquartile range
        :slightly_smiling_face:median
    - - The first thing you always have to do is find the midpoints (x)- so easy!
        
        If you need to estimated the mean just create another column called fx. Divide sigma fx by the total frequency :smile:! Remember that the answers are only estimated because the exact values are unknown.