Chapter 20: Data Preparation and Analysis of Data - Coggle Diagram

- - - - a) mail survey returns
      - b) coded interview data
      - c) pre-test or post-test data
      - d) observational data
  - - - a) Are the responses legible/readable?
      - b) Are all important questions answered?
      - c) Are the responses complete?
      - d) Is all relevant contextual information included (e.g., date, time)?
  - - - Database programs: more complex, but more flexible for manipulating data
      - Statistical programs
    - - a. variable name
      - b. variable description
      - c. variable format (number, date, text)
      - d. instrument/method of collection
      - e. date collected
      - f. respondent or group
      - g. variable location (in database)
      - h. notes
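The codebook fields listed above (a to h) can be sketched as a small record type. This is only an illustration: the class name `CodebookEntry`, its field names, and the example variable `q1_satisfaction` are hypothetical and not taken from the chapter.

```python
from dataclasses import dataclass
from datetime import date

# The three variable formats named in the notes (item c).
ALLOWED_FORMATS = {"number", "date", "text"}


@dataclass
class CodebookEntry:
    """One codebook row; field names are hypothetical labels for items a-h."""
    name: str              # a. variable name
    description: str       # b. variable description
    fmt: str               # c. variable format: "number", "date" or "text"
    instrument: str        # d. instrument/method of collection
    date_collected: date   # e. date collected
    respondent_group: str  # f. respondent or group
    location: str          # g. variable location (in database)
    notes: str = ""        # h. notes

    def __post_init__(self):
        # Reject formats outside the three listed in the notes.
        if self.fmt not in ALLOWED_FORMATS:
            raise ValueError(f"unknown format: {self.fmt!r}")


# Hypothetical example entry, invented for illustration only.
entry = CodebookEntry(
    name="q1_satisfaction",
    description="Overall satisfaction, rated 1 (low) to 5 (high)",
    fmt="number",
    instrument="mail survey",
    date_collected=date(2024, 1, 15),
    respondent_group="programme participants",
    location="survey table, column 3",
)

# A codebook is then a lookup from variable name to its entry.
codebook = {entry.name: entry}
print(codebook["q1_satisfaction"].description)
```

Keeping the codebook as structured records rather than free text makes it easy to check that every variable in the database has a documented format and location.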
- - - - Many analysis programs automatically treat blank values as missing
      - In others, you need to designate specific values to represent missing values
    - - Some scale items are worded in reverse to help reduce the possibility of a response set
      - When you analyse data, you want all scores for scale items to be in the same direction, so that high scores mean the same thing and low scores mean the same thing
      - You therefore have to reverse the ratings for some of the scale items with the formula: New Value = (High Value + 1) - Original Value
    - - Once you have transformed any individual scale items, you will often want to add or average across individual items to get a total score for the scale
    - - You will want to collapse many variables into categories
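The four transformations above (missing values, item reversal, scale totals, collapsing into categories) can be sketched in plain Python. The missing-value code `-99`, the function names, the cut-points, and the sample ratings are all assumptions made for illustration. The reversal formula is the one given in the notes and assumes a scale that starts at 1.

```python
MISSING_CODE = -99  # hypothetical code designated to represent "missing"


def to_missing(value, missing_codes=(MISSING_CODE, "")):
    """Map designated missing codes (and blanks) to None."""
    return None if value in missing_codes else value


def reverse_score(value, high):
    """New Value = (High Value + 1) - Original Value, for a scale from 1 to high."""
    return None if value is None else (high + 1) - value


def scale_score(items, average=True):
    """Add or average across items; missing items are skipped (an assumption)."""
    answered = [v for v in items if v is not None]
    if not answered:
        return None
    total = sum(answered)
    return total / len(answered) if average else total


def collapse(value, cutpoints, labels):
    """Collapse a number into categories; cutpoints are inclusive upper bounds.

    labels must have one more element than cutpoints.
    """
    if value is None:
        return None
    for upper, label in zip(cutpoints, labels):
        if value <= upper:
            return label
    return labels[-1]


# Hypothetical respondent: four items on a 1-5 scale, the third is a reversal item.
raw = [4, -99, 2, 5]
cleaned = [to_missing(v) for v in raw]             # [4, None, 2, 5]
cleaned[2] = reverse_score(cleaned[2], high=5)     # (5 + 1) - 2 = 4
total = scale_score(cleaned, average=False)        # 4 + 4 + 5 = 13
mean = scale_score(cleaned)                        # 13 / 3
category = collapse(mean, cutpoints=[2, 3.5], labels=["low", "medium", "high"])
print(cleaned, total, category)
```

Whether to skip missing items when averaging, or to drop the respondent instead, is an analysis decision the notes do not settle; the sketch simply skips them.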
- - - - i. Quality of data: should be checked as early as possible.
      - ii. Use different types of analyses: frequency counts, descriptive statistics, normality, associations.
      - iii. Other initial data quality checks: data cleaning, analysis of missing observations, analysis of extreme observations, and comparison and correction of differences in coding schemes.
      - i. Quality of measurements: checked during the initial data analysis phase only when it is not the focus or research question of the study.
      - ii. Two ways to assess measurement quality:
        
        1. analysis of homogeneity
        
        2. confirmatory factor analysis
      - iii. During the analysis of homogeneity, one inspects the variances of the items and the scales, the Cronbach's alpha of the scales, and the change in Cronbach's alpha when an item is deleted from a scale.
      - iv. After assessing the quality of the data and of the measurements, one might decide to impute missing data or to perform initial transformations. Possible transformations of variables are:
        
        a. Square root transformation (if distribution differs moderately from normal).
        b. Log-transformation (if the distribution differs substantially from normal).
        c. Inverse transformation (if the distribution differs severely from normal).
        d. Make categorical (ordinal/dichotomous) (if the distribution differs severely from normal, and no transformations help)
      - v. One should check the success of the randomisation procedure, for instance by checking whether background and substantive variables are equally distributed within and across groups. If the study did not need and/or use a randomisation procedure, one should check the success of the non-random sampling, for instance, by checking whether all subgroups of the population of interest are represented in the sample. Other possible data distortions that should be checked are:
        
        a. Dropout (this should be identified during the initial data analysis phase)
        b. Item nonresponse (whether this is random or not should be assessed during initial data analysis phase)
        c. Treatment quality (using manipulation checks)
      - Characteristics of data sample
        
        Can be assessed by looking at:
        
        a) Basic characteristics of important variables
        
        b) Scatter plots
        
        c) Correlations
        
        d) Cross-tabulations
        
        Analyses that can be used during the initial data analysis phase:
        
        a. Univariate statistics
        
        b. Bivariate associations (correlations)
        
        c. Graphical techniques (scatter plots)
        
        It is important to take the measurement levels of the variables into account for the analyses, as special techniques are available for each level:
        
        Nominal and ordinal variables: frequency counts (numbers and percentages), associations (cross-tabulations, hierarchical log-linear analysis, log-linear analysis), exact tests or bootstrapping, computation of new variables.
        
        Continuous variables: distribution, statistics (mean M, SD, variance, skewness, kurtosis), stem-and-leaf displays, box plots.
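Two steps from this branch lend themselves to a short sketch: the homogeneity check (Cronbach's alpha, and alpha when an item is deleted) and the square root, log, and inverse transformations. The code below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores) with sample variances. The function names and the item scores are hypothetical, and the transformations assume strictly positive values.

```python
import math
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of item columns, each a list of scores for the same
    respondents in the same order. Needs at least two items.
    """
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # scale score per respondent
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


def alpha_if_item_deleted(items):
    """Alpha recomputed with each item left out in turn (needs three or more items)."""
    return [cronbach_alpha(items[:i] + items[i + 1:]) for i in range(len(items))]


def transform(values, kind):
    """Apply one of the transformations named in the notes.

    Assumes every value is strictly positive; shifting or reflecting the
    variable first is outside this sketch.
    """
    if kind == "sqrt":
        return [math.sqrt(v) for v in values]
    if kind == "log":
        return [math.log(v) for v in values]
    if kind == "inverse":
        return [1 / v for v in values]
    raise ValueError(f"unknown transformation: {kind!r}")


# Hypothetical scores: three items answered by five respondents.
items = [
    [4, 3, 5, 2, 4],
    [5, 3, 4, 2, 4],
    [4, 2, 5, 1, 3],
]
print(round(cronbach_alpha(items), 3))
print([round(a, 3) for a in alpha_if_item_deleted(items)])
print(transform([1, 4, 9], "sqrt"))
```

An item whose deletion raises alpha noticeably is a candidate for closer inspection. The notes give no numeric thresholds for "moderately", "substantially", or "severely" non-normal, so the choice of transformation is left to the analyst here.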
- - - - No clear hypothesis is stated before analysing the data; the data are searched for models that describe them well
    - - Clear hypotheses about the data are tested