Chp. 12 - Intro to Analysis of Variance - Coggle Diagram

- - - - When a researcher uses a nonmanipulated variable to designate groups, the variable is called a quasi-independent variable.
      - The individual groups or treatment conditions that are used to make up a factor are called the levels of the factor.
  - - - Often, a single experiment requires several hypothesis tests to evaluate all the mean differences. However, each test has a risk of a Type I error, and the more tests you do, the more risk there is.
    - - When an experiment involves several different hypothesis tests, the experimentwise alpha level is the total probability of a Type I error that is accumulated from all of the individual tests in the experiment. Typically, the experimentwise alpha level is substantially greater than the value of alpha used for any one of the individual tests.
        
        The advantage of ANOVA is that it performs all three comparisons simultaneously in one hypothesis test. Thus, no matter how many different means are being compared, ANOVA uses one test with one alpha level to evaluate the mean differences and thereby avoids the problem of an inflated experimentwise alpha level.
  - - - The solution to this problem is to use variance to define and measure the size of the differences among the sample means.
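As a rough illustration of how the Type I error risk described above accumulates, the sketch below computes an experimentwise alpha for several separate tests. It assumes the tests are independent and all use the same alpha, which these notes do not state, and it uses the standard formula 1 - (1 - alpha)^c. The function name is made up for this example.

```python
def experimentwise_alpha(alpha: float, num_tests: int) -> float:
    """Probability of at least one Type I error across num_tests tests.

    Assumption for this sketch: every test uses the same alpha and the
    tests are independent of one another.
    """
    return 1 - (1 - alpha) ** num_tests


# Comparing three treatment means pairwise takes three separate tests;
# comparing four means pairwise takes six.
for tests in (1, 3, 6):
    print(tests, round(experimentwise_alpha(0.05, tests), 4))
```

Under these assumptions, three tests at alpha = .05 give an experimentwise alpha of roughly .14, which is the inflation that a single ANOVA avoids.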
- - - - The differences between treatments are not caused by any treatment effect but are simply the naturally occurring, random and unsystematic differences that exist between one sample and another. That is, the differences are the result of sampling error.
        
        The differences between treatments have been caused by the treatment effects. For example, if treatments really do affect performance, then scores in one treatment should be systematically different from scores in another condition.
        
        To demonstrate that there really is a treatment effect, we must establish that the differences between treatments are bigger than would be expected by sampling error alone.
        
        To accomplish this goal, we determine how big the differences are when there is no systematic treatment effect; that is, we measure how much difference (or variance) can be explained by random and unsystematic factors. To measure these differences, we compute the variance within treatments.
  - - - Why are the scores different? The answer is that there is no specific cause for the differences. Instead, the differences that exist within a treatment represent random and unsystematic differences that occur when there are no treatment effects causing the scores to be different.
  - - - When there are no systematic treatment effects, the differences between treatments (numerator) are entirely caused by random, unsystematic factors.
        
        When the treatment does have an effect, causing systematic differences between samples, then the combination of systematic and random differences in the numerator should be larger than the random differences alone in the denominator.
    - - Another possible example of random and unsystematic variability is error of measurement.
        
        Because the denominator of the F-ratio measures only random and unsystematic variability, it is called the error term.
        
        The error term provides a measure of the variance caused by random and unsystematic differences.
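The reasoning above can be summarized in the structure of the F-ratio. The layout below is a sketch in the usual textbook form, written out here for convenience rather than quoted from these notes:

```latex
F = \frac{\text{variance between treatments}}{\text{variance within treatments}}
  = \frac{\text{systematic treatment effects} + \text{random, unsystematic differences}}
         {\text{random, unsystematic differences}}
```

When there is no treatment effect, the numerator and the denominator both measure only random, unsystematic differences, so the ratio should be close to 1.00.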
- - - - Total Sum of Squares, SS total. As the name implies, SS total is the sum of squares for the entire set of N scores.
        
        Within-Treatments Sum of Squares, SS within treatments. Now we are looking at the variability inside each of the treatment conditions.
        
        Between-Treatments Sum of Squares, SS between treatments.
  - - - Each df value is associated with a specific SS value.
        
        Normally, the value of df is obtained by counting the number of items that were used to calculate SS and then subtracting 1. For example, if you compute SS for a set of n scores, then df = n - 1.
  - - - In ANOVA, it is customary to use the term mean square, or simply MS, in place of the term variance.
  - - - The total number of scores in the entire study is specified by a capital letter N.
        
        The sum of the scores for each treatment condition is identified by the capital letter T (for treatment total).
        
        The sum of all the scores in the research study (the grand total) is identified by G.
        
        Although there is no new notation involved, we also have computed SS and M for each sample, and we have calculated ΣX² for the entire set of scores in the study.
        
        Please note that there is no universally accepted notation for ANOVA. Although we are using Gs and Ts, for example, you may find that other sources use other symbols.
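The notation and the SS, df and MS steps above can be tied together in a short computation. This is a minimal sketch, not the notes' own worked example: the scores are illustrative numbers chosen for this example, and the function and key names are made up. It uses the usual computational formulas SS total = ΣX² - G²/N and SS between = Σ(T²/n) - G²/N.

```python
def one_way_anova(groups):
    """Compute SS, df, MS and F for a list of treatment samples."""
    k = len(groups)                                  # number of treatment conditions
    N = sum(len(g) for g in groups)                  # total number of scores
    totals = [sum(g) for g in groups]                # T for each treatment
    G = sum(totals)                                  # grand total
    sum_x2 = sum(x * x for g in groups for x in g)   # sum of squared scores

    ss_total = sum_x2 - G ** 2 / N
    # SS inside each treatment, added up across treatments.
    ss_within = 0.0
    for g in groups:
        mean = sum(g) / len(g)
        ss_within += sum((x - mean) ** 2 for x in g)
    ss_between = sum(T ** 2 / len(g) for T, g in zip(totals, groups)) - G ** 2 / N

    df_between, df_within = k - 1, N - k
    ms_between = ss_between / df_between             # MS = SS / df
    ms_within = ss_within / df_within
    return {
        "ss_total": ss_total,
        "ss_between": ss_between,
        "ss_within": ss_within,
        "df_between": df_between,
        "df_within": df_within,
        "ms_between": ms_between,
        "ms_within": ms_within,
        "F": ms_between / ms_within,
    }


# Illustrative scores for three treatments (made up for this sketch).
scores = [[0, 1, 3, 1, 0], [4, 3, 6, 3, 4], [1, 2, 2, 0, 0]]
result = one_way_anova(scores)
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

For these scores the pieces add up as expected: SS between (30) plus SS within (16) equals SS total (46), and the F-ratio is 15 / 1.333 = 11.25 with df = 2, 12.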
- - - - To answer this question, we need to look at all the possible F values that can be obtained when the null hypothesis is true—that is, the distribution of F-ratios.
        
        F values are always positive (or zero), because an F-ratio is computed from two variances and a variance cannot be negative.
        
        The distribution of F-ratios should pile up around 1.00.
        
        With very large df values, nearly all the F-ratios are clustered very near to 1.00. With the smaller df values, the F distribution is more spread out.
  - - - In the F distribution, we need to separate those values that are reasonably near 1.00 from the values that are significantly greater than 1.00.
        
        To use the F distribution table, you must know the df values for the F-ratio (numerator and denominator), and you must know the alpha level for the hypothesis test.
  - - - Thus, the term significant does not necessarily mean large; it simply means larger than expected by chance.
        
        To provide an indication of how large the effect actually is, it is recommended that researchers report a measure of effect size in addition to the measure of significance.
  - - - In situations where the samples have unequal numbers of participants, ANOVA still provides a valid test, especially when the samples are relatively large and when the discrepancy between sample sizes is not extreme.
  - - - The populations from which the samples are selected must be normal.
        
        The populations from which the samples are selected must have equal variances (homogeneity of variance).
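To see what the distribution of F-ratios looks like when the null hypothesis is true, the sketch below simulates many experiments in which every sample comes from the same normal population, so the assumptions above hold and there is no treatment effect. The population mean and standard deviation (50 and 10), the sample sizes, the number of trials and all function names are arbitrary choices for this illustration.

```python
import random


def f_ratio(groups):
    """F = MS between / MS within for a list of samples."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


def simulate_null_f(k, n, trials, seed=0):
    """Draw k samples of size n from one normal population, many times."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(trials):
        groups = [[rng.gauss(50, 10) for _ in range(n)] for _ in range(k)]
        ratios.append(f_ratio(groups))
    return sorted(ratios)


ratios = simulate_null_f(k=3, n=5, trials=5000)
print("smallest F:", round(ratios[0], 4))
print("median F:", round(ratios[len(ratios) // 2], 4))
print("95th percentile:", round(ratios[int(0.95 * len(ratios))], 4))
```

No simulated F-ratio is negative, most of them fall in the low range near 1.00, and the 95th percentile should land close to the tabled critical value for df = 2, 12 at alpha = .05.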
- - - - Post hoc tests (or posttests) are additional hypothesis tests that are done after an ANOVA to determine exactly which mean differences are significant and which are not.
        
        However, with three or more treatments (k ≥ 3), the problem is to determine exactly which means are significantly different.
  - - - As you do more and more separate tests, the risk of a Type I error accumulates; the accumulated risk is called the experimentwise alpha level.
  - - - This value, called the honestly significant difference, or HSD, is then used to compare any two treatment conditions.
        
        If the mean difference exceeds Tukey’s HSD, you conclude that there is a significant difference between the treatments. Otherwise, you cannot conclude that the treatments are significantly different.
  - - - Although you are comparing only two treatments, the Scheffé test uses the value of k from the original experiment to compute df between treatments. Thus, df for the numerator of the F-ratio is k - 1.
        
        The critical value for the Scheffé F-ratio is the same as was used to evaluate the F-ratio from the overall ANOVA. Thus, Scheffé requires that every posttest satisfy the same criterion that was used for the complete ANOVA.
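The two post hoc procedures above can be sketched as follows. This is a hedged illustration, not the notes' own calculation. It assumes the usual textbook formula HSD = q * sqrt(MS within / n) with equal sample sizes, where q is read from a studentized range table. The scores are made up, the q value of 3.77 is the tabled value for k = 3 and df = 12 at alpha = .05 as best recalled (check it against your own table), and the function names are hypothetical.

```python
import math


def tukey_hsd(q, ms_within, n):
    """Minimum mean difference needed for significance (equal n assumed)."""
    return q * math.sqrt(ms_within / n)


def scheffe_f(sample_a, sample_b, k, ms_within):
    """Scheffé F-ratio comparing two of the k treatments."""
    t_a, t_b = sum(sample_a), sum(sample_b)          # treatment totals
    n_a, n_b = len(sample_a), len(sample_b)
    g = t_a + t_b                                    # total for these two samples
    ss_between = t_a ** 2 / n_a + t_b ** 2 / n_b - g ** 2 / (n_a + n_b)
    ms_between = ss_between / (k - 1)                # df uses k from the full ANOVA
    return ms_between / ms_within                    # error term from the full ANOVA


# Illustrative values: two of three treatments, with MS within = 16/12
# taken from an overall ANOVA on made-up scores.
treatment_a = [0, 1, 3, 1, 0]
treatment_b = [4, 3, 6, 3, 4]
ms_within = 16 / 12

hsd = tukey_hsd(q=3.77, ms_within=ms_within, n=5)
mean_diff = abs(sum(treatment_a) / 5 - sum(treatment_b) / 5)
print("Tukey HSD:", round(hsd, 3), "mean difference:", mean_diff)
print("Scheffé F:", round(scheffe_f(treatment_a, treatment_b, 3, ms_within), 4))
```

The Scheffé F-ratio is then compared with the same critical value used for the overall ANOVA (df = k - 1 and df within), which is what makes the test conservative.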