Inferential Statistics
British philosopher Karl Popper said that theories can never be proven, only disproven.
A second problem with testing hypothesized relationships in social science research is that the dependent variable may be influenced by an infinite number of extraneous variables, and it is not feasible to measure and control for all of these extraneous effects.
Sir Ronald A. Fisher established the basic guidelines for significance testing: a statistical result may be considered significant if it can be shown that the probability of it occurring by chance is 5% or less.
In inferential statistics, this probability is called the p-value, 5% is called the significance level (α), and the desired relationship between the p-value and α is denoted as p < 0.05.
The significance level is the maximum level of risk that we are willing to accept as the price of our inference from the sample to the population.
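As a minimal sketch of this idea, the snippet below (using SciPy; the sample values and the hypothesized mean are made up for illustration) computes a p-value with a one-sample t-test and compares it against α = 0.05:

```python
# A minimal sketch of significance testing with a one-sample t-test.
# The data values and the hypothesized population mean are hypothetical.
import numpy as np
from scipy import stats

sample = np.array([5.1, 4.8, 5.6, 5.0, 5.4, 4.9, 5.3, 5.2])
hypothesized_mean = 5.0  # null hypothesis: population mean is 5.0

t_stat, p_value = stats.ttest_1samp(sample, popmean=hypothesized_mean)

alpha = 0.05  # significance level (alpha)
print(f"t = {t_stat:.3f}, p = {p_value:.3f}")
if p_value < alpha:
    print("Result is statistically significant (p < 0.05).")
else:
    print("Fail to reject the null hypothesis.")
```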
We must also understand three related statistical concepts: sampling distribution, standard error, and confidence interval.
A sampling distribution is the theoretical distribution of an infinite number of samples drawn from the population of interest in your study.
However, because a sample is never identical to the population, every sample always has some inherent level of error, called the standard error.
A confidence interval is the estimated range within which the true population parameter is expected to lie at a stated level of confidence (e.g., 95%).
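A short simulation can make these three concepts concrete. The sketch below (NumPy only; the population parameters are arbitrary) draws repeated samples to approximate a sampling distribution, estimates the standard error from a single sample, and builds a 95% confidence interval from it:

```python
# A sketch simulating a sampling distribution: draw many samples from a
# synthetic population, collect their means, and compare the standard
# error estimated from one sample against the spread of sample means.
import numpy as np

rng = np.random.default_rng(42)
population = rng.normal(loc=100, scale=15, size=1_000_000)  # synthetic population

sample_means = [rng.choice(population, size=30).mean() for _ in range(5_000)]

one_sample = rng.choice(population, size=30)
standard_error = one_sample.std(ddof=1) / np.sqrt(len(one_sample))

# 95% confidence interval for the population mean, from the one sample
ci_low = one_sample.mean() - 1.96 * standard_error
ci_high = one_sample.mean() + 1.96 * standard_error

print(f"SD of simulated sample means: {np.std(sample_means):.2f}")
print(f"Standard error from one sample: {standard_error:.2f}")
print(f"95% CI: ({ci_low:.1f}, {ci_high:.1f})")
```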
Most inferential statistical procedures in social science research are derived from a general family of statistical models called the general linear model (GLM).
A model is an estimated mathematical equation that can be used to represent a set of data, and linear refers to a straight line.
Hence, a GLM is a system of equations that can be used to represent linear patterns of relationships in observed data.
Though most variables in the GLM tend to be interval or ratio-scaled, this does not have to be the case.
The GLM is a very powerful statistical tool because it is not a single statistical method, but rather a family of methods that can be used to conduct sophisticated analyses with different types and quantities of predictor and outcome variables.
The most important problem in GLM is model specification, i.e., how to specify a regression equation (or a system of equations) to best represent the phenomenon of interest.
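As a hedged illustration of model specification, the sketch below fits the two-predictor linear model y = β0 + β1·x1 + β2·x2 + ε with statsmodels; the variable names, coefficients, and data are all synthetic:

```python
# A sketch of GLM model specification: a two-predictor linear regression
# y = b0 + b1*x1 + b2*x2 + e, fitted with statsmodels. Data are synthetic.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x1 = rng.normal(size=200)          # first predictor
x2 = rng.normal(size=200)          # second predictor
y = 1.0 + 2.0 * x1 - 0.5 * x2 + rng.normal(size=200)  # outcome with noise

X = sm.add_constant(np.column_stack([x1, x2]))  # add intercept term
model = sm.OLS(y, X).fit()
print(model.summary())              # estimated coefficients, p-values, fit
```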
One of the simplest inferential analyses is comparing the post-test outcomes of treatment and control group subjects in a randomized post-test only control group design, such as whether students enrolled in a special mathematics program perform better than those in a traditional math curriculum.
The t-test was introduced in 1908 by William Sealy Gosset, a chemist working for the Guinness Brewery in Dublin, Ireland, to monitor the quality of stout, a dark beer popular with 19th-century porters in London.
The t-test examines whether the means of two groups are statistically different from each other (non-directional or two-tailed test), or whether one group has a statistically larger (or smaller) mean than the other (directional or one-tailed test).
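A minimal sketch of both forms of the test, using SciPy on made-up post-test scores for the two groups (the `alternative` argument for the one-tailed test requires SciPy 1.6 or later):

```python
# A sketch of a two-sample (independent) t-test comparing treatment and
# control group post-test scores. The scores are hypothetical.
from scipy import stats

treatment = [78, 85, 90, 72, 88, 81, 94, 79]  # special math program
control = [70, 75, 82, 68, 74, 77, 80, 71]    # traditional curriculum

# Two-tailed test: are the group means statistically different?
t_stat, p_two = stats.ttest_ind(treatment, control)
print(f"t = {t_stat:.3f}, p (two-tailed) = {p_two:.3f}")

# One-tailed (directional) test: is the treatment mean larger?
# (the `alternative` keyword needs SciPy >= 1.6)
t_stat, p_one = stats.ttest_ind(treatment, control, alternative="greater")
print(f"p (one-tailed) = {p_one:.3f}")
```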
Factor analysis is a data reduction technique that is used to statistically aggregate a large number of observed measures (items) into a smaller set of unobserved (latent) variables called factors based on their underlying bivariate correlation patterns.
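A minimal factor-analysis sketch using scikit-learn; the six observed "items" are generated from two synthetic latent factors, so the technique should recover a roughly two-factor structure:

```python
# A sketch of factor analysis with scikit-learn: reduce several observed
# items to a smaller number of latent factors. Data are synthetic.
import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(1)
latent = rng.normal(size=(300, 2))             # two hidden factors
loadings = rng.normal(size=(2, 6))             # how items load on factors
items = latent @ loadings + 0.3 * rng.normal(size=(300, 6))  # six observed items

fa = FactorAnalysis(n_components=2)
scores = fa.fit_transform(items)               # factor scores per observation
print(scores.shape)                            # (300, 2): two factors retained
print(fa.components_.round(2))                 # estimated factor loadings
```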
Discriminant analysis is a classificatory technique that aims to place a given observation in one of several nominal categories based on a linear combination of predictor variables.
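A minimal discriminant-analysis sketch using scikit-learn's LinearDiscriminantAnalysis on the bundled iris dataset, chosen here purely for illustration:

```python
# A sketch of linear discriminant analysis: classify observations into
# nominal categories from a linear combination of predictor variables.
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)   # predictors and species labels
lda = LinearDiscriminantAnalysis()
lda.fit(X, y)
print(f"Training accuracy: {lda.score(X, y):.2f}")
print(lda.predict(X[:5]))           # predicted categories for five rows
```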
Logistic regression (or logit model) is a GLM in which the outcome variable is binary (0 or 1) and is presumed to follow a logistic distribution, and the goal of the regression analysis is to predict the probability of the successful outcome by fitting data into a logistic curve.
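A minimal logistic-regression sketch with statsmodels; the binary outcome is simulated from a known logistic curve, so the fitted coefficients can be compared with the true values:

```python
# A sketch of logistic regression: a binary (0/1) outcome fitted to a
# logistic curve with statsmodels. Data are synthetic.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=500)
prob = 1 / (1 + np.exp(-(0.5 + 1.5 * x)))   # true logistic curve
y = rng.binomial(1, prob)                    # binary outcome

X = sm.add_constant(x)
logit = sm.Logit(y, X).fit(disp=False)
print(logit.params)                          # intercept and slope estimates
print(logit.predict(X[:5]))                  # predicted success probabilities
```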
Probit regression (or probit model) is a GLM in which the outcome variable can vary between 0 and 1 (or can assume discrete values 0 and 1) and is presumed to follow a standard normal distribution, and the goal of the regression is to predict the probability of each outcome.
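The probit analogue of the logit sketch above, with the outcome simulated from a standard normal CDF instead of a logistic curve:

```python
# A sketch of probit regression: like the logit model, but the success
# probability follows the standard normal CDF. Data are synthetic.
import numpy as np
import statsmodels.api as sm
from scipy.stats import norm

rng = np.random.default_rng(3)
x = rng.normal(size=500)
y = rng.binomial(1, norm.cdf(0.5 + 1.0 * x))  # normal-CDF link

X = sm.add_constant(x)
probit = sm.Probit(y, X).fit(disp=False)
print(probit.params)                           # intercept and slope estimates
```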
Path analysis is a multivariate GLM technique for analyzing directional relationships among a set of variables.
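Dedicated structural equation modeling software is normally used for path analysis; as a minimal sketch under that caveat, a simple path model x → m → y can be estimated as two ordinary regressions (the variable names and data are hypothetical):

```python
# A minimal path-analysis sketch: the path model x -> m -> y estimated
# as two separate OLS regressions. Dedicated SEM software is normally
# used for larger models; data here are synthetic.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(4)
x = rng.normal(size=400)
m = 0.6 * x + rng.normal(size=400)             # x influences mediator m
y = 0.8 * m + rng.normal(size=400)             # m influences outcome y

path_xm = sm.OLS(m, sm.add_constant(x)).fit()  # path coefficient x -> m
path_my = sm.OLS(y, sm.add_constant(m)).fit()  # path coefficient m -> y
print(path_xm.params[1], path_my.params[1])
```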
Time series analysis is a technique for analyzing time series data, that is, variables that change continually over time.
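A minimal time-series sketch: fitting an ARIMA model from statsmodels to a synthetic random-walk series and forecasting a few steps ahead (the model order here is illustrative, not a recommendation):

```python
# A sketch of time series analysis: fit an ARIMA model to a synthetic
# series and forecast the next few values.
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(5)
series = np.cumsum(rng.normal(size=200))      # a simple random-walk series

result = ARIMA(series, order=(1, 1, 0)).fit() # AR(1) on first differences
print(result.forecast(steps=5))               # forecast five steps ahead
```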