Please enable JavaScript.
Coggle requires JavaScript to display documents.
L6 A&B - Biostatistics Describe, calculate and interpret basic…
L6 A&B - Biostatistics
- Describe, calculate and interpret basic descriptive
statistics, confidence intervals, and plots with a focus
on boxplots.
- Describe, calculate and interpret confidence intervals.
- Describe distribution including normal distribution.
- Describe and interpret hypothesis testing including the
null and alternative hypotheses and type 1 and type 2
errors.
- To describe power and recognise the impact of
interpreting n on significant results.
Note all data used in practical examples is simulated for teaching purposes and
should not be used in clinical decision making.
Populations and Samples
-
Sample
A sample is a representative subset of a population
Research investigates samples, not entire population, which inherently incorporates error when forming inferences (representation, not exact truth)
E.g. Let’s imagine we take a random sample of 20 men aged between 45 and 65, living in Perth
Descriptive Data
What might me want to know about our sample?
• What is the average BMI?
• What is the spread/distribution of BMI?
• What is the range of BMI ?(Excluding outliers)
• Anything else?
-
Measure of Spread
Standard Deviation
- Difference between the mean and the other data points
- Commonly reported along with mean
- Provides insight into the spread of the data
- Is the square root of "variance"
- Sample Sd is unbiased estimator of population Sd
-
Interquartile Range
- Difference between the 25th and 75th percentile values
- Details where the middle 50% of the data lies between
- Can be useful to report with median
- Q1 = Median of lower half or; (= (n+1)/4)
- Q3 = Median of upper half or; (= [3(n+1)]/4)
Range
• Difference between the minimum and maximum values
Boxplots
1. IQR = Q3 - Q1
- Determining Outliers
- Min Outlier < Q1-1.5xIQR
- Max Outlier > Q3+1.5xIQR
-
-
Confidence Intervals (CI)
When we calculate the mean of a sample we acknowledge that our value is unlikely to be the true mean of the population.
=> it is only an approximate representation, not an exact representation
- random sample take from the population
Thereby it is important to present a range for our mean, rather than a single value (The mean for the population will fall within this range)
Conventionally 95% CI
The range which the true value falls within is commonly set as the same arbitrary level of accepted error as p values [20:1 0.95]
Yet we can use 90% or 99% CR
-
Using Normal Distributions to calculate CI in Larg Sample Sizes
For smaller sample sizes (N<120) we must make adjustments for error incurred when estimating the standard error
- The t-distribution with n-1 is used instead of the mean
-
Statistics
“the practice or science of collecting and analysing numerical data in large quantities"
Allows researchers to extrapolate and form inferences using data obtained from representative sample.”
Bio statistics is “the branch of statistics that deals with data relating to living organisms.