Reading 11: Hypothesis Testing
Hypothesis testing process
S1: State a null hypothesis (what is to be rejected, must have equal sign) and alternative hypothesis (what is to be supported)
S2: Select test statistics
S3: Specify the significance level.
S4: State the decision rule
S5: Calculate sample statistics
S6: Make decision about the hypothesis and conclusion based on test result.
1-tailed vs. 2 tailed tests
1-tailed test: results from a 1-tailed alternative hypothesis (e.g., Ha:μ>μ0 )
2-tailed test: results from a 2-sided alternative hypothesis.
(e.g., \(H^{a}: \mu \neq \mu _{0}\) )
Test statistics, type I & II errors, and Significance level
Test statistic: is the value that a decision about the hypothesis will be based on.
For a test about the value of distribution mean:
\(Test Statistics = \frac{\bar{x} - \bar{x_{hypothesized}}}{Sd}\)
Type I and II errors
Type I: rejecting the true Null hypothesis.
Type II: Not rejecting a false null hypothesis
Significance level: probability that Type I Error happens. A significance level must be specified to select critical values for the test.
Decision rule, power of a test, and confidence interval
Decision rule: a critical value at a stated level of significance.
Power of A Test: Probability of accurately rejecting a false null hypothesis.
Power of A Test = 1 - P(Type II error)
Relation with Confidence interval:a hypothesis is rejected when the sample statistic lies outside the confidence interval (100x(1- \(\alpha\)))% around the hypothesized value for the chosen level of significance
Statistical result vs. Economically meaningful result
the benefit of Statistical significant result sometimes is not enough to outweigh the costs
P-value
The smallest significance level for which the Null hypothesis would be rejected.
Identify appropriate test
USE TABLES
Concerning 1 Mean
(Normally distributed population)
Known Population Variance
Unknown Population Variance
Test whether (Population) Means are different
(for samples from 2 independent, normally distributed populations)
Unequal Variances
Calculating the Mean Value of Difference
(2 DEPENDENT, equal-size, samples from normal populations)
Concerning 1 Variance
(Normally distributed population)
Comparing Variance
(Normally distributed populations)
Non-parametric test
S1: State hypothesis
S3: Identify significance level
S4: Find the rejection points
S5: Calculate statistic: Use T-statistic: \(T=\frac{\bar{x}-\mu _{0}}{s\div \sqrt{n}}\)
S6: Make conlcusion: if Test statistic is outside the range of critical values, reject Null Hypothesis
S2 & 4: Use Z-statistic: \(Z=\frac{\bar{x}-\mu _{0}}{\sigma\div \sqrt{n}}\)
Equal Varriance
S1: State hypothesis:
\( H_{0}: \mu_{1}-\mu_{2}=0\)
\( H_{1}: \mu_{1}-\mu_{2}\neq 0\)
S2: select t-statistic and Degree of Freedom
degree of freedom: \(df=n_{1}+n_{2}-2\)
S4: Find the rejection points
S5: t-test: \( t=\frac{(\bar{X_{1}}-\bar{X_{2}})-(\mu _{1}-\mu _{2})}{\sqrt{\frac{s_{p}^{2}}{n_{1}}+\frac{s_{p}^{2}}{n_{2}} }}\)
S6: Make conlcusion: Reject of T-stat > critical value or T-stat < negative critical value
S2: T-statistic and Degree of Freedom : \(df=\frac{\left ( \frac{s_{1}^{2}}{n_{1}}+\frac{s_{2}^{2}}{n_{2}} \right )^{2}}{\frac{\left ( \frac{s_{1}^{2}}{n_{1}} \right )^{2}}{n_{1}}+\frac{\left ( \frac{s_{2}^{2}}{n_{2}} \right )^{2}}{n_{2}}}\)
Can use Z-statistic if n is large (n > 30) but prefer T-statistic.
S2: Select Test statistic
S3: Find significance level
S5: \( t= \frac{\left ( \bar{X_{1}}-\bar{X_{2}} \right )-\left ( \mu _{1}-\mu _{2} \right )}{\sqrt{\frac{s_{1}^{2}}{n_{1}}+\frac{s_{2}^{2}}{n_{2}} }}\)
Unknown Population Variance
Step 0: Find Sample Mean Difference, Sample Variance, and Standard Error
Sample Mean Difference: \(\bar{d}=\frac{\sum_{i=1}^{n}(d_{i})}{n}\)
Sample Variance: \( s_{d}^{2}=\frac{\sum_{i=1}^{n}(d_{i}-\bar{d})^{2}}{n-1}\)
S5: \(t=\frac{\bar{d}-\mu _{d0}}{s_{\bar{d}}}\)
Standard Error: \( s_{\bar{d}}=\frac{s_{d}}{\sqrt{n}}\)
S2: Degree of freedom = (n-1)
S1: State hypothesis with \(\mu_{d}\) and \(\mu_{d0}\)
S5: Use Chi-square statistic: \(\chi ^{2}=\frac{(n-1 )\times S^{2}}{\sigma _{0}^{2}}\)
Rejection Points
"Not equal to" \(H_{a}\): reject null hypothesis if test statistic > \(\chi _{\alpha /2}^{2}\) or <\(\chi _{1-\alpha /2}^{2}\) with df = n-1
"Greater than" \(H_{a}\): reject null hypothesis if test statistic > \(\chi _{\alpha}^{2}\) with df= n-1
"Less than" \(H_{a}\): reject null hypothesis if test statistic < \(\chi _{1-\alpha}^{2}\) with df= n-1
Degree of freedom: \((n_{1}-1)\) for \(S_{1}^{2}\) and \((n_{2}-1)\) for \(S_{2}^{2}\) (usually equal)
S4: Rejection points
"Not equal to" \(H_{a}\): reject null hypothesis if test statistic > \(F_{\alpha /2}\) or <\(F_{1-\alpha /2}\) with specified number of numerator and denominator df
"Greater than" \(H_{a}\): reject null hypothesis if test statistic > \(F_{\alpha}\)
"Less than" \(H_{a}\): reject null hypothesis if test statistic < \(F_{1-\alpha}\)
S5: F-distribution: \(F=\frac{s_{1}^{2}}{s_{2}^{2}}\) (whichever larger sample variance is on numerator)
Used when assumptions of parametric test cannot be supported or when data is not suitable for parametric test.
Transaction cost
Taxes
Risk
\(\bar{x}\): Sample mean
\(\mu _{0}\): Hypothesized population mean
\(s\): Sample Standard Deviation
\(n\): Sample Size
\(\bar{x}\): Sample mean
\(\mu _{0}\): Hypothesized sample mean
\(\sigma\): Population Standard Deviation
\(n\): Sample Size
\( (\mu _{1}-\mu _{2}) \): this is the hypothesized difference in Means which is usually equal 0
\( (\mu _{1}-\mu _{2}) \): this is the hypothesized difference in Means which is usually equal 0
\( \mu _{d0}\): is usually = 0
with Sample Variance:
\(S^{2}=\frac{\sum_{i=1}^{n}\left (X_{i}-\overline{X} \right )}{n-1}\)
With \(s_{p}^{2}=\frac{\left ( n_{1}-1 \right )\times s_{1}^{2}+\left ( n_{2}-1 \right )\times s_{2}^{2}}{n_{1}+n_{2}-2}\)
Comparing Correlation
Test of whether population correlation coefficient equal 0
T-stat = \( \frac{r \sqrt{n-2}}{\sqrt{1-r^{2}}} \)
r = sample correlation coefficient
(n-2) degree of freedom
Run test
Rank Correlation Test
From df and \(\alpha\) find t-value
Rejection point: from \(\alpha\) find z-value so that P(Zvalue) = \(1-\frac{\alpha}{2} \) for 2-tail test or \(1-\alpha\) for 1-tail test
\(\frac{\alpha}{2}\) for 2 tail test
\(\alpha\) for 1 tail test