For this quiz, use the EG DIB evaluation dataset. Note that the variable “total_ely1” contains Year 1 endline test scores and the variable “treatment” denotes EG students (=1) and non-EG students (=0). For now, ignore covariates or clustering in your calculations.
0 of 4 Questions completed
Questions:
You have already completed the quiz before. Hence you can not start it again.
Quiz is loading…
You must sign in or sign up to start the quiz.
You must first complete the following:
0 of 4 Questions answered correctly
Your time:
Time has elapsed
You have reached 0 of 0 point(s), (0)
Earned Point(s): 0 of 0, (0) 0 Essay(s) Pending (Possible Point(s): 0)
Compare the average Year 1 test scores for EG and non-EG students using the ‘summarize’ command with conditional statements:
sum total_ely1 if treatment == 0 sum total_ely1 if treatment == 1
The Y1 average for EG students is 9.606 while the Y1 average for non-EG students is 9.557.
Compare the median Year 1 test scores for EG and non-EG students using the ‘summarize’ command with conditional statements, this time including the ‘detail’ option:
sum total_ely1 if treatment == 0, det sum total_ely1 if treatment == 1, det
The Y1 median for both EG and non-EG students is 10.
Perform a t-test comparing Year 1 endline scores for EG and non-EG students:
ttest total_ely1, by(treatment)
The p-value for a two-sided test is 0.64 > 0.05. Thus we cannot reject the null hypothesis that the difference in means between EG and non-EG students is 0.
First create the pass variable, assigning a 1 to students who scored 11 points and above on the Year 1 test and a 0 otherwise, and omitting students who did not take the Y1 test:
gen pass = (total_ely1 >= 11) if total_ely1 != .
Now perform a proportions test on this variable for EG vs non-EG students:
prtest pass, by(treatment)
The p-value from a two-sided test is 0.09 > 0.05, so we cannot reject the null hypothesis of no difference in EG and non-EG pass rates at the 5% level. However, note that we could reject this difference at the 10% significance level, whereas we would not be able to reject the difference in raw scores at the 10% significance level. This highlights the need to pre-specify your main outcome of interest, since with enough fishing, you can prove almost anything.
1. Does the average EG student perform better on the Year 1 endline test than the average non-EG student?
2. Does the median EG student perform better on the Year 1 endline test than the median non-EG student?
3. Do EG students perform significantly differently on average than non-EG students on the Year 1 endline test (at the 5% level of significance)?
4. Suppose that a passing score on the endline test is 11 points. Are EG students significantly more likely to pass the test than students in the control group (at the 5% level of significance)? Use the command prtest instead of ttest in Stata since your outcome is a binary variable. (Note: the syntax for prtest is exactly the same as ttest, the only difference is how Stata calculates the standard error for a binary variable versus a continuous variable.)
6 December 2024
5 December 2024
4 December 2024
3 December 2024
12 September 2022
Username or Email Address
Password
Remember Me