This lesson assumes basic knowledge of Stata syntax and of writing code in .do files (Intro to Stata, Stata Best Practices), an understanding of the sources of error in data collection (Collecting High-Quality Data, Questionnaire Design), and familiarity with the pros and cons of digital data collection (e.g., the first section on mobile data collection in the SurveyCTO lesson; knowledge of SurveyCTO itself is not necessary for this lesson).
Lesson slides
Datasets
Stata code for examples in videos
IDinsight data cleaning checklist
IDinsight .do file checklist
multract .ado file for splitting multiple response variables into binary variables
Andrade et al. (2021) "iefieldkit to document primary data collection and cleaning in Stata", World Bank
Gentzkow & Shapiro (2014) "Code and Data for the Social Sciences: A Practical Guide"
Kopper et al. "Data cleaning and management", J-PAL Research Resources
9 September 2022
12 September 2022