Data workflow and version control
Economics PhD Professionalism Workshop
These slides are all about transparency and reproducibility (for your future self and for others). Some other resources…
One benefit of good tracking and workflow is that you partially automate your projects…
Hint: consider pytask for efficient automation.
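To make "partially automate" concrete, here is a minimal standard-library sketch of the idea behind build tools: rerun a step only when its output is missing or older than its inputs. This is not pytask's API. The names `needs_rebuild` and `run_step` are hypothetical and chosen for illustration only.

```python
from pathlib import Path
from typing import Callable


def needs_rebuild(inputs: list[Path], output: Path) -> bool:
    """Return True if the output is missing or older than any input."""
    if not output.exists():
        return True
    out_time = output.stat().st_mtime
    return any(p.stat().st_mtime > out_time for p in inputs)


def run_step(inputs: list[Path], output: Path, action: Callable[[], None]) -> bool:
    """Run `action` only if `output` is out of date. Return True if it ran."""
    if needs_rebuild(inputs, output):
        action()
        return True
    return False
```

A step such as "clean the raw data" would pass the raw file as the input, the cleaned file as the output, and the cleaning function as the action. Dedicated tools handle chains of dependent steps far more robustly than this sketch.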
Avoid spaces in file names. Avoid them at all costs. DO NOT PUT SPACES IN YOUR FILE NAMES.
“A space in a file name is a space in your soul.”
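One way to enforce the no-spaces rule is to scan a project folder and list every offending file next to a suggested replacement. The sketch below uses only the standard library. The function names `suggest_name` and `files_with_spaces` are hypothetical, and the underscore convention is an assumption (hyphens would work equally well).

```python
from pathlib import Path


def suggest_name(name: str) -> str:
    """Replace each run of whitespace with a single underscore."""
    return "_".join(name.split())


def files_with_spaces(root: Path) -> list[tuple[Path, Path]]:
    """Return (current, suggested) path pairs for files whose names contain whitespace."""
    pairs = []
    for path in sorted(root.rglob("*")):
        if path.is_file() and any(ch.isspace() for ch in path.name):
            pairs.append((path, path.with_name(suggest_name(path.name))))
    return pairs
```

The function only reports. Renaming is left as a deliberate manual step (for example `current.rename(suggested)`) so that nothing is overwritten by accident.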
Avoid things like this…
great-research-idea
|
|---analysis
|   |   final_analysis.R
|   |   final_final_analysis.R
|   |   last_analysis.R
|
|---data
|   |   clean_data.csv
|   |   extra_clean_data.csv
|
|---paper
|   |   draft1.tex
|   |   final_draft.tex
|   |   final_final_draft.tex
Related point: use sensible but short variable names, without spaces.
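Raw data often arrives with column headers that contain spaces and punctuation. A small helper can turn them into usable variable names. This is a sketch under assumed conventions (lowercase, underscores, a `v_` prefix for names that start with a digit), and `clean_name` is a hypothetical name.

```python
import re


def clean_name(raw: str) -> str:
    """Lowercase the label and replace each run of non-alphanumeric characters with one underscore."""
    name = re.sub(r"[^0-9a-zA-Z]+", "_", raw.strip()).strip("_").lower()
    # Most languages do not allow identifiers to start with a digit, so prefix those.
    if name and name[0].isdigit():
        name = "v_" + name
    return name
```

For example, `clean_name("Hourly Wage (USD)")` gives `hourly_wage_usd`. The helper removes spaces and punctuation but does not shorten anything, so keeping names short is still a judgment call.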
Some quick thoughts on workflow…