Stata to R cheatsheet for Econometrics

Within the field, Stata is the dominant software package for economists. I suspect a large portion of universities, like mine, still do alot of their teaching using it, and given its outsized influence, it’s probably still important to know how to use it if you plan to continue studying or working in the field.

Outside of Economics, however, R is more widely used, more versatile, and just as importantly, free. It’s also true that R is far less user friendly than Stata and can be confusing given the wide range of different approaches, packages and even syntax that are all used to do the same things that seem so effortless in Stata.

With that said, inspired by the great cheatsheets on the Rstudio site, I tried to capture some of the more common Stata commands used in an applied econometrics and their equivalent expressions in R in this Stata to R cheatsheet.

This cheatsheet assumes a basic familiarity with R: how to read-in files, install and load packages, and work with dataframes.

I went back and forth on whether to use or include Tidyverse commands, but based on what I’ve seen within the Economics department at my own university, I’ve opted to keep everything in base R and tried to minimize the use of packages in order to make this cheatsheet as useful to the broadest range of R beginners. When there really was no simple way to do things (e.g. calculating robust standard errors), I just picked the package that I knew of that best replicated the simplicity of the Stata solution.

Lastly, it should be noted that it’s impossible to cover everything in two pages, so this meant to be a “survival” level cheat sheet–a handy reference for the core commands you would need to replicate while doing a basic econometric analysis in Stata.

The content is biased towards my own experience, but if this cheat sheet is at all helpful to others, I would love to hear your feedback, which will help tremendously with future updates.

All comments and suggestions are welcome!

PSE Summer School, Migration Economics Predicting household poverty in Latin America, Part I: Re-splitting the CRHP dataset
comments powered by Disqus