textreg is an R package for doing text analysis. But generally see CRAN for most recent version.
The attached 'ngram_code_bundle' contains the code for the Statistical Analysis and Data Mining paper.
For a file of the simulation results please contact the authors (file size of 400MB).