I have made public some of the data I have collected over the years. Because of COVID-19, it has been and will continue to be difficult to conduct field research or collect new data. I hope these datasets can help scholars develop research projects. Users are free to publish the results of their analyses without restrictions, provided that the source of the data is credited. All the data can be downloaded from the Harvard Dataverse following this link.
Below is a summary of these data:
China’s Corruption Investigations Dataset
China’s Corruption Investigations Dataset includes information on almost 20,000 officials who were investigated during Xi Jinping’s anti-corruption campaign.
Comprehensive Catalogue of Chinese Genealogies
Comprehensive Catalogue of Chinese Genealogies (CCCG) includes information on more than 50,000 genealogy books that were compiled in China from 1005 to 2007.
Chinese Provincial Legal Funding Dataset
Chinese Provincial Legal Funding Dataset (CPLSD) contains information on government spending on legal institutions, including courts, procuratorates, the police, and judicial bureaus, at the provincial level from 1995 to 2006.
Chinese Political-Legal Leaders Database
Chinese Political-Legal Leaders Database (CLLD) includes information on the Party rank of provincial-level political-legal leaders in China from 1978 to 2013: political-legal committee chairs, court and procuratorate presidents, and police chiefs.
Chinese Listed Firms Personnel Database
Chinese Listed Firms Personnel Database (CLFPD) includes biographical information on all board members of listed firms in China from 1991 to 2012.