Presentations

Citation++: Data citation, provenance, and documentation, at NSF SI2 Conference, Arlington, VA, Tuesday, February 21, 2017:

 

By Mercè Crosas (PI), Margo Seltzer (PI), Jackson Okhun, Thomas Pasquier & Dataverse team.

The dawning of the digital research age – computational science, computational social science, and the digital humanities – brings with it both enormous potential and challenges. Visions of interactive publication, open data, reproducible results, and massive digital collections are exciting, opening up new research frontiers and the promise of more rapid dissemination of and building upon research ouput. However, to date, little of this vision has been realized. It remains...

Read more about Citation++: Data citation, provenance, and documentation
Dataverse, Cloud Dataverse, and DataTags Tuesday, February 7, 2017:

Talk given at Two Sigma:
The Dataverse project, developed at Harvard's Institute for Quantitative Social Science since 2006, is a widely used software platform to share and archive data for research. There are currently more than 20 Dataverse repository installations worldwide, with the Harvard Dataverse repository alone hosting more than 60,000 datasets. Dataverse provides incentives to researchers to share their data, giving them credit through data citation and control over terms of use and access. In this talk, I'll discuss the Dataverse project, as well as related projects such as...

Read more about Dataverse, Cloud Dataverse, and DataTags
Cloud Dataverse, at Massachusetts Open Cloud Workshop, Boston University, Tuesday, December 6, 2016:

 

Cloud Dataverse will be a new service for accessing and processing public data sets in a the Massachusetts Open Cloud (MOC). It is based on Dataverse, a popular software framework for sharing, archiving, and analyzing research data. Cloud Dataverse extends Dataverse to replicate datasets from institution repositories to a cloud-based repository and store their data files in Swift, making data processing faster for in-situ application running in the cloud.

Cloud...

Read more about Cloud Dataverse
Cloud Dataverse - A Faster Way to Process Data in the Cloud, at OpenStack Summit, Barcelona, Wednesday, October 26, 2016:

A presententation at the OpenStack Summit by  Piyanai Saowarattitada, Mercè Crosas, and Orran Krieger. https://www.openstack.org/summit/barcelona-2016/summit-schedule/events/16875/cloud-dataverse-a-faster-way-to-process-data-in-the-cloud

Abstract: Cloud Dataverse is a new service for accessing and processing public data sets in an OpenStack Cloud. It is based on Dataverse, a popular framework for sharing, preserving, and analyzing research data. Cloud Dataverse extends Dataverse to replicate datasets from per-institution repositories to a cloud-based...

Read more about Cloud Dataverse - A Faster Way to Process Data in the Cloud
Making Data Accessible , at Harvard SEAS, CS Colloquium, Thursday, October 20, 2016:

Since modern science began, data have been a critical part of the scientific enterprise, not only for conducting science but also for communicating and validating scientific results. From the beginning, it was clear that for the scientific community to continually verify scientific results, the underlying data had to be made accessible. But that has not been, and is still not, always the case. In recent years however, public data repositories have grown significantly, making many research data sets easily accessible to others. The ...

Read more about Making Data Accessible

Pages