Publications

2015

Sweeney L, Crosas M, Bar-Sinai M. Sharing Sensitive Data with Confidence: the DataTags System. Technology Science. 2015.

Society generates data on a scale previously unimagined. Wide sharing of these data promises to improve personal health, lower healthcare costs, and provide a better quality of life. There is a tendency to want to share data freely. However, these same data often include sensitive information about people that could cause serious harms if shared widely. A multitude of regulations, laws and best practices protect data that contain sensitive personal information. Government agencies, research labs, and corporations that share data, as well as review boards and privacy officers making data sharing decisions, are vigilant but uncertain. This uncertainty creates a tendency not to share data at all. Some data are more harmful than other data; sharing should not be an all-or-nothing choice. How do we share data in ways that ensure access is commensurate with risks of harm?


Sweeney L, Crosas M. An Open Science Platform for the Next Generation of Data. arXiv.org Computer Science, Computers and Society. 2015.

Imagine an online work environment where researchers have direct and immediate access to myriad data sources and tools and data management resources, useful throughout the research lifecycle. This is our vision for the next generation of the Dataverse Network: an Open Science Platform (OSP). For the first time, researchers would be able to seamlessly access and create primary and derived data from a variety of sources: prior research results, public data sets, harvested online data, physical instruments, private data collections, and even data from other standalone repositories. Researchers could recruit research participants and conduct research directly on the OSP, if desired, using readily available tools. Researchers could create private or shared workspaces to house data, access tools, and computation and could publish data directly on the platform or publish elsewhere with persistent, data citations on the OSP. This manuscript describes the details of an Open Science Platform and its construction. Having an Open Science Platform will especially impact the rate of new scientific discoveries and make scientific findings more credible and accountable. (This manuscript was originally conceived in 2013)

Starr J, Castro E, Crosas M, Dumontier M, Downs RR, Duerr R, Haak L, Haendel M, Herman I, Hodson S, et al. Achieving human and machine accessibility of cited data in scholarly publications. PeerJ Computer Science. 2015.

Crosas M, Honaker J, King G, Sweeney L. Automating Open Science for Big Data. ANNALS of the American Academy of Political and Social Science. 2015;659 (1) :260-273.

The vast majority of social science research uses small (megabyte- or gigabyte-scale) datasets. These fixed-scale datasets are commonly downloaded to the researcher’s computer where the analysis is performed. The data can be shared, archived, and cited with well-established technologies, such as the Dataverse Project, to support the published results. The trend toward big data—including large-scale streaming data—is starting to transform research and has the potential to impact policymaking as well as our understanding of the social, economic, and political problems that affect human societies. However, big data research poses new challenges to the execution of the analysis, archiving and reuse of the data, and reproduction of the results. Downloading these datasets to a researcher’s computer is impractical, leading to analyses taking place in the cloud, and requiring unusual expertise, collaboration, and tool development. The increased amount of information in these large datasets is an advantage, but at the same time it poses an increased risk of revealing personally identifiable sensitive information. In this article, we discuss solutions to these new challenges so that the social sciences can realize the potential of big data.


2014

Pepe A, Goodman A, Muench A, Crosas M, Erdmann C. How Do Astronomers Share Data? Reliability and Persistence of Datasets Linked in AAS Publications and a Qualitative Study of Data Practices among US Astronomers. PLoS ONE. 2014;9.

Goodman A, Pepe A, Blocker AW, Borgman CL, Cranmer K, Crosas M, Di Stefano R, Gil Y, Groth P, Hedstrom M, et al. Ten simple rules for the care and feeding of scientific data. PLoS computational biology. 2014;10.

2013

Crosas M. A data sharing story. Journal of eScience Librarianship. 2013;1 :7.

Altman M, Crosas M. The evolution of data citation: From principles to implementation. IASSIST Quarterly. 2013;37.

Rajasekar A, Sankaran S, Lander H, Carsey T, Crabtree J, Crosas M, King G, Kum H-C, Zhan J. Sociometric Methods for Relevancy Analysis of Long Tail Science Data, in Social Computing (SocialCom), 2013 International Conference on. IEEE ; 2013 :1–6.

2011

Crosas M. The dataverse network®: an open-source application for sharing, discovering and preserving data. D-lib Magazine. 2011;17 :2.

2000

Knapp GR, Crosas M, Young K, Ivezić Ż. Atomic carbon in the envelopes of carbon-rich post-asymptotic giant branch stars. The Astrophysical Journal. 2000;534 :324.

1999

Knapp GR, Young K, Crosas M. The Circumstellar Envelope of pi Gru. arXiv preprint astro-ph/9903338. 1999.

Sakamoto K, Scoville NZ, Yun MS, Crosas M, Genzel R, Tacconi LJ. Counterrotating nuclear disks in Arp 220. The Astrophysical Journal. 1999;514 :68.

Wood K, Crosas M, Ghez A. GG Tauri's Circumbinary Disk: Models for Near-Infrared Scattered-Light Images and 13CO (J= 1→ 0) Line Profiles. The Astrophysical Journal. 1999;516 :335.

Knapp GR, Dobrovolsky SI, Ivezic Z, Young K, Crosas M, Mattei JA, Rupen MP. The light curve and evolutionary status of the carbon star V Hya. arXiv preprint astro-ph/9907234. 1999.

1998

Crosas M, Menten KM, Young K, Phillips TG. Radiative Transfer in a Turbulent Expanding Molecular Envelope: Application to Mira. In: Dust and Molecules in Evolved Stars. Springer ; 1998. pp. 189–192.

1997

Crosas M, Menten KM. Physical parameters of the IRC+10216 circumstellar envelope: new constraints from submillimeter observations. The Astrophysical Journal. 1997;483 :913.

1996

Crosas M, Weisheit J. Spallation in Active Galactic Nuclei. The Astrophysical Journal. 1996;465 :659.

1993

Crosas M, Weisheit J. Cosmic Rays in AGNs. Revista Mexicana de Astronomia y Astrofisica. 1993;27 :107.

Crosas M, Weisheit JC. Hydrogen molecules in quasar broad-line regions. Monthly Notices of the Royal Astronomical Society. 1993;262 :359–368.

Mercè Crosas
