| Title | Data Provenance for Preservation of Digital Geoscience Data |
| Publication Type | Book Chapter |
| Year of Publication | 2011 |
| Authors | Plale, B., B. Cao, C. Herath, and Y. Sun |
| Refereed Designation | Unknown |
| Secondary Authors | Sinha, K. A., D. Arctur, I. Jackson, and L. Gundersen |
| Book Title | Societal Challenges and Geoinformatics |
| Series Title | Geological Society of America Special Papers |
| Volume | 482 |
| Pagination | 125-137 |
| Publisher | Geological Society of America (GSA) |
| Publication Language | eng |
| ISBN Number | 978-0-8137-2482-9 |
| Keywords | Cyberinfrastructure, data provenance, digital data preservation, geoinformatics |
| Abstract | A necessary first step in the preservation of digital scientific data is gathering enough information “about” a scientific outcome or data collection that it can be discovered and used a decade from now as easily as it is reused next week. Data provenance, or lineage of a collection, can capture how a particular scientific collection was created, when, and by whom. Our goal is to devise tools that automate the collection of provenance so that this task does not fall onto the researcher, and to efficiently store and represent the provenance data that makes the data more amenable to long-term preservation. We demonstrate through application to several projects that automated provenance collection can reach the necessary level of provenance, but challenges remain in addressing provenance collection in a non-workflow setting and in data preservation in cyberinfrastructure architectures. |
| DOI | 10.1130/2011.2482(11) |