Relating data practices, types, and curation functions: An empirically derived framework


  • Copyright is held by the authors. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted provided that copies are not made or distributed for profit or commercial advantage and that copies include a full citation. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission.


We present a general conceptual framework that maps relationships and dependencies among scientific data practices, types of data produced and used, and associated curation activities. As part of the Data Conservancy initiative, the framework is being elaborated through empirical studies of data practices in the earth sciences and life science and validated against use cases as curatorial services are developed around data being prepared for ingest into the repository. The framework can be applied more broadly for identifying and representing curation requirements and to support description and assessment of existing or planned curation infrastructure and services. It will support full accounts of the data products and workflows required to maintain the coherence and context of complex data collections.