Integrating PROV with DDI: Mechanisms of data discovery within the US Census Bureau

Main Authors: Bill Block, Warren Brown, Jeremy Williams, Lars Vilhuber, Carl Lagoze
Format: info Proceeding eJournal
Terbitan: , 2014
Online Access: https://zenodo.org/record/3780776
Daftar Isi:
  • Within the United States Census Bureau, datasets are often derived by complex methods that are not always well documented. This derivation process, or provenance, can be hard to understand for a researcher attempting to use or explore a given dataset. Without understanding the provenance of a dataset, it can be impossible establish whether it is appropriate to use for a given investigation, because its history remains a black box with no way to see inside. The infrastructure upon which the semantic web is built provides a means to label the relationships of social science datasets with logical meaning according to standardized ontologies and controlled vocabularies. This paper outlines the work of the Comprehensive Data Documentation and Access Repository (CED2AR) to integrate provenance metadata encoded according to the W3C PROV ontology with a DDI-based repository with the aim of making US Census data more discoverable and accessible.