Preserving and making available curated data is central to the National Science Foundations directives. We are working to meet the needs of researchers in archiving, preserving, and serving data out to researchers.
Data and metadata standards
Following along these guidlines:
the MRL aims to provide a format for researchers to archive and curate analyzed data and its metadata. Many journals provide for the submission of supporting and secondary data which meet the NSF's goals and the MRL attempts not to duplicate those submissions.
While curating and archiving all raw data would be unsustainable, data in support of publications will be archived in the Merritt repository ( http://www.cdlib.org/uc3/merritt/ ).
Merritt is a repository service and curation environment for storage and preservation of data and is provided by the California Digital Library. Merritt is used to manage, archive, and share content. It provides significant features for a digital object:
• permanent storage
• access via persistent URLs
• tools for long term management
• easy-to-use interface for deposits and udpates
Merritt uses the Dublin Core format for metadata standards and researchers will be asked to generally include descriptions of instruments used, analytic techniques, dates, data description, DOI links and other metadata which Merritt will abstract in to XML following the Dublin Core model. Upon paper submission researchers create a container of primary data tagged with the appropriate metadata in the Merritt repository. For training and best-practices please contact the computational facility manager.
Backup and security are maintained by the MRL. Data security and integrity is of central concern to the MRL and the MRL may mirror its data stores to UCLA in the near-term.
Researchers will use an existing tape library for long-term data storage and archiving. Tape archives will be kept for 6 years beyond the end of the grant, though primary data may reside in Merritt for much longer.