Preservation and Stewardship Old
The objective of the new Preservation and Stewardship Cluster is to support the long-term preservation of Earth system science data and information. This Cluster provides a forum for ESIP members to collaborate on data preservation issues.
The Preservation and Stewardship cluster has regular telecons on the second Wednesday of every month at 3 PM EST
Telephone: 877-326-0011 Meeting #: *4917475*
Cluster Chair: Ruth Duerr, NSIDC firstname.lastname@example.org
To join the e-mail list for this cluster, visit | esip-preserve and submit a request to join.
Meeting Summaries and Preparation[edit | edit source]
- January 2011 ESIP Federation Meeting - Planning is underway
- July 2010 ESIP Federation Meeting - Preservation and Stewardship presentations are available.
- January 2010 ESIP Federation Meeting - Notes and presentations from preservation sessions are available.
- Notes from the 2009 fall AGU town hall on Peer-Reviewed Data Publication and Other Strategies to Sustain Verifiable Science.
- July 2009 ESIP Federation Meeting - Notes and presentations from preservation sessions are available.
- January 2009 ESIP Federation Meeting - Notes from the Interagency Forum on Data Preservation/LifeCycle/Stewardship, held Jan 8, 2009 at the ESIP Federation Meeting are available.
Next Telecon[edit | edit source]
Wednesday November 10, 2010, 1 pm MST (3 pm EST)
Meeting #: *4917475*
Data Management Workshop[edit | edit source]
Data Identifiers Testbed[edit | edit source]
Data Stewardship Principles[edit | edit source]
Standards that Support Preservation[edit | edit source]
- OAIS (Open Archival Information System): http://en.wikipedia.org/wiki/Open_Archival_Information_System
- OAIS-based standards
- Preservation Metadata Implementation Strategies (PREMIS) - preservation metadata standard developed by joint RLG-OCLC group and maintained by the Library of Congress
- CCSDS 651.0-B-1: Producer - Archive Interface Methodology Abstract Standard (ISO 14721:2006) - CCSDS sponsored ISO standard "to identify, define and provide structure to the relationships and interactions between an information Producer and an Archive."
- Requirements for Bodies Providing Audit and Certification of Digital Repositories - in development under CCSDS auspices for submission as an ISO standard in May/June 2009
- Metrics for Digital Repository Audit and Certification - in development under CCSDS auspices for submission as an ISO standard in May/June 2009
- Data Identification Standards (NOTE: Several members of this cluster are working on a paper that assesses the utility of these standards for earth science. If the proposal to the Federation for a testbed is funded, we may get to actually test these out!)
- The Data Documentation Initiative (DDI) is an international standard for describing data from the social, behavioral, and economic sciences. Expressed in XML, the DDI metadata specification supports the entire research data life cycle. DDI metadata accompanies and enables data conceptualization, collection, processing, distribution, discovery, analysis, repurposing, and archiving (http://ddi.icpsr.umich.edu/what)
Agency Strategies and Policies that Support Preservation[edit | edit source]
Procedural Requirements (NPR) 1441.1D: NASA Records Retention Schedules (1/31/08) 
This document covers all types of records managed by NASA. In Chapter 8 (Program Management Records), sections 101-113 (PROGRAM AND PROJECT RECORDS) are of particular interest. Quoted from the document:
"What items 101-113 cover. These items designate appropriate retention of NASA program and project records produced through compliance with NPR 7120.5 or other authorized project management practices. It provides for permanent retention of substantive and historically significant records, and temporary retention of other records until the Agency no longer needs them. The terms "program" and "project" are defined in the current versions of NPD 7120.4 and NPR 7120.5. This schedule applies to all activities performed as part of programs/projects whether designated "tasks," "work packages," or other terminology."
National Geological and Geophysical Data Preservation Program
Plan for the Scientific Data Stewardship Program 
Interagency Working Group on Digital Data
IWGDD represents 23 agencies including: NSF, NASA, DOE, USDA, HHS, Office of Science & Technology Policy. Established by the National Science and Technology Council's Committee on Science in 2006, its mission is to: - Develop and promote implementation of strategic plan for the federal government to cultivate an open, interoperable framework; and - Ensure reliable preservation and effective access to digital data for research, development, and education in science, technology, and engineering.
The Interagency Working Group on Digital Data (IWGDD) of the National Science and Technology Council's(NSTC) Committee on Science recently issued a report detailing a strategy to "ensure that digital scientific data can be reliably preserved for maximum use in catalyzing progress in science and society".
Their goal is to achieve this vision:
"Create a comprehensive framework of transparent, evolvable, extensible policies and management and organizational structures that provide reliable, effective access to the full spectrum of public digital scientific data. Such a framework will serve as a driving force for American leadership in science and in a competitive, global information society."
Their recommendations are that:
- a National Science and Technology Council (NSTC) Subcommittee for digital scientific data preservation, access, and interoperability be created;
- appropriate departments and agencies lay the foundations for agency digital scientific data policy and make the policy publicly available; and
- agencies promote a data management planning process for projects that generate
The full report can be obtained at http://www.nitrd.gov/about/Harnessing_Power_Web.pdf
Technologies that Support Data Preservation[edit | edit source]
Preservation Definitions[edit | edit source]
Data Archiving: Formally preserving data and information and making it available for an identified but potentially large and changing group of data consumers or users (Derived from the ISO standard Open Archival Information System (OAIS) Reference Model (CCSDS, 2002)). This definition applies to the archiving of any type of data or information whether it is a physical sample, a medieval manuscript, a photograph, or a digital data file.
Digital Preservation: “The series of actions and interventions required to ensure continued and reliable access to authentic digital objects for as long as they are deemed to be of value. This encompasses not just technical activities, but also all of the strategic and organisational considerations that relate to the survival and management of digital material.” Defined by the Digital Curation Centre (DCC) of the United Kingdom (Pennock, 2006, 1).
Digital Curation: “Maintaining and adding value to a trusted body of digital information for current and future use; specifically, the active management and appraisal of data over the life cycle of scholarly and scientific materials.” Defined by the DCC (http://www.dcc.ac.uk/about/).
Data Management: “Data Resource Management is the development and execution of architectures, policies, practices and procedures that properly manage the full data life cycle needs of an enterprise.” Defined by the Data Management Association, International (http://www.dama.org/i4a/pages/index.cfm?pageid=3339).
Data Stewardship: “All activities that preserve and improve the information content, accessibility, and usability of data and metadata. These activities include maintaining a scal¬able and reliable infrastructure to support long-term access and preservation, preserving data access and archive integrity during media migration and software evolution, providing effective data support services and tools for users, and enhancing data and metadata by adding information that is established throughout the data life cycle.” As defined by the National Research Council Committee on Archiving and Accessing Environmental and Geospatial Data at NOAA (NRC, 2007, 41).
Provenance and Context: See the Interagency Data Stewardship/LifeCycle Workshop Report
References[edit | edit source]
NRC (National Research Council). 2007. Environmental Data Management at NOAA: Archiving, Stewardship, and Access. Washington, DC: National Academies Press. 116 pp.
Cluster Chair: Ruth Duerr