Difference between revisions of "Interagency Data Stewardship/LifeCycle/Jul2009MeetingPlans"

From Earth Science Information Partners (ESIP)
m
Line 1: Line 1:
 
Please contribute your thoughts and suggestions on our upcoming plans for including Cluster Activities at the upcoming summer ESIP meeting (Santa Barbara, July 2009).
 
Please contribute your thoughts and suggestions on our upcoming plans for including Cluster Activities at the upcoming summer ESIP meeting (Santa Barbara, July 2009).
  
Current thinking is that there would be 4 two-hour sessions deliberately spread over two days to allow for discussion and reflection between sessions.  At this point it isn't entirely clear whether the results would be summarized as a workshop report though they clearly need to be captured, since they will be needed to support future activities.
+
'''This page was completely reworked on March 11, 2009 - Please review the history if you need access to prior versions.'''
  
The four sessions could be:
+
Current plans call for a session during the technology showcase on Day 1, two separate sessions early the next day, followed by a whole day spread over two days devoted to a provenance/context workshop.  Session descriptions, goals, outcomes, and potential speakers follow:
*'''Agencies'''
+
 
**The session would start with presentations by representatives of each agency (NASA, NOAA, EPA, USGS, Library of Congress, NARA, etc.) who will be asked to describe their agency's policies and procedures in regards to data stewardship/preservation; to discuss their actual practices in particular where they diverge from policy; to assess where the agency is headed and any future plans in this area; and to suggest areas where joint work might be advantageous
+
*'''Preservation technologies''' to be given during Day 1 (~2 hours)
**The intent would be to understand what is happening in other agencies on this topic, to motivate cross-agency coordination, and to determine topics ripe for joint development/work at the working level
 
**Speaker suggestions:
 
***Linda Campbell - LOC
 
***Bob Chadduck - NARA
 
*'''[[Interagency Data Stewardship/LifeCycle/Jul2009MeetingPlans#Standards Session |  Standards]]'''  
 
**Presentations would be given on the following topics
 
***Preservation standards
 
***Data formats
 
***Metadata formats
 
***Provenance
 
**The purpose of this session is both standards training and to raise awareness within the community of the standards that exist in the earth science
 
*'''Preservation technologies'''
 
 
**The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source)
 
**The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source)
**There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, etc.
+
**There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, as well as a variety of workflow related technologies, etc.
 
**Topics each speaker should cover:
 
**Topics each speaker should cover:
 
***Purpose of the technology (what aspects of data lifecycle does the technology support)
 
***Purpose of the technology (what aspects of data lifecycle does the technology support)
 
***Capabilities
 
***Capabilities
 
***Known Limitations
 
***Known Limitations
 +
***Special emphasis given to discussion of how provenance/context is handled
 
**Suggested speakers
 
**Suggested speakers
*'''Non-Earth Science disciplines'''
+
 
**What are other disciplines doing for preservation/stewardship?  Any lessons to be learned and incorporated into earth science practice?   
+
*'''Standards''' - this session should be held prior to the start of the Provenance workshop (~2 hours)
 +
**Presentations would be given on the following topics
 +
***Preservation standards
 +
****OAIS - Have someone knowledgeable about OAIS to explain what it is, how it is being used by NOAA and other agencies.  (Why is it important to use it? Is it a mandatory for agencies to use? If so, who made it mandatory?) 
 +
***Data formats - Discuss what is important in data formats to ensure long term preservation of data – talk about HDF, HDF-EOS and NetCDF in this context. What about agencies other than NASA and NOAA? What formats do they use? How does one ensure that data stored in HDF/HDF-EOS/NetCDF continue to be readable and understandable 50 years from now? Etc.
 +
***Metadata formats – treat similarly to data formats considering metadata standards currently in use (ISO standards, North American Profile, CF-1, COARDS, PREMIS).
 +
**The purpose of this session is both standards training and to raise awareness within the community of the standards that exist in the earth science.  It is also to determine where additional standards work is needed, where agency collaboration can help move things forward, etc.
 +
**Suggested speakers:
 +
***Mike Folk on HDF efforts to improve data preservation
 +
 
 +
*'''The View from the Field''' (~2 hours)
 +
**What are other disciplines doing for preservation/stewardship?  How do they deal with databases, collections of files, physical objects, ad-hoc services such as work flows?  How do they deal with provenance?  Any lessons to be learned and incorporated into earth science practice?   
 
**Biology, Astronomy, Medicine, etc. are potential disciplines to be covered
 
**Biology, Astronomy, Medicine, etc. are potential disciplines to be covered
 
**Suggested speakers:
 
**Suggested speakers:
 
***Clifford Duke - Ecological Society of America
 
***Clifford Duke - Ecological Society of America
  
== Standards Session ==
+
*The bulk of the time would be spent on a Provenance/Context Workshop (roughly 8 hours spread over 2 days) with agenda:
 
+
**'''Introduction''' - Purpose of the workshop, overview of agenda, process
'''Topical outline for discussion at cluster meeting - Santa Barbara, July 2009:'''
+
**'''Other agencies''' - should this be inside or outside the workshop?
 
+
***The session would start with presentations by representatives from the Library of Congress, NARA, and NSFNARA and LOC would be asked to cover:
#OAIS - Have someone knowledgeable about OAIS to explain what it is, how it is being used by NOAA and other agencies(Why is it important to use it? Is it a mandatory for agencies to use? If so, who made it mandatory?)
+
****What are they doing in regards to earth science data?
#Data Formats - Discuss what is important in data formats to ensure long term preservation of data – talk about HDF, HDF-EOS and NetCDF in this context. What about agencies other than NASA and NOAA? What formats do they use? How does one ensure that data stored in HDF/HDF-EOS/NetCDF continue to be readable and understandable 50 years from now? Etc.
+
****What programs do they have moving preservation practice forward?  Status and results.
#Metadata Formats – treat similarly to 2 considering metadata standards currently in use (ISO standards, North American Profile, CF-1, COARDS, PREMIS).
+
****What standards they use and promulate?
#Provenance Standards – have someone knowledgeable discuss state of the art. Should there be a common set of requirements to preserve provenance?
+
****What do they do for provenance and records tracking?
 
+
****How do they deal with harmonization issues (i.e. with heterogenous standards, policies, and practices)?
I invite everyone to look at this outline and comment. Also, either volunteers or recommendations for speakers to cover these areas would be most welcome.
+
***NSF would be asked to cover:
 
+
****A description of the NSF program, status, and results
Depending on the scheduling for the other topic areas to be covered at the meeting, we may have 1.5 to 2 hours for this area. So, 20-30 minutes for each of the four items in the above outline would be the budget.
+
***Speaker suggestions:
 
+
****Bob Chadduck - NARA
[[User:Ramapriyan|Ramapriyan]] 15:38, 5 March 2009 (EST)
+
****Laura Campbell - LOC
 +
**'''Prior work'''
 +
***OAIS descriptions of provenance/context
 +
***Hunolt document
 +
**'''Provenance research'''
 +
***Briefings on some of the research in this area
 +
***Speaker suggestions:
 +
****Jim Frew
 +
****Bruce Barkstrom
 +
****Ruth Duerr - creation of archive packages
 +
**'''Develop requirements, guidelines, best practices for provenance/context information and standards in the earth sciences'''

Revision as of 18:16, March 11, 2009

Please contribute your thoughts and suggestions on our upcoming plans for including Cluster Activities at the upcoming summer ESIP meeting (Santa Barbara, July 2009).

This page was completely reworked on March 11, 2009 - Please review the history if you need access to prior versions.

Current plans call for a session during the technology showcase on Day 1, two separate sessions early the next day, followed by a whole day spread over two days devoted to a provenance/context workshop. Session descriptions, goals, outcomes, and potential speakers follow:

  • Preservation technologies to be given during Day 1 (~2 hours)
    • The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source)
    • There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, as well as a variety of workflow related technologies, etc.
    • Topics each speaker should cover:
      • Purpose of the technology (what aspects of data lifecycle does the technology support)
      • Capabilities
      • Known Limitations
      • Special emphasis given to discussion of how provenance/context is handled
    • Suggested speakers
  • Standards - this session should be held prior to the start of the Provenance workshop (~2 hours)
    • Presentations would be given on the following topics
      • Preservation standards
        • OAIS - Have someone knowledgeable about OAIS to explain what it is, how it is being used by NOAA and other agencies. (Why is it important to use it? Is it a mandatory for agencies to use? If so, who made it mandatory?)
      • Data formats - Discuss what is important in data formats to ensure long term preservation of data – talk about HDF, HDF-EOS and NetCDF in this context. What about agencies other than NASA and NOAA? What formats do they use? How does one ensure that data stored in HDF/HDF-EOS/NetCDF continue to be readable and understandable 50 years from now? Etc.
      • Metadata formats – treat similarly to data formats considering metadata standards currently in use (ISO standards, North American Profile, CF-1, COARDS, PREMIS).
    • The purpose of this session is both standards training and to raise awareness within the community of the standards that exist in the earth science. It is also to determine where additional standards work is needed, where agency collaboration can help move things forward, etc.
    • Suggested speakers:
      • Mike Folk on HDF efforts to improve data preservation
  • The View from the Field (~2 hours)
    • What are other disciplines doing for preservation/stewardship? How do they deal with databases, collections of files, physical objects, ad-hoc services such as work flows? How do they deal with provenance? Any lessons to be learned and incorporated into earth science practice?
    • Biology, Astronomy, Medicine, etc. are potential disciplines to be covered
    • Suggested speakers:
      • Clifford Duke - Ecological Society of America
  • The bulk of the time would be spent on a Provenance/Context Workshop (roughly 8 hours spread over 2 days) with agenda:
    • Introduction - Purpose of the workshop, overview of agenda, process
    • Other agencies - should this be inside or outside the workshop?
      • The session would start with presentations by representatives from the Library of Congress, NARA, and NSF. NARA and LOC would be asked to cover:
        • What are they doing in regards to earth science data?
        • What programs do they have moving preservation practice forward? Status and results.
        • What standards they use and promulate?
        • What do they do for provenance and records tracking?
        • How do they deal with harmonization issues (i.e. with heterogenous standards, policies, and practices)?
      • NSF would be asked to cover:
        • A description of the NSF program, status, and results
      • Speaker suggestions:
        • Bob Chadduck - NARA
        • Laura Campbell - LOC
    • Prior work
      • OAIS descriptions of provenance/context
      • Hunolt document
    • Provenance research
      • Briefings on some of the research in this area
      • Speaker suggestions:
        • Jim Frew
        • Bruce Barkstrom
        • Ruth Duerr - creation of archive packages
    • Develop requirements, guidelines, best practices for provenance/context information and standards in the earth sciences