Difference between revisions of "Interagency Data Stewardship/LifeCycle/Jul2009MeetingPlans"
From Earth Science Information Partners (ESIP)
Ramapriyan (talk | contribs) (Updated Standards part of agenda with suggested speakers on April 3, 2009 - H. K. "Rama" Ramapriyan) |
|||
Line 5: | Line 5: | ||
Current plans call for a session during the technology showcase on Day 1, two separate sessions early the next day, followed by a whole day spread over two days devoted to a provenance/context workshop. Session descriptions, goals, outcomes, and potential speakers follow: | Current plans call for a session during the technology showcase on Day 1, two separate sessions early the next day, followed by a whole day spread over two days devoted to a provenance/context workshop. Session descriptions, goals, outcomes, and potential speakers follow: | ||
− | *'''Preservation technologies''' to be given during Day 1 ( | + | *'''Preservation technologies''' to be given during Day 1 as a series of 1.5 hour technical workshops (perhaps a few may be split in half?) |
**The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source) | **The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source) | ||
**There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, as well as a variety of workflow related technologies, etc. | **There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, as well as a variety of workflow related technologies, etc. | ||
Line 14: | Line 14: | ||
***Special emphasis given to discussion of how provenance/context is handled | ***Special emphasis given to discussion of how provenance/context is handled | ||
**Suggested speakers | **Suggested speakers | ||
− | ***Fedora and maybe Duraspace - Thornton Staples - | + | ***Fedora and maybe Duraspace - Thornton Staples - confirmed |
− | ***LOCKSS - Vicky Reich or David Rosenthal | + | ***LOCKSS - Vicky Reich or David Rosenthal - not confirmed |
− | ***iRODS - Reagan Moore | + | ***iRODS - Reagan Moore - not confirmed |
***Whom else? What about the request for presentations from workflow related technologies? | ***Whom else? What about the request for presentations from workflow related technologies? | ||
− | *'''Standards''' - this session | + | ****Brian Wilson (JPL) - sciflo |
+ | ****Carol Goebel (University of Manchester) - iExperiment.org | ||
+ | ****Jim Frew (UCSB) | ||
+ | ****John Howard (Arizona State) | ||
+ | *'''Standards''' - this session could be held in the midst of the Provenance and Context workshop (~2 hours) | ||
**Session Goals | **Session Goals | ||
***Standards training | ***Standards training | ||
Line 32: | Line 36: | ||
****Don Sawyer (overview) | ****Don Sawyer (overview) | ||
****Ken McDonald/Rob Rank/Bruce Barkstrom (NOAA usage) | ****Ken McDonald/Rob Rank/Bruce Barkstrom (NOAA usage) | ||
+ | ****Chris Drescher (USGS usage) | ||
****John Moses/Jeanne Behnke (NASA EOSDIS Data Centers' usage) | ****John Moses/Jeanne Behnke (NASA EOSDIS Data Centers' usage) | ||
***Data Formats | ***Data Formats | ||
****Mike Folk on HDF efforts to improve data preservation (and cover NetCDF as well or get Mohan Ramamurthy for NetCDF) | ****Mike Folk on HDF efforts to improve data preservation (and cover NetCDF as well or get Mohan Ramamurthy for NetCDF) | ||
***Metadata content and format standards | ***Metadata content and format standards | ||
− | **** | + | ****Siri Jodha Singh Khalsa/Ted Habermann (ISO standards, NASA ECS Data Model, FGDC) |
+ | ****Doug Nebert (FGDC) | ||
****Rebecca Guenther (PREMIS) | ****Rebecca Guenther (PREMIS) | ||
Line 44: | Line 50: | ||
**Suggested speakers: | **Suggested speakers: | ||
***Clifford Duke - Ecological Society of America | ***Clifford Duke - Ecological Society of America | ||
+ | ***Nirav Merchant - iPlant - not confirmed | ||
+ | ***Robert Hanisch or Ani Thakar - National Virtual Observatory - not confirmed | ||
**Representatives from the Library of Congress, NARA, and NSF would also be asked to present. NARA and LOC would be asked to cover: | **Representatives from the Library of Congress, NARA, and NSF would also be asked to present. NARA and LOC would be asked to cover: | ||
***What are they doing in regards to earth science data? | ***What are they doing in regards to earth science data? | ||
Line 53: | Line 61: | ||
****A description of the NSF program, status, and results | ****A description of the NSF program, status, and results | ||
**Speaker suggestions: | **Speaker suggestions: | ||
− | ***Bob Chadduck - NARA | + | ***Bob Chadduck - NARA - not confirmed |
− | ***Laura Campbell - LOC | + | ***Laura Campbell - LOC representation is confirmed |
+ | ***Frank Olken - NSF - not confirmed | ||
Revision as of 11:59, April 9, 2009
Please contribute your thoughts and suggestions on our upcoming plans for including Cluster Activities at the upcoming summer ESIP meeting (Santa Barbara, July 2009).
This page was completely reworked on March 11, 2009 - Please review the history if you need access to prior versions.
Current plans call for a session during the technology showcase on Day 1, two separate sessions early the next day, followed by a whole day spread over two days devoted to a provenance/context workshop. Session descriptions, goals, outcomes, and potential speakers follow:
- Preservation technologies to be given during Day 1 as a series of 1.5 hour technical workshops (perhaps a few may be split in half?)
- The intent of this session is to determine and begin to assess preservation technologies that exist in the market place (both commercial and open source)
- There would be presentations on technologies like Fedora, DSpace, DuraSpace, IRods, NCore, LOCKSS, as well as a variety of workflow related technologies, etc.
- Topics each speaker should cover:
- Purpose of the technology (what aspects of data lifecycle does the technology support)
- Capabilities
- Known Limitations
- Special emphasis given to discussion of how provenance/context is handled
- Suggested speakers
- Fedora and maybe Duraspace - Thornton Staples - confirmed
- LOCKSS - Vicky Reich or David Rosenthal - not confirmed
- iRODS - Reagan Moore - not confirmed
- Whom else? What about the request for presentations from workflow related technologies?
- Brian Wilson (JPL) - sciflo
- Carol Goebel (University of Manchester) - iExperiment.org
- Jim Frew (UCSB)
- John Howard (Arizona State)
- Standards - this session could be held in the midst of the Provenance and Context workshop (~2 hours)
- Session Goals
- Standards training
- Raising awareness within the community of the standards that exist in the earth science
- Determine where additional standards work is needed, where agency collaboration can help move things forward, etc.
- Presentations would be given on the following topics
- Preservation standards
- OAIS - Have someone knowledgeable about OAIS to explain what it is, how it is being used by NOAA and other agencies. (Why is it important to use it? Is it a mandatory for agencies to use? If so, who made it mandatory?)
- Data formats - Discuss what is important in data formats to ensure long term preservation of data – talk about HDF, HDF-EOS and NetCDF in this context. What about agencies other than NASA and NOAA? What formats do they use? How does one ensure that data stored in HDF/HDF-EOS/NetCDF continue to be readable and understandable 50 years from now? Etc.
- Metadata formats – treat similarly to data formats considering metadata standards currently in use (ISO standards, North American Profile, CF-1, COARDS, PREMIS).
- Preservation standards
- Suggested speakers:
- OAIS
- Don Sawyer (overview)
- Ken McDonald/Rob Rank/Bruce Barkstrom (NOAA usage)
- Chris Drescher (USGS usage)
- John Moses/Jeanne Behnke (NASA EOSDIS Data Centers' usage)
- Data Formats
- Mike Folk on HDF efforts to improve data preservation (and cover NetCDF as well or get Mohan Ramamurthy for NetCDF)
- Metadata content and format standards
- Siri Jodha Singh Khalsa/Ted Habermann (ISO standards, NASA ECS Data Model, FGDC)
- Doug Nebert (FGDC)
- Rebecca Guenther (PREMIS)
- OAIS
- Session Goals
- The View from the Field (~2 hours)
- What are other disciplines and agencies doing for preservation/stewardship? How do they deal with databases, collections of files, physical objects, ad-hoc services such as work flows? How do they deal with provenance? Any lessons to be learned and incorporated into earth science practice?
- Biology, Astronomy, Medicine, etc. are potential disciplines to be covered
- Suggested speakers:
- Clifford Duke - Ecological Society of America
- Nirav Merchant - iPlant - not confirmed
- Robert Hanisch or Ani Thakar - National Virtual Observatory - not confirmed
- Representatives from the Library of Congress, NARA, and NSF would also be asked to present. NARA and LOC would be asked to cover:
- What are they doing in regards to earth science data?
- What programs do they have moving preservation practice forward? Status and results.
- What standards they use and promulate?
- What do they do for provenance and records tracking?
- How do they deal with harmonization issues (i.e. with heterogenous standards, policies, and practices)?
- NSF would be asked to cover:
- A description of the NSF program, status, and results
- Speaker suggestions:
- Bob Chadduck - NARA - not confirmed
- Laura Campbell - LOC representation is confirmed
- Frank Olken - NSF - not confirmed
- Provenance and Context part 1
The bulk of the time would be spent on a Provenance/Context Workshop (roughly 8 hours spread over 2 days) with agenda:
- Introduction - Purpose of the workshop, overview of agenda, process-- Duerr (5min)
- Provenance and Context in the Field
- brief presentations (10 min each) from ~4 of our invited experts providing more detail on provenance and context.
- Prior work
- Summary from Winter Meeting--Raskin (15 min)
- Review of Guiding Documents: OAIS, PREMIS, USGCRP descriptions of provenance/context -- Duerr (30 Min)
- Provenance and Context research
- 3-4 15 min briefings on some of the research projects in this area and how they fit into overall research agenda
- Speaker suggestions:
- Jim Frew
- Bruce Barkstrom
- Ruth Duerr - creation of archive packages
- Open Discussion
- Review what we learned in preceeding sessions
- Determine what's missing
- Plan initial approach to creating final document
- Provenance and Context part 2
- Review prior days plan. New ideas? Modifications.
- Create outline for a recommended Research Agenda and Short-term Action Plan
- Break into small writing teams
- Reconvene and determine plan for finalizing document