Data Management Workshop

From Earth Science Information Partners (ESIP)

Background

The ESIP Federation, in cooperation with NOAA, seeks to share the community's knowledge with scientists who increasingly need to be better data managers. Over the next several years, the ESIP Federation expects to evolve training courses which seeks to improve the understanding of scientific data management among scientists and emerging scientists. Initially, a 1.5 hour workshop is to be held at the 2010 Fall meeting of the American Geophysical Union (AGU). The workshop may form the basis for an online course. Short courses, certificate programs, and university courses may be developed in the future. The AGU workshop is scheduled for Tuesday Dec. 14 from 1200h-1330h in Moscone South Rooms 228-230.

Advisory Team

  • Dave Anderson, NOAA/NCDC
  • Ken Casey, NOAA/NODC
  • Bob Cook, ORNL
  • Ruth Duerr, NSIDC/Chair, ESIP Data Preservation & Stewardship Cluster
  • Peter Fox, Rensselaer Polytechnic Institute, AGU Geoinformatics
  • Ted Habermann, NOAA/NGDC
  • Patricia Huff, NOAA/NESDIS
  • Carol Meyer, ESIP staff
  • Ken McDonald, NOAA
  • Nancy Ritchey, NOAA/NCDC
  • Ron Weaver, NSIDC

AGU Workshop Description

Writing Your Data Management Plan

Whether you need to include a data management plan in your NSF proposal, want to make data exchange in your field as transparent as possible, or just aim to maximize the visibility of your science in the Internet World, this workshop is for you. Earth scientists face increasing pressure to share their results not just in journals, but in many other settings. Data produced sometimes long ago for one purpose are now being successfully applied to emerging problems in entirely different disciplines. A concrete data management plan developed early in your research project can make you and your data more visible, more successful, and increase the impact of your science.

In this Earth Science Information Partners-sponsored workshop (ESIP), representatives from NOAA, NASA, and other data archive centers will provide an overview into the world of successful data stewardship, examine emerging standards and trends, and provide concrete steps for managing your Earth Science data. We will present our roadmap to completion of the recently distributed NSF data management requirement. We will conclude with a question and answer session. Workshop duration is 1.5 hours.

Outline

1. Introduction (20 min total)

  a. Welcome/ Goals (Anderson)
  b. Data management success story (video) (TBD)
  c. Return on your investment (video) (Fox)
  d. What not to do (video) Steffie Hampton, NCEAS Santa Barbara)
  e. Encouraging remarks from NSF (TBD)

2. Elements of a data management plan (35 min) Material adapted from DCC and other sources to be ‘from the scientists perspective’ rather than the archive perspective, trying to capture essential elements (common to any plan), avoid being prescriptive, avoid taking people down the wrong path (with info not relevant to them). Ruth leads development of this material, with Ron, Bob, possible help from Nancy Ritchey. Resource: Digital Curation Center (UK) checklist. DataOne and Data Conservancy are creating similar checklists. Possibility to make this interactive (Ron suggests scenario play).

3. Role of the long-term archive (20 min)

 a. Long term citability
 b. Long term accessibility (discovery, value added products, multiple access mechanisms)
 c. Archives are ready to receive your data
 d. Enable data mining and future uses of data
 e. Big payoff for doing this (video) (Fox) (this is our closing statement).

5. Questions from the audience (panel includes Ron Weaver (NSIDC), Ruth Duerr (NSIDC and Data Conservancy), Ken Casey (NODC), Bob Cook (ORNL and Data One), TBD (NSF/OCI), (TBD) DataNet (15 min)

HANDOUT: A one page summary with URL’s to more information.

Links

NSF Data Management Plan Description | http://www.nsf.gov/bfa/dias/policy/dmp.jsp


Workshop on "How to Prepare Ecological Data Sets for Effective Analysis and Sharing" | http://eco.confex.com/eco/2010/techprogram/S5744.HTM

Digital Curation Centre Plans | http://www.dcc.ac.uk/resources/data-management-plans

Questions for discussion session

-My data is already on my web site. Why do I have to use a long term archive?

-I intend to put into my NSF Data Management Plan that if funded, I will send my data to the National Data Center. But how do I do actually do that?"

-How do I know/determine the necessary metadata for my dataset?