Data Management Workshop
Background
The ESIP Federation, in cooperation with NOAA, seeks to share the community's knowledge with scientists who increasingly need to be better data managers. Over the next several years, the ESIP Federation expects to evolve training courses which seeks to improve the understanding of scientific data management among scientists and emerging scientists. Initially, a 1.5 hour workshop is to be held at the 2010 Fall meeting of the American Geophysical Union (AGU). The workshop may form the basis for an online course. Short courses, certificate programs, and university courses may be developed in the future. The AGU workshop is scheduled for Tuesday Dec. 14 from 1200h-1330h in Moscone South Rooms 228-230.
Advisory Team
- Dave Anderson, NOAA/NCDC
- Ken Casey, NOAA/NODC
- Bob Cook, ORNL
- Ruth Duerr, NSIDC/Chair, ESIP Data Preservation & Stewardship Cluster
- Peter Fox, Rensselaer Polytechnic Institute, AGU Geoinformatics
- Ted Habermann, NOAA/NGDC
- Patricia Huff, NOAA/NESDIS
- Carol Meyer, ESIP staff
- Ken McDonald, NOAA
- Nancy Ritchey, NOAA/NCDC
- Ron Weaver, NSIDC
AGU Workshop Description
Writing Your Data Management Plan
Whether you need to include a data management plan in your NSF proposal, want to make data exchange in your field as transparent as possible, or just aim to maximize the visibility of your science in the Internet World, this workshop is for you. Earth scientists face increasing pressure to share their results not just in journals, but in many other settings. Data produced sometimes long ago for one purpose are now being successfully applied to emerging problems in entirely different disciplines. A concrete data management plan developed early in your research project can make you and your data more visible, more successful, and increase the impact of your science.
In this Earth Science Information Partners-sponsored workshop (ESIP), representatives from NOAA, NASA, and other data archive centers will provide an overview into the world of successful data stewardship, examine emerging standards and trends, and provide concrete steps for managing your Earth Science data. We will present our roadmap to completion of the recently distributed NSF data management requirement. We will conclude with a question and answer session. Workshop duration is 1.5 hours.
Outline
1. Introduction (20 min total)
a. Welcome/ Goals (Anderson) b. Data preservation and climate change (video) (Tom Karl, Climate Service) c. Return on your investment (video) (Fox) d. What not to do (video) (Amber Budden (UNM) / Patricia Cruse (CDL), DataONE) e. Remarks from NSF (in person) (Cliff Jacobs, Geosciences Directorate) f. Outline (what this is, and is not) (Anderson)
2a. Elements of a data management plan (30 min) Ruth Duerr, Presenter Material adapted from DCC and other sources to be ‘from the scientists perspective’ rather than the archive perspective, trying to capture essential elements common to any plan, avoid being prescriptive, avoid taking people down the wrong path (with info not relevant to them). Ruth leads development of this material, with Ron, Bob, possible help from Nancy Ritchey. Resource: Digital Curation Center (UK) checklist. DataONE and Data Conservancy are creating similar checklists.
2b. Questions (panel includes Ron Weaver (NSIDC), Ruth Duerr (NSIDC and Data Conservancy), Ken Casey (NODC), Bob Cook (ORNL DAAC and DataONE), Viv Hutchison (USGS and DataONE), Cliff Jacobs, Geosciences Directorate, NSF (10 min).
3a. Long term archive topics (20 min)
a. What data goes to a long term archive, and what does not? (Ron) b. What do long term archives do with my data? (Ken Casey) c. What value do long term archives add (Long term accessibility, data mining and future use, discovery, value added products, multiple access mechanisms, long-term citability) d. Role of metadata (descriptions) in discovery and future use (Viv Hutchison, USGS and DataONE) e. Big payoff for doing this (video) (Fox) (closing statement).
3b. Second Question period (10 min)
HANDOUT: A one page summary with URL’s to more information.
Links
- NSF Data Management Plan Description | http://www.nsf.gov/bfa/dias/policy/dmp.jsp
- Workshop on "How to Prepare Ecological Data Sets for Effective Analysis and Sharing" | http://eco.confex.com/eco/2010/techprogram/S5744.HTM
- Ecological Society of America Annual Meeting, August 1, 2010
- Agenda | http://daac.ornl.gov/ESA_Workshops_2010/ESA2010_WK-13.shtml
- Elements of a Data Management Plan | http://daac.ornl.gov/ESA_Workshops_2010/data_management_plans_michener_20100731-1.pdf
- University of California Curation Center | http://www.cdlib.org/services/uc3/datamanagement/index.html
- Digital Curation Centre Plans | http://www.dcc.ac.uk/resources/data-management-plans
- DataONE | https://www.dataone.org
- Data Conservancy | http://dataconservancy.org
Questions for discussion session
-My data is already on my web site. Why do I have to use a long term archive?
-I intend to put into my NSF Data Management Plan that if funded, I will send my data to the National Data Center. But how do I do actually do that?"
-How do I know/determine the necessary metadata for my dataset?