Difference between revisions of "Data Management Workshop"

From Earth Science Information Partners (ESIP)
 
(18 intermediate revisions by 3 users not shown)
Line 12: Line 12:
 
*Patricia Huff, NOAA/NESDIS
 
*Patricia Huff, NOAA/NESDIS
 
*Carol Meyer, ESIP staff
 
*Carol Meyer, ESIP staff
*Ken McDonald, NOAA
 
 
*Nancy Ritchey, NOAA/NCDC
 
*Nancy Ritchey, NOAA/NCDC
 
*Ron Weaver, NSIDC
 
*Ron Weaver, NSIDC
Line 25: Line 24:
 
===Outline===
 
===Outline===
 
1. Introduction (20 min total)
 
1. Introduction (20 min total)
  a. Welcome/ Goals (Anderson)
+
*Welcome/ Goals (Anderson)
  b. Data preservation and climate change (video) (Tom Karl, Climate Service)  
+
*Data preservation and climate change (video) (Tom Karl, Climate Service)  
  c. Return on your investment (video) (Fox)
+
*Return on your investment (video) (Peter Fox)
  d. What not to do (video) (Amber Budden, Data One)
+
*What not to do (video) (Anderson)
  e. Remarks from NSF (in person) (Cliff Jacobs, Geosciences Directorate)
+
*Remarks from NSF (video) (Cliff Jacobs, Geosciences Directorate)
  f. Outline (what this is, and is not) (Anderson)
+
*Outline (what this is, and is not) (Anderson)
  
 
2a. Elements of a data management plan (30 min) Ruth Duerr, Presenter
 
2a. Elements of a data management plan (30 min) Ruth Duerr, Presenter
Material adapted from DCC and other sources to be ‘from the scientists perspective’ rather than the archive perspective, trying to capture essential elements common to any plan, avoid being prescriptive, avoid taking people down the wrong path (with info not relevant to them).  Ruth leads development of this material, with Ron, Bob, possible help from Nancy Ritchey.  Resource: Digital Curation Center (UK) checklist. DataOne and Data Conservancy are creating similar checklists.  
+
Material adapted from DCC and other sources to be ‘from the scientists perspective’ rather than the archive perspective, trying to capture essential elements common to any plan and avoid being prescriptive.  Ruth leads development of this material, with Ron, Bob, possible help from Nancy Ritchey.  Resource: Digital Curation Center (UK) checklist. DataONE and Data Conservancy are creating similar checklists.  
 +
* Identify the materials that will be created
 +
* Standards and organization
 +
* Access, sharing, and re-use
 +
* Backups, archiving, and preservation
  
2b. Questions (panel includes Ron Weaver (NSIDC), Ruth Duerr (NSIDC and Data Conservancy), Ken Casey (NODC), Bob Cook (ORNL and Data One), Viv Hutchison (USGS), Cliff Jacobs, Geosciences Directorate, NSF (10 min).
+
2b. Questions (panel includes Ron Weaver (NSIDC), Ruth Duerr (NSIDC and Data Conservancy), Ken Casey (NODC), Bob Cook (ORNL DAAC and DataONE), Cliff Jacobs, Geosciences Directorate, NSF (10 min)).
  
 
3a. Long term archive topics (20 min)
 
3a. Long term archive topics (20 min)
  a. What data goes to a long term archive, and what does not? (Ron)
+
*What data goes to a long term archive, and what does not? (Weaver)
  b. What do long term archives do with my data? (Ken Casey)
+
*What do long term archives do with my data? (Casey)
  c. What value do long term archives add (Long term accessibility, data mining and future use, discovery, value added products, multiple access mechanisms, long-term citability)
+
*Role of metadata (descriptions) in discovery and future use (Cook)
  d. Role of metadata (descriptions) in discovery and future use (Viv Hutchinson, USGS)  
+
*Big payoff (video) (Fox).
  e. Big payoff for doing this (video) (Fox) (closing statement).
 
 
 
3b. Second Question period (10 min)
 
  
 +
3b. Second Question period (10 min)
  
 
HANDOUT: A one page summary with URL’s to more information.
 
HANDOUT: A one page summary with URL’s to more information.
  
 
===Links===
 
===Links===
NSF Data Management Plan Description |  
+
*NSF Data Management Plan Description | http://www.nsf.gov/bfa/dias/policy/dmp.jsp
http://www.nsf.gov/bfa/dias/policy/dmp.jsp
+
**NSF FAQ on the data management and sharing policy | http://www.nsf.gov/bfa/dias/policy/dmpfaqs.jsp  
 +
*Workshop on "How to Prepare Ecological Data Sets for Effective Analysis and Sharing" | http://eco.confex.com/eco/2010/techprogram/S5744.HTM
 +
**Ecological Society of America Annual Meeting, August 1, 2010
 +
**Agenda | http://daac.ornl.gov/ESA_Workshops_2010/ESA2010_WK-13.shtml
 +
**Elements of a Data Management Plan | http://daac.ornl.gov/ESA_Workshops_2010/data_management_plans_michener_20100731-1.pdf
 +
*University of California Curation Center |  http://www.cdlib.org/services/uc3/datamanagement/index.html
 +
*Digital Curation Centre Plans | http://www.dcc.ac.uk/resources/data-management-plans
 +
*DataONE | http://www.dataone.org
 +
*Data Conservancy | http://dataconservancy.org
 +
*Survey for AGU Workshop [[Media:survey.pdf]]
 +
 
 +
===Questions for discussion session===
 +
-My data is already on my web site.  Why do I have to use a long term archive?
 +
 
 +
-I intend to put into my NSF Data Management Plan that if funded, I will send my data to the National Data Center.  But how do I do actually do that?
 +
 
 +
-How do I know/determine the necessary metadata for my dataset?
  
 +
-How can archives promise to make data available forever?
  
Workshop on "How to Prepare Ecological Data Sets for Effective Analysis and Sharing" | http://eco.confex.com/eco/2010/techprogram/S5744.HTM
+
-I hate metadata. What is the minimum information I need to provide with my data?
*Ecological Society of America Annual Meeting, August 1, 2010
 
*Agenda | http://daac.ornl.gov/ESA_Workshops_2010/ESA2010_WK-13.shtml
 
*Elements of a Data Management Plan | http://daac.ornl.gov/ESA_Workshops_2010/data_management_plans_michener_20100731-1.pdf
 
  
Digital Curation Centre Plans | http://www.dcc.ac.uk/resources/data-management-plans
+
-What is a submission agreement?
  
===Questions for discussion session===
+
-When to I have to make my data available to the public?
-My data is already on my web site.  Why do I have to use a long term archive?
 
  
-I intend to put into my NSF Data Management Plan that if funded, I will send my data to the National Data Center.  But how do I do actually do that?"
+
-What is the cloud and how does it relate to data sharing and long term preservation?
  
-How do I know/determine the necessary metadata for my dataset?
+
-What is a DOI?

Latest revision as of 18:02, April 13, 2011

Background

The ESIP Federation, in cooperation with NOAA, seeks to share the community's knowledge with scientists who increasingly need to be better data managers. Over the next several years, the ESIP Federation expects to evolve training courses which seeks to improve the understanding of scientific data management among scientists and emerging scientists. Initially, a 1.5 hour workshop is to be held at the 2010 Fall meeting of the American Geophysical Union (AGU). The workshop may form the basis for an online course. Short courses, certificate programs, and university courses may be developed in the future. The AGU workshop is scheduled for Tuesday Dec. 14 from 1200h-1330h in Moscone South Rooms 228-230.

Advisory Team

  • Dave Anderson, NOAA/NCDC
  • Ken Casey, NOAA/NODC
  • Bob Cook, ORNL
  • Ruth Duerr, NSIDC/Chair, ESIP Data Preservation & Stewardship Cluster
  • Peter Fox, Rensselaer Polytechnic Institute, AGU Geoinformatics
  • Ted Habermann, NOAA/NGDC
  • Patricia Huff, NOAA/NESDIS
  • Carol Meyer, ESIP staff
  • Nancy Ritchey, NOAA/NCDC
  • Ron Weaver, NSIDC

AGU Workshop Description

Writing Your Data Management Plan

Whether you need to include a data management plan in your NSF proposal, want to make data exchange in your field as transparent as possible, or just aim to maximize the visibility of your science in the Internet World, this workshop is for you. Earth scientists face increasing pressure to share their results not just in journals, but in many other settings. Data produced sometimes long ago for one purpose are now being successfully applied to emerging problems in entirely different disciplines. A concrete data management plan developed early in your research project can make you and your data more visible, more successful, and increase the impact of your science.

In this Earth Science Information Partners-sponsored workshop (ESIP), representatives from NOAA, NASA, and other data archive centers will provide an overview into the world of successful data stewardship, examine emerging standards and trends, and provide concrete steps for managing your Earth Science data. We will present our roadmap to completion of the recently distributed NSF data management requirement. We will conclude with a question and answer session. Workshop duration is 1.5 hours.

Outline

1. Introduction (20 min total)

  • Welcome/ Goals (Anderson)
  • Data preservation and climate change (video) (Tom Karl, Climate Service)
  • Return on your investment (video) (Peter Fox)
  • What not to do (video) (Anderson)
  • Remarks from NSF (video) (Cliff Jacobs, Geosciences Directorate)
  • Outline (what this is, and is not) (Anderson)

2a. Elements of a data management plan (30 min) Ruth Duerr, Presenter Material adapted from DCC and other sources to be ‘from the scientists perspective’ rather than the archive perspective, trying to capture essential elements common to any plan and avoid being prescriptive. Ruth leads development of this material, with Ron, Bob, possible help from Nancy Ritchey. Resource: Digital Curation Center (UK) checklist. DataONE and Data Conservancy are creating similar checklists.

  • Identify the materials that will be created
  • Standards and organization
  • Access, sharing, and re-use
  • Backups, archiving, and preservation

2b. Questions (panel includes Ron Weaver (NSIDC), Ruth Duerr (NSIDC and Data Conservancy), Ken Casey (NODC), Bob Cook (ORNL DAAC and DataONE), Cliff Jacobs, Geosciences Directorate, NSF (10 min)).

3a. Long term archive topics (20 min)

  • What data goes to a long term archive, and what does not? (Weaver)
  • What do long term archives do with my data? (Casey)
  • Role of metadata (descriptions) in discovery and future use (Cook)
  • Big payoff (video) (Fox).

3b. Second Question period (10 min)

HANDOUT: A one page summary with URL’s to more information.

Links

Questions for discussion session

-My data is already on my web site. Why do I have to use a long term archive?

-I intend to put into my NSF Data Management Plan that if funded, I will send my data to the National Data Center. But how do I do actually do that?

-How do I know/determine the necessary metadata for my dataset?

-How can archives promise to make data available forever?

-I hate metadata. What is the minimum information I need to provide with my data?

-What is a submission agreement?

-When to I have to make my data available to the public?

-What is the cloud and how does it relate to data sharing and long term preservation?

-What is a DOI?