Data Management Course Outline
From Earth Science Information Partners (ESIP)
Revision as of 10:44, July 17, 2012 by Tyler Stevens (Tstevens) (talk | contribs) (→Local Data Management)
Back to the main data management training page
NOTE: We agreed that the target audience initially would be scientists
Caution!!!!
All of the modules on this site are draft materials only! They are made available here so that interested parties can see what is in development and have the chance to comment. Once modules have completed the peer and editorial review process, they will be moved to the ESIP Information Commons and placed under revision control.
Module template and Author Guidelines
- Data management training module template (ppt) (Updated - Now with Voiceover scripts & revised Refs/Resources slides)
- Author Guidelines
- Module Review Criteria
For Scientists
The case for data stewardship
- Agency requirements - Ruth
- NSF data management plans - Ruth
- NASA science data policy - Ron Weaver
- NOAA Administrative Order 212-15, Management of Environmental and Geospatial Data and Information - Jeff Arnfield/NCDC
- Return on Investment - Erin/Carol
- Return on Scientist's investment - Ruth
- Return on the public's investment - Ruth
- Verifiable science - Tying your data to standards, metrics, and benchmarks
- Facilitating science through interoperable discovery and access - Jeff Arnfield/NCDC
- Enhancing your reputation - Matt Mayernik/NCAR
- Preserving the Scientific Record - Matt
- What Not to do when Archiving Data!
Data Management plans
- Why do a data management plan? - Ruth
- Elements of a plan - Ruth
- Estimating effort and resources required - Ruth
- Some available resources to help with developing your plan - Ruth
Local Data Management
- Managing your data - Ruth
- Data identifiers and locators
- File naming conventions Bob Cook/ORNL
- Backing up your data Bob Cook/ORNL
- Write it down! Maintaining contemporaneous documentation
- Who, what, when, where, why, how
- Tracking and describing changes to the data
- Lab-based approaches to Data Management - Lynn Yarmey/NSIDC
- Data Formats - Ruth
- Avoiding proprietary formats - Al Fleig
- Choosing and adopting community accepted standards - Curt Tilmes/NASA
- Building understandable spreadsheets - Jeff Arnfield/NCDC
- Using self-describing data formats - Curt Tilmes/NASA
- Creating documentation and metadata
- Introduction to Metadata and Metadata Standards - Lynn Yarmey/NSIDC
- Developing a citation for your data Bob Cook/ORNL
- Recording provenance and context - Jeff Arnfield/NCDC
- For your collections as a whole
- Creating item level metadata
- Metadata for discovery - Tyler Stevens/GCMD
- Metadata for access and use - Jeff Arnfield/NCDC
- Metadata for archiving - Jeff Arnfield/NCDC
- Metadata for tracking data processing
- Individual agencies, archives and registries may have specific requirements
- Working with your archive organization - Ron Weaver/NSIDC
- Planning for longer term preservation - Jeff Arnfield/NCDC
- Work with your archive early and often - Jeff Arnfield/NCDC
- Broadening your user community - Bob Downs
- Advertising your data - Nancy Hoebelheinrich/Knowledge Motifs
- Agency/institution requirements for publishing metadata - Nancy Hoebelheinrich/Knowledge Motifs
- Journals and publications
- Agency/institution web sites
- Using portals and registries - Nancy Hoebelheinrich/Knowledge Motifs
- Publishing metadata to a Web Accessible Folder
- Publishing metadata to GCMD - Tyler Stevens/GCMD
- Publishing metadata to ECHO
- Publishing metadata to Data.Gov
- NOTE: Need to address additional portals and registries beyond GCMD & ECHO. Add other entries as appropriate
- Datacasting - Nancy Hoebelheinrich/Knowledge Motifs
- Providing access to your data - Bob Downs/Chris Lenhardt/Ron Weaver (whole section); Rama has volunteered to review this section
- Determining your audience - Bob Downs
- Who gets to access your data
- Agency best practices & policies
- Access mechanisms - Bob Downs
- Tracking data usage - Bob Downs
- Handling Sensitive Data - Bob Downs
- Rights - Bob Downs
- Additional Products
- Writing Sharable Code - Lynn Yarmey/NSIDC
Preservation strategies
I have added draft sections below, the references need work -Ron Weaver
- Sponsor (e.g., Agency) or institution requirements - Ron Weaver /NSIDC
- Options for archiving your data - Ron Weaver/NSIDC
- What archives are out there? - Ron Weaver/NSIDC (part of above)
- Discipline or institutional archives (part of above)
- Finding an archive (part of above)
- What to do if there is no archive out there - Ron Weaver/NSIDC (part of above)
- What archives are out there? - Ron Weaver/NSIDC (part of above)
- What data goes into a Long-term archive? - Ron Weaver/NSIDC
- What do long term archives do with my data? - Jeff Arnfield/NCDC
- Data transfer & submission agreements - Ron Weaver/NSIDC
- See "Submission Agreements" section under "For Data Managers"
- Agency/archive specific requirements my vary
- Intro to the OAIS Reference Model - Curt Tilmes
- Emerging standards for preservation - Ron Weaver/NSIDC
- Metadata
Responsible Data Use
- Citation and credit - Matt Mayernik/NCAR
- Data restrictions - Bob Downs
- Fair access and use - Matt
- Providing Feedback - Ruth
- Collaboration
- Community participation
For Data Managers
- Data Management plan support
- Collection or acquisition policies
- Intro to OAIS reference model
- Initial Assessment and appraisal
- Identify information to be preserved
- main features and properties
- dependencies on information here or elsewhere
- Identify objects to be received
- Establish complementary information needs (e.g., format, data descriptions, provenance, reference information, context, fixity information)
- What complementary information is needed for data useful for climate studies (USGCRP list)
- Assessing potential designated communities
- Assessing probable curation duration
- Assessing data transfer options
- Defining access paths
- Assessing costs and feasibility
- Metadata, metadata standards, and levels of metadata
- Identify information to be preserved
- Submission agreements
- Data integrity
- Contacts
- Schedule
- Operational Procedures
- Error reconciliation
- Constraints
- other aspects necessary for understanding how to support the data
- Preparing for ingest
- Ingesting data
- Validation checks
- Identifiers
- Citations
- Levels of service
- Periodic re-assessment
- Curation activities
- Media migration
- Format migration