Difference between revisions of "Documenting the Big Earth Data Initiative"

From Earth Science Information Partners (ESIP)
 
(17 intermediate revisions by 3 users not shown)
Line 1: Line 1:
The purpose of the Big Earth Data Initiative (BEDI) is to improve the discoverability, accessibility, and usability of Federal data and information products derived from civil Earth observations. This activity is an OSTP initiative which will be coordinated through the US Group on Earth Observations (USGEO) Subcommittee of the National Science and Technology Council (NSTC) Committee on Environment Natural Resources and Sustainability (CENRS).
+
=Overview=
 +
The ESIP Documentation Cluster is working with the U.S. Big Earth Data Initiative (BEDI) to improve the discoverability, accessibility, and usability of Federal data and information products derived from civil Earth observations. BEDI is an initiative of the White House Office of Science Technology and Policy (OSTP) (see http://www.whitehouse.gov/blog/2013/04/19/taking-pulse-our-planet-new-strategy-earth-observations) that will be coordinated through the US Group on Earth Observations (USGEO) Subcommittee of the National Science and Technology Council (NSTC) Committee on Environment Natural Resources and Sustainability (CENRS).
  
===Overview===
+
Metadata are a critical component of the Big Earth Data Initiative (BEDI).  Metadata are essential for facilitating data discovery, access, usability and understanding. The purpose of this BEDI project is to provide data producers with the guidance necessary to create high quality metadata.  We are developing a new ensemble approach for identifying criteria for high quality metadata in multiple earth science documentation dialects and providing quantitative tools for evaluating and improving metadata using those criteria. The Documentation Connections Category on the ESIP Documentation Cluster Wiki is the community focal point for this activity.
A critical component of the BEDI is metadata.  Metadata is essential for facilitating data discovery, access, useability and understanding. The purpose of this page is to provide data producer's with the guidance necessary to create high quality metadata.  This page will identify criteria for what constitutes high quality metadata and explore the suitability of multiple earth science documentation dialects.
 
  
===Documentation Purpose===
+
=Documentation Terminology=
Metadata is essential for facilitating data discovery, accessibility, useability and understanding. This project will identify quality assessment criteria for each of these data use categories. Thus providing a structure from which metadata quality can be assessed and evaluated.
+
The terminology used in this work has been developed over many years of metadata and documentation related work in NOAA, NASA and other U.S. Federal Agencies. Our [[Documentation Terminology|terminology ]] is described  to ensure effective communication.  
  
===Documentation Dialects===
+
=Metadata Recommendations=
Numerous documentation dialects exist within the Earth Science Community. This project will review the suitability of these dialects for each of the documentation purpose categories listed above. The dialects being considered include: Data Interchange Format (DIF), Data Catalog Vocabulary (DCAT), DRYAD, ECHO, ECS, Ecology Markup Language (EML), FGDC Content Standard for Geospatial Metadata (CSDGM), HDF5 Original XML, ISO 19110 Metadata for Services, ISO 19115, ISO 19115-1, NASA Unified Metadata Model (UMM), OGC Sensor Observation Service (SOS), THREDDS, netCDF.
+
Data and observations are many times collected within small research groups or organizations in order to address specific scientific questions. Preparing data for re-use or sharing with other groups or non-experts brings a different set of documentation requirements and many research groups look for guidance about how to satisfy those new requirements. Many groups within the U.S. and the global environmental data community have addressed this need for guidance, generally in the form of lists of metadata elements required, recommended, or suggested for a particular documentation need. We term these lists [[:Category:Documentation Recommendations|recommendations]].
  
===Concepts & Spirals===
+
=Concepts & Spirals=
Spirals provide a criteria for evaluating the information content captured for information categories; such as citations, people or identifiers. The elements that comprise a spiral are called concepts.  Some concepts may be reused in multiple spirals.
+
Documentation recommendations are made up of lists of concepts, usually given in some dialect. Managers must identify recommendations that are relevant to their situations and concepts that are included in those recommendations. We term the selection of relevant recommendations a [[:Category:Selection Scenario|Selection Scenario]]. Typically a group of concepts will be required to address a particular documentation needs or use cases. We term these groups ''spirals''.
 +
 
 +
Spirals provide criteria for evaluating the completeness of documentation content for different metadata use cases such as discovery, access, usage or understanding, and for various metadata topics such as citations, people or identifiers. The elements that comprise a spiral are called concepts.  Some concepts may be reused in multiple spirals. See the [[Concepts Glossary|Concepts Glossary]] page to view a reference list of metadata concepts. See the [[Documentation Spirals|Documentation Spirals]] page to view metadata completeness spirals for metadata types, and topics.
 +
 
 +
=Documentation Dialects=
 +
Numerous documentation [[Metadata_Dialects|dialects]] exist within the Earth Science Community. Once required concepts are identified, dialects must be selected that include those concepts. Our goal is to provide information about concepts in many dialects.
 +
 
 +
 
 +
[[Metadata Implementation]]

Latest revision as of 11:46, July 29, 2015

Overview

The ESIP Documentation Cluster is working with the U.S. Big Earth Data Initiative (BEDI) to improve the discoverability, accessibility, and usability of Federal data and information products derived from civil Earth observations. BEDI is an initiative of the White House Office of Science Technology and Policy (OSTP) (see http://www.whitehouse.gov/blog/2013/04/19/taking-pulse-our-planet-new-strategy-earth-observations) that will be coordinated through the US Group on Earth Observations (USGEO) Subcommittee of the National Science and Technology Council (NSTC) Committee on Environment Natural Resources and Sustainability (CENRS).

Metadata are a critical component of the Big Earth Data Initiative (BEDI). Metadata are essential for facilitating data discovery, access, usability and understanding. The purpose of this BEDI project is to provide data producers with the guidance necessary to create high quality metadata. We are developing a new ensemble approach for identifying criteria for high quality metadata in multiple earth science documentation dialects and providing quantitative tools for evaluating and improving metadata using those criteria. The Documentation Connections Category on the ESIP Documentation Cluster Wiki is the community focal point for this activity.

Documentation Terminology

The terminology used in this work has been developed over many years of metadata and documentation related work in NOAA, NASA and other U.S. Federal Agencies. Our terminology is described to ensure effective communication.

Metadata Recommendations

Data and observations are many times collected within small research groups or organizations in order to address specific scientific questions. Preparing data for re-use or sharing with other groups or non-experts brings a different set of documentation requirements and many research groups look for guidance about how to satisfy those new requirements. Many groups within the U.S. and the global environmental data community have addressed this need for guidance, generally in the form of lists of metadata elements required, recommended, or suggested for a particular documentation need. We term these lists recommendations.

Concepts & Spirals

Documentation recommendations are made up of lists of concepts, usually given in some dialect. Managers must identify recommendations that are relevant to their situations and concepts that are included in those recommendations. We term the selection of relevant recommendations a Selection Scenario. Typically a group of concepts will be required to address a particular documentation needs or use cases. We term these groups spirals.

Spirals provide criteria for evaluating the completeness of documentation content for different metadata use cases such as discovery, access, usage or understanding, and for various metadata topics such as citations, people or identifiers. The elements that comprise a spiral are called concepts. Some concepts may be reused in multiple spirals. See the Concepts Glossary page to view a reference list of metadata concepts. See the Documentation Spirals page to view metadata completeness spirals for metadata types, and topics.

Documentation Dialects

Numerous documentation dialects exist within the Earth Science Community. Once required concepts are identified, dialects must be selected that include those concepts. Our goal is to provide information about concepts in many dialects.


Metadata Implementation