Difference between revisions of "FAIR Dataset Quality Information"

From Earth Science Information Partners (ESIP)
Line 20: Line 20:
  
 
* Production system: <br>  
 
* Production system: <br>  
** CORE-CLIMAX production system maturity matrix [https://www.eumetsat.int/website/home/Data/ClimateService/index.html (EUMETSAT 2013)] <br>
+
** CORE-CLIMAX Production System Maturity Matrix [https://www.eumetsat.int/website/home/Data/ClimateService/index.html (EUMETSAT 2013)] <br>
 
** DKRZ Quality Maturity Matrix [https://www.dkrz.de/pdfs/poster/Hoeck_et_al_EGU2015_maturitymatrices_15apr.pdf?lang=de (Hock et al. 2015)]<br>
 
** DKRZ Quality Maturity Matrix [https://www.dkrz.de/pdfs/poster/Hoeck_et_al_EGU2015_maturitymatrices_15apr.pdf?lang=de (Hock et al. 2015)]<br>
 
** QA4ECV [https://www.mdpi.com/2072-4292/10/8/1254 (Nightingale et al. 2018)] <br>
 
** QA4ECV [https://www.mdpi.com/2072-4292/10/8/1254 (Nightingale et al. 2018)] <br>
Line 26: Line 26:
 
* Scientific quality: <br>  
 
* Scientific quality: <br>  
 
** NASA Technical Readiness Levels for Operations [http://www.onethesis.com/wp-content/uploads/2016/11/1-s2.0-S0094576509002008-main.pdf (Mankins 2009)]<br>
 
** NASA Technical Readiness Levels for Operations [http://www.onethesis.com/wp-content/uploads/2016/11/1-s2.0-S0094576509002008-main.pdf (Mankins 2009)]<br>
** NOAA STAR data product algorithm maturity matrix [https://www.mdpi.com/2072-4292/8/2/139 (Zhou, Divakarla & Liu 2016)] <br>
+
** NOAA STAR Data Product Algorithm Maturity Matrix [https://www.mdpi.com/2072-4292/8/2/139 (Zhou, Divakarla & Liu 2016)] <br>
** Perspectives of data uncertainty [https://esip.figshare.com/articles/Understanding_the_Various_Perspectives_of_Earth_Science_Observational_Data_Uncertainty/10271450 (Moroni et al. 2019)] <br>
+
** Perspectives of Data Uncertainty [https://esip.figshare.com/articles/Understanding_the_Various_Perspectives_of_Earth_Science_Observational_Data_Uncertainty/10271450 (Moroni et al. 2019)] <br>
 
** OGC UncertML (Williams et al. 2009) <br>
 
** OGC UncertML (Williams et al. 2009) <br>
 
** Operational Readiness Levels For Disaster Operations (ESIP Disasters Cluster 2018) <br>  
 
** Operational Readiness Levels For Disaster Operations (ESIP Disasters Cluster 2018) <br>  
  
 
* Product quality: <br>  
 
* Product quality: <br>  
** NOAA CDR product maturity matrix [https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2012EO440006 (Bates and Privette 2012)]<br>  
+
** NOAA CDR Product Maturity Matrix [https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2012EO440006 (Bates and Privette 2012)]<br>  
  
 
* Stewardship quality: <br>
 
* Stewardship quality: <br>
** NCEI/CICS-NC scientific data stewardship maturity matrix [https://datascience.codata.org/articles/abstract/10.2481/dsj.14-049/ (Peng et al. 2015)]<br>
+
** NCEI/CICS-NC Scientific Data Stewardship Maturity Matrix [https://datascience.codata.org/articles/abstract/10.2481/dsj.14-049/ (Peng et al. 2015)]<br>
** CEOS WGISS data management and stewardship maturity matrix [http://ceos.org/document_management/Working_Groups/WGISS/Interest_Groups/Data_Stewardship/White_Papers/WGISS%20Data%20Management%20and%20Stewardship%20Maturity%20Matrix.pdf (WGISS DSIG 2017)]<br>
+
** CEOS WGISS Data Management and Stewardship Maturity Matrix [http://ceos.org/document_management/Working_Groups/WGISS/Interest_Groups/Data_Stewardship/White_Papers/WGISS%20Data%20Management%20and%20Stewardship%20Maturity%20Matrix.pdf (WGISS DSIG 2017)]<br>
** WMO stewardship maturity matrix for climate data [https://figshare.com/articles/The_WMO-Wide_Stewardship_Maturity_Matrix_for_Climate_Data/7006028 (SMM-CD Working Group 2019)]<br>
+
** WMO Stewardship Maturity Matrix for Climate Data [https://figshare.com/articles/The_WMO-Wide_Stewardship_Maturity_Matrix_for_Climate_Data/7006028 (SMM-CD Working Group 2019)]<br>
 
** GEOSS Data Management Principles and Data Sharing Principles [https://www.earthobservations.org/documents/dswg/201504_data_management_principles_long_final.pdf (GEO DMP TF 2015;] [https://www.earthobservations.org/documents/dswg/10_GEOSS%20Data%20Sharing%20Principles%20post%202015.pdf GEO DSWG 2014)] <br>
 
** GEOSS Data Management Principles and Data Sharing Principles [https://www.earthobservations.org/documents/dswg/201504_data_management_principles_long_final.pdf (GEO DMP TF 2015;] [https://www.earthobservations.org/documents/dswg/10_GEOSS%20Data%20Sharing%20Principles%20post%202015.pdf GEO DSWG 2014)] <br>
  
 
* Service quality:<br>  
 
* Service quality:<br>  
** Level of services models: NSIDC (Duerr et al. 2009) and [https://earthdata.nasa.gov/collaborate/new-missions/level-of-service NASA Earth Science Data System] <br>
+
** Level of Services Models: NSIDC (Duerr et al. 2009) and [https://earthdata.nasa.gov/collaborate/new-missions/level-of-service NASA Earth Science Data System] <br>
** NCEI tiered scientific data stewardship services [http://www.dlib.org/dlib/may16/peng/05peng.html (Peng et al. 2016)] <br>
+
** NCEI Tiered Scientific Data Stewardship Services [http://www.dlib.org/dlib/may16/peng/05peng.html (Peng et al. 2016)] <br>
 
** [https://www.ncdc.noaa.gov/gosic/gcos-essential-climate-variable-ecv-data-access-matrix GCOS ECV Data and Information Access Matrix] <br>
 
** [https://www.ncdc.noaa.gov/gosic/gcos-essential-climate-variable-ecv-data-access-matrix GCOS ECV Data and Information Access Matrix] <br>
 
** [https://www.goosocean.org/index.php?option=com_content&view=article&id=125&Itemid=113 Global Ocean Observing System (GOOS) framework] <br>
 
** [https://www.goosocean.org/index.php?option=com_content&view=article&id=125&Itemid=113 Global Ocean Observing System (GOOS) framework] <br>
** NCEI/ESIP-DSC data use and services maturity matrix [https://figshare.com/articles/MM-Serv_ESIP_2018sum_v2r1_20180709_pdf/6855020 (Serv-MM Working Group 2018)] <br>
+
** NCEI/ESIP-DSC Data Use and Services Maturity Matrix [https://figshare.com/articles/MM-Serv_ESIP_2018sum_v2r1_20180709_pdf/6855020 (Serv-MM Working Group 2018)] <br>
** Data use and impact [https://esip.figshare.com/articles/Assessing_the_Science_Impact_of_Gridded_Population_Data_A_Pilot_Study/10028369 (Downs 2019)] <br><br>
+
** Data Use and Impact [https://esip.figshare.com/articles/Assessing_the_Science_Impact_of_Gridded_Population_Data_A_Pilot_Study/10028369 (Downs 2019)] <br><br>
  
 
<big>'''Dataset-level metadata quality: ''' </big><br>  
 
<big>'''Dataset-level metadata quality: ''' </big><br>  

Revision as of 15:19, June 14, 2020

Document

This is the document for community guidelines of consistently curating and representing dataset quality information, in line with the FAIR principles.

Overview

This document provides resources for developing community guidelines for consistently curating and representing dataset quality information and captures the outcomes. The guidelines aim to help curate dataset quality information that is findable and accessible, machine- and human-readable, interoperable, and reusable.

The target community is any entity that produces, publishes, manages, or uses digital Earth Science datasets or products. However, the guidelines will be general enough to be applicable to digital datasets of other disciplines.

Resources

Multi-dimensions of Data and Information Quality:

Existing Fitness for Purpose Assessment approaches Through the Full Life Cycle of Earth Science Datasets:

  • Scientific quality:
    • NASA Technical Readiness Levels for Operations (Mankins 2009)
    • NOAA STAR Data Product Algorithm Maturity Matrix (Zhou, Divakarla & Liu 2016)
    • Perspectives of Data Uncertainty (Moroni et al. 2019)
    • OGC UncertML (Williams et al. 2009)
    • Operational Readiness Levels For Disaster Operations (ESIP Disasters Cluster 2018)

Dataset-level metadata quality:

Portfolio Management and Repository Certifications

FAIR Data Principles

Organizational Challenges & Approaches

Data Quality Management Framework

Definitions

  • Data can refer to anything that is collected, observed, or derived and used as a basis for reasoning, discussion, or calculation. Data can be either structured or unstructured, and can be represented in quantitative, qualitative, or physical forms.
  • Scientific or research data is defined as: the recorded factual material commonly accepted in the scientific community as necessary to validate research findings.
  • Digital data, distinguished from physical records, such as paper weather reports, are represented in discrete numerical form that can be used by a computer or electronic device.
  • Data product refers to “a product that facilitates an end goal through the use of data,” usually with a well-thought out algorithm or approach (Patil 2012). Data products tend to be structured and can be raw measurements or scientific products derived from raw measurements or other products. Products can also be statistical or numerical model outputs, including analyses, reanalyses, predictions, or projections. Earth Science data products may be further categorized based on their processing levels.
  • Dataset is an identifiable collection of physical records, a digital rendition of factual materials, or a product of a given version of an algorithm/model. A dataset may contain one or many physical samples or data files in an identical format, having the same geophysical variable(s) and product specification(s), such as the geospatial location or spatial grid. Dataset and data product may be used interchangeably.
  • Information is considered as data being processed, organized, structured, or presented in a given context, while knowledge is gained from an understanding of the significance of information [(Mosely et al. 2009), available at: https://technicspub.com/dmbok]. Data and information may overlap and may be used interchangeably.
  • Dataset quality includes quality of both data and associated information.

Intended Users

  • Data producers, publishers, providers, and service providers for improved data sharing and reuse;
  • Data quality management professionals for improved data quality and usability;
  • Entities or organizations that manage and steward Earth science datasets during any stage of their full life cycle for improved enterprise data management and stewardship;
  • End users who integrate various datasets and associated quality information for improved interoperability and reusability.

---

Return to Pre-ESIP Workshop: About
Return to Information Quality Cluster Homepage