Difference between revisions of "ADN Standards"
|(One intermediate revision by one other user not shown)|
Latest revision as of 09:58, October 3, 2012
- EEA Technical report 5/2012 on Reporting and exchanging air quality information in Europe using e-Reporting
- INSPIRE directive aims to create a European Union (EU) spatial data infrastructure. It came into force on 15 May 2007 and will be implemented in various stages, with full implementation required by 2019. (Text of the directive); draft text of data specifications for atmospheric conditions and meteorological features
- There are two fora in Europe aiming to establish standards in air quality monitoring and modelling: AQUILA and FAIRMODE
Data formats and data exchange protocols
- | OGC-WCS: The Open Geospatial Consortium develops and maintains the interface standard for interoperable protocols such as WCS (web coverage service), WFS (web feature service), and WMS (web map service). The current version of WCS is 1.1 (apparently, version 1.1.2 was deprecated recently due to the lack of a standard testing procedure). Version 2.0 is in preparation. The WCS 2.0 core specification is available, but this contains very limitied specific information on how the standard can actually be implemented. The relevant extensions (for example for netcdf encoding) are still awaiting a committee decision [July 2012]. A summary description how WCS works can be found here: Web Coverage Service (WCS)
- Network Common Data Format (netCDF): Netcdf is a binary file encoding format that allows to store attributes (i.e. metadata information) along with the data variables. For further information check out data model
General metadata specifications for geospatial information
These standards are mainly describing ways to specify the necessary information to find and access a specific data set, including for example information on the responsible contact person. Technically, this is implemented with XML documents that can be validated against XML schema files. The international standard for describing geospatial metadata is ISO 19115. The implementation of ISO 19115 is described in ISO 19139. In Europe, the INSPIRE directive (see ) is establishing another standard, which is largely compatible with ISO19115, but in some areas less and in other areas more stringent (concerning required metadata). Unfortunately, ISO documents are not freely available. However, it is possible to find a good part of information about it on the internet.
- Discovery-Level Metadata Content Standards: covers FGDC, ISO 19115, WMO core, DIF, CF, and THREDDS
- ISO 19139 codelists: These codelists contain the controlled vocabulary defined by ISO 19139
- NOAA GEO-IDE Metadata and Coverages and netcdf attribute convention for dataset discovery including its conformance test at | C-Test. A helpful introduction to ISO metadata is available at ISO Building Blocks
- GEOMS metadata standard for ground-based data and satellite validation
- INSPIRE Implementing Rules provides access to INSPIRE documents including metadata legislation and guidance documents, data specifications, etc.
- Climate and Forecasting (CF) metadata conventions: These metadata conventions are very different from ISO 19115 and INSPIRE in that they establish a standard for describing the information content in a data file (whereas ISO and INSPIRE describe information about a file or service). Nevertheless, there is a close relation between the two: firstly, the information should be consistent, secondly, information contained in the file can be harvested to (automatically) create XML metadata files which should then follow the ISO (and INSPIRE) rules.
- AQ Community metadata "facets" discussion on Solta: In order to enable the air quality data network, the initial approach has been to define so-called metadata facets based on the content requirements that were seen as essential for the dataset discovery (see for example the Core.uFind catalogue at WUSTL or the ACTRIS catalogue at NILU. These metadata facets are not necessarily compliant with the ISO or INSPIRE standards (and they may be incomplete). Discussions to extend the thematic information that should be available in metadata records and how thi sinformation can be mapped onto the existing ISO and INSPIRE standards are at the heart of the GEo AQ CoP Dublin workshop 2012.
- GAWSIS metadata definition: Description of discovery metadata information for Global Atmosphere Watch (GAW) (station) data sets [provided by J. Klausen, MeteoSwiss]
- SeaDataNet (an EU project) established a community metadata standard which could potentially serve as a role-model for the AQ/AC community.
- PASODOBLE MyAir metadata standard [provided by A. de Rudder]
Describing Data Quality
There are various attempts to define data quality indicators and good practices for quality assurance and quality control. Here, we just refer to the QA4EO (Quality Assurance for Earth Observations) initiative and their QA4EO Guiding Principles document.
Mapping ISO, netCDF and HDF metadata
As the distinction between discovery metadata (the realm of ISO and INSPIRE) and descriptive metadata (primary focus of netcdf-cf and HDF conventions, such as GEOMS) is blurred, it is important to map one standard onto the other. Ted Haberman has done some pioneering work on this: .
Tools for handling (ISO and INSPIRE) metadata
There are commercial and free metadata editors available which include schema specifications for ISO19115 and INSPIRE. The INSPIRE Geoportal also makes a metadata editor and validator available.
- CatMDEdit is an open source metadata editor which is farily straightforward to use and can at least help to obatin a better understanding of the ISO 19115 and INSPIRE metadata structures (see example at ISO_INSPIRE_Metadata_Structure).
Data set type specific issues
- atmospheric models generally use a spherical lat/lon coordinate system. There is a need to define the appropriate coordinate reference system (CRS) in the associated metadata. Terms that are often seen in this context are "WGS84" and "urn:ogc:def:crs:OGC:2:84" or "urn:ogc:def:crs:EPSG::4326". An email exchange with EPSG on this provides some further input to this discussion. Also check out the discussion on CF-ESRI mail archive.
- (discovery) metadata records for continuous data need to specify the begin and end of a time series. In the case of continuous data (i.e. data that are continuously updated until "now"), this can be accomplished via specific gml tags, namely indetermibatePosition. The example below was provided by Paul Hasenohr of EEA, Kopenhagen:
<gmd:extent> <gmd:EX_Extent> <gmd:temporalElement> <gmd:EX_TemporalExtent> <gmd:extent> <gml:TimePeriod gml:id="d28e322a1049886"> <gml:begin> <gml:TimeInstant gml:id="tstart"> <gml:timePosition>2011-04-01T00:00:00</gml:timePosition> </gml:TimeInstant> </gml:begin> <gml:end> <gml:TimeInstant gml:id="tend"> <gml:timePosition indeterminatePosition="now"/> </gml:TimeInstant> </gml:end> </gml:TimePeriod> </gmd:extent> </gmd:EX_TemporalExtent> </gmd:temporalElement> </gmd:EX_Extent> </gmd:extent>
Copied from 
<gmd:extent> <gmd:EX_Extent> <gmd:description> <gco:CharacterString>coordinates of ship track</gco:CharacterString> </gmd:description> <gmd:geographicElement> <gmd:EX_BoundingPolygon> <gmd:polygon> <gml:LineString gml:id="coordsId1" srsName="urn:ogc:def:crs:EPSG::4326"> <gml:posList> 120.27748,22.61839 120.18761,22.63769 120.17214,22.53991 ... 120.28458,22.61497 </gml:posList> </gml:LineString> </gmd:polygon> </gmd:EX_BoundingPolygon> </gmd:geographicElement> </gmd:EX_Extent> </gmd:extent>
Back to [| Dublin workshop] page