Difference between revisions of "Data Discovery (DCAT)"

From Earth Science Information Partners (ESIP)
(<div id="DCAT_Discovery-Mandatory"> updated)
 
(11 intermediate revisions by 3 users not shown)
The Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. The spirals below document concepts for the Common Core metadata specification based on DCAT. Where compatible, the spirals provide XPath references for the DIF, ECHO, ECS, DCAT, ISO, ISO-1 and FGDC documentation dialects.
  
<div id="DCAT_Discovery-Mandatory">
 
==DCAT for Data Discovery - Mandatory==
 
 
The Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. This spiral includes the required common core attributes for DCAT.
 
<i xmlns:xsl="http://www.w3.org/1999/XSL/Transform">Source: </i> [http://project-open-data.github.io/schema/ Common Core Reference]<table xmlns:xsl="http://www.w3.org/1999/XSL/Transform" width="95%" border="1" cellpadding="3" cellspacing="3" style="border-collapse: collapse;"><tr><th width="15%">Concept</th><th width="25%">Description</th><th width="60%">Dialect (Fit) Paths</th></tr><tr><td>[[Concepts_Glossary#Resource_Title|Resource Title]]</td><td>A short description of the resource. The title should be descriptive enough so that when a user is presented with a list of titles the general content of the data set can be determined.</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Entry_Title<br><b>DIF</b> /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Title<br><b>ECHO</b> /*/echo:ShortName&gt;/*/echo:LongName<br><b>ECS</b> /*/ecs:ShortName &gt; /*/ecs:LongName<br><b>DCAT</b> dct:title<br><b>FGDC</b> /fgdc:metadata/fgdc:idinfo/fgdc:citation/fgdc:citeinfo/fgdc:title<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:citation/cit:CI_Citation/cit:title/gco:CharacterString<br></td></tr><tr><td>[[Concepts_Glossary#Abstract|Abstract]]</td><td>A paragraph describing the resource.<br><br><i>Note: This concept is called "Description" in Catalog Services for the Web.</i></td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Summary/dif:Abstract<br><b>ECHO</b> /*/echo:Description<br><b>ECS</b> /*/ecs:CollectionDescription<br><b>DCAT</b> dct:description<br><b>FGDC</b> /fgdc:metadata/fgdc:idinfo/fgdc:descript/fgdc:abstract<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:abstract/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:abstract/gco:CharacterString<br></td></tr><tr><td>[[Concepts_Glossary#Keyword|Keyword]]</td><td>A word or phrase that describes some aspect of a resource. 
Can be one of several types.<br><br><i>Note: The general identification keywords usually have a type of "theme" and are referred to as "theme keywords". Other types and vocabularies are used for other information. Service Entry Resource Format (SERF) requires a Science and a Service GCMD Keyword. This concept is called "Subject" in the CSW Specification.</i></td><td style="word-break:break-all;"><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Category<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Topic<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Term<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Variable_Level_1<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Variable_Level_2<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Variable_Level_3<br><b>DIF</b> (1)  /dif:DIF/dif:Parameters/dif:Detailed_Variable<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:CategoryKeyword<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:TopicKeyword<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:TermKeyword<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel1Keyword/echo:Value<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel2Keyword/echo:Value<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel3Keyword<br><b>ECHO</b> (1)  /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:DetailedVariableKeyword<br><b>ECS</b> (1)  /*/ecs:DisciplineTopicParameters/ecs:DisciplineKeyword<br><b>ECS</b> (1)  /*/ecs:DisciplineTopicParameters/ecs:TopicKeyword<br><b>ECS</b> (1)  /*/ecs:DisciplineTopicParameters/ecs:TermKeyword<br><b>ECS</b> (1)  /*/ecs:DisciplineTopicParameters/ecs:VariableKeyword<br><b>DCAT</b> dct:keyword<br><b>FGDC</b> (1)  /fgdc:metadata/fgdc:idinfo/fgdc:keywords/fgdc:theme/fgdc:themekey<br><b>ISO</b> (1)  
/*/gmd:identificationInfo/*/gmd:descriptiveKeywords/gmd:MD_Keywords[gmd:type/gmd:MD_KeywordTypeCode='theme']/gmd:keyword/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:descriptiveKeywords/mri:MD_Keywords[mri:type/mri:MD_KeywordTypeCode='theme']/mri:keyword/gco:CharacterString<br></td></tr><tr><td>[[Concepts_Glossary#Resource_Creation/Revision_Date|Resource Creation/Revision Date]]</td><td>The date the resource was created</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Release_Date<br><b>ECHO</b> /*/echo:InsertTime<br><b>ECHO</b> /*/echo:LastUpdate<br><b>ECS</b> /*/ecs:RevisionDate<br><b>DCAT</b> dct:modified<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='creation']/gmd:date/gco:Date<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='creation']/gmd:date/gco:DateTime<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='revision']/gmd:date/gco:Date<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='revision']/gmd:date/gco:DateTime<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='publication']/gmd:date/gco:Date<br><b>ISO</b> //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='publication']/gmd:date/gco:DateTime<br><b>ISO-1</b> //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='creation']/cit:date/gco:DateTime<br><b>ISO-1</b> //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='revision']/cit:date/gco:DateTime<br><b>ISO-1</b> //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='publication']/cit:date/gco:DateTime<br></td></tr><tr><td>[[Concepts_Glossary#Publisher|Publisher]]</td><td>Publisher of the cited resource</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Publisher<br><b>DIF</b> 
/dif:DIF/dif:Reference/dif:Publisher<br><b>DCAT</b> dct:publisher<br><b>ISO</b> //gmd:CI_Citation/gmd:citedResponsibleParty/gmd:CI_ResponsibleParty[gmd:role/gmd:CI_RoleCode = 'publisher']/gmd:CI_Organisation/gmd:name/gco:CharacterString<br><b>ISO-1</b> //cit:CI_Citation/cit:citedResponsibleParty/cit:CI_Responsibility[cit:role/cit:CI_RoleCode='publisher']/cit:party/cit:CI_Organisation/cit:name/gco:CharacterString<br></td></tr><tr><td>[[Concepts_Glossary#Resource_Contact|Resource Contact]]</td><td>The organization or person responsible for answering questions about the resource.</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Personnel[dif:Role='INVESTIGATOR']<br><b>ECHO</b> /*/echo:Contacts/echo:Contact<br><b>ECS</b> /CollectionMetaDataFile/CollectionMetaDataSets/Collections/CollectionMetaData/Contact[Role='User Services']<br><b>DCAT</b> dct:contactPoint<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:pointOfContact<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:pointOfContact<br></td></tr><tr><td>[[Concepts_Glossary#Metadata_Identifier|Metadata Identifier]]</td><td>A phrase or string which uniquely identifies the metadata file/record.</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Entry_Id<br><b>ISO</b> /*/gmd:fileIdentifier/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:metadataIdentifier/mcc:MD_Identifier<br></td></tr><tr><td>[[Concepts_Glossary#Resource_Access_Constraints|Resource Access Constraints]]</td><td>Information about any constraints for accessing the data set. This includes any special restrictions, legal prerequisites, limitations and/or warnings on obtaining the data set. 
Some words that may be used in this field include: Public, In-house, Limited. Additional detailed instructions on how to access the data can be entered in this field.</td><td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Access_Constraints<br><b>ECHO</b> /*/echo:RestrictionFlag<br><b>ECHO</b> /*/echo:RestrictionComment<br><b>ECHO</b> /*/echo:Visible<br><b>ECS</b> /*/ecs:AccessConstraints<br><b>DCAT</b> /dct:accessLevel<br><b>FGDC</b> /fgdc:metadata/fgdc:idinfo/fgdc:accconst<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:resourceConstraints/gmd:MD_LegalConstraints/gmd:accessConstraints/gco:CharacterString<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:resourceConstraints/gmd:MD_LegalConstraints/gmd:otherConstraints/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:resourceConstraints/mco:MD_LegalConstraints/mco:accessConstraints/gco:CharacterString<br><b>ISO-1</b> /mdb:MD_Metadata/mdb:identificationInfo/*/mri:resourceConstraints/mco:MD_LegalConstraints/mco:otherConstraints/gco:CharacterString<br></td></tr></table>
 
''xPath Note:'' The xPaths included in this table use several wildcards. // means
any path, so //gmd:CI_ResponsibleParty indicates a gmd:CI_ResponsibleParty anywhere
 
''Fit:'' The fit of the dialect path with the concept is estimated on a scale of 1 to 3:
1 = excellent two-way fit, 2 = one-way fit or some other problem, 3 = extension required.
[[Category:Documentation Recommendations]]
[[Data Discoverability]]

Latest revision as of 01:49, August 15, 2015

The Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. The spirals below document concepts for the Common Core metadata specification based on DCAT. Where compatible, the spirals provide XPath references for the DIF, ECHO, ECS, DCAT, ISO, ISO-1 and FGDC documentation dialects.

DCAT for Data Discovery - Mandatory

The Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. This spiral includes the required common core attributes for DCAT.

Source: Common Core Reference
Concept | Description | Dialect (Fit) Paths
Resource Title | A short description of the resource. The title should be descriptive enough so that when a user is presented with a list of titles the general content of the data set can be determined.
ADIwg /adiwg:project/adiwg:idinfo/adiwg:citation/adiwg:citeinfo/adiwg:title
DIF /dif9:DIF/dif9:Entry_Title
DIF /dif9:DIF/dif9:Data_Set_Citation/dif9:Dataset_Title
DCAT /dct:title
Dryad dcterms:title
ECHO /*/echo:ShortName>/*/echo:LongName
ECS /*/ecs:ShortName > /*/ecs:LongName
EML /eml:dataset/eml:title/eml:text
FGDC /fgdc:metadata/fgdc:idinfo/fgdc:citation/fgdc:citeinfo/fgdc:title
HDF5.1 /hdf5:HDF5-File/hdf5:RootGroup/hdf5:Attribute[@Name='title']/hdf5:Data/hdf5:DataFromFile
HDF5.1 /hdf5:HDF5-File/hdf5:RootGroup/hdf5:Group[@Name='METADATA']/hdf5:Group[@Name='COLLECTIONMETADATA']/hdf5:Attribute[@Name='LongName']/hdf5:Data/hdf5:DataFromFile
ISO /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:citation/cit:CI_Citation/cit:title/gco:CharacterString
UMM /umm:UMM/umm:CollectionCitation/umm:Title
OGC-SOS /sos:Capabilities/ows:ServiceIdentification/ows:Title
SERF /serf:SERF/serf:Entry_Title
THREDDS /thredds:catalog/thredds:dataset/@name
THREDDS /thredds:catalog/thredds:dataset/thredds:metadata/dc:title
THREDDS //thredds:dataset[1]/@name
netCDF /nc:netcdf/nc:attribute[@nc:name='title']/@nc:value
Abstract | A paragraph describing the resource.

Note: This concept is called "Description" in Catalog Services for the Web.
ADIwg /adiwg:project/adiwg:idinfo/adiwg:descript/adiwg:abstract
DIF /dif9:DIF/dif9:Summary/dif9:Abstract
DCAT /dct:description
Dryad dcterms:description
ECHO /*/echo:Description
ECS /*/ecs:CollectionDescription
EML /eml:dataset/eml:abstract/eml:descendant
FGDC /fgdc:metadata/fgdc:idinfo/fgdc:descript/fgdc:abstract
HDF5.1 /hdf5:HDF5-File/hdf5:RootGroup/hdf5:Attribute[@Name='summary']/hdf5:Data/hdf5:DataFromFile
HDF5.1 /hdf5:HDF5-File/hdf5:RootGroup/hdf5:Group[@Name='METADATA']/hdf5:Group[@Name='COLLECTIONMETADATA']/hdf5:Attribute[@Name='CollectionDescription']/hdf5:Data/hdf5:DataFromFile
ISO /*/gmd:identificationInfo/*/gmd:abstract/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:abstract/gco:CharacterString
UMM /umm:UMM/umm:Summary
OGC-SOS /sos:Capabilities/ows:ServiceIdentification/ows:Abstract
SERF /serf:SERF/serf:Summary/serf:Abstract
THREDDS //thredds:dataset/thredds:documentation[@type='abstract']
THREDDS //thredds:metadata/thredds:documentation[@type='abstract']
netCDF /nc:netcdf/nc:attribute[@nc:name='summary']/@nc:value
Theme Keyword | A word or phrase that describes some aspect of a res

xPath Note: The xPaths included in this table use several wildcards. // means any path, so //gmd:CI_ResponsibleParty indicates a gmd:CI_ResponsibleParty anywhere in an XML file. /*/ indicates a single level with several possible elements. This usually indicates one of several concrete realizations of an abstract object. For example, /*/gmd:identificationInfo could be gmd:MD_Metadata/gmd:identificationInfo or gmi:MI_Metadata/gmd:identificationInfo, and gmd:identificationInfo/*/gmd:descriptiveKeywords could be gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords or gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:descriptiveKeywords.

Fit: The fit of the dialect path with the concept is estimated on a scale of 1 to 3: 1 = excellent two-way fit, 2 = one-way fit or some other problem, 3 = extension required.
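The two wildcards described in the xPath Note can be tried out with a short script. The following is only a minimal sketch, assuming Python's standard `xml.etree.ElementTree` module; the sample record, its title text, and the variable names are made up for illustration, and the namespace URIs are the usual ISO 19139 ones rather than anything stated on this page. ElementTree supports only a subset of XPath, so `//` is written as `.//`, and the leading `/*` of the table paths (the document root) is represented by the parsed root element itself.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URIs for the gmd/gco prefixes used in the table.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Hypothetical, heavily trimmed ISO record made up for this illustration.
SAMPLE = """<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd"
                 xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation>
        <gmd:CI_Citation>
          <gmd:title>
            <gco:CharacterString>Example Sea Surface Temperature</gco:CharacterString>
          </gmd:title>
        </gmd:CI_Citation>
      </gmd:citation>
      <gmd:pointOfContact>
        <gmd:CI_ResponsibleParty/>
      </gmd:pointOfContact>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>"""

root = ET.fromstring(SAMPLE)

# "//" (any depth): ElementTree needs the relative form ".//".
parties = root.findall(".//gmd:CI_ResponsibleParty", NS)

# "/*/" (exactly one level, any element name): the "*" step matches
# gmd:MD_DataIdentification here, and would equally match
# srv:SV_ServiceIdentification in a service record.
titles = root.findall(
    "gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString",
    NS,
)

print(len(parties))
print([t.text for t in titles])
```

A full XPath 1.0 engine would accept the absolute paths from the table unchanged; the rewrite to relative paths is specific to ElementTree's limited path syntax.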


Data Discoverability