Data Discovery (DCAT)

From Earth Science Information Partners (ESIP)
Revision as of 18:58, June 11, 2015 by Epauli

The spirals below document concepts for the Common Core metadata specification based on the Data Catalog Vocabulary (DCAT). Where compatible, the spirals provide XPath references for the DIF, ECHO, ECS, DCAT, ISO, ISO-1 and FGDC documentation dialects.

DCAT for Data Discovery - Mandatory

The Data Catalog Vocabulary (DCAT) is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. This spiral includes the required Common Core attributes for DCAT.

Source: Common Core Reference

Concept | Description | Dialect (Fit) Paths
Resource Title | A short description of the resource. The title should be descriptive enough so that when a user is presented with a list of titles the general content of the data set can be determined.
DIF /dif:DIF/dif:Entry_Title
DIF /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Title
ECHO /*/echo:ShortName > /*/echo:LongName
ECS /*/ecs:ShortName > /*/ecs:LongName
DCAT dct:title
FGDC /fgdc:metadata/fgdc:idinfo/fgdc:citation/fgdc:citeinfo/fgdc:title
ISO /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:citation/cit:CI_Citation/cit:title/gco:CharacterString
Abstract | A paragraph describing the resource.

Note: This concept is called "Description" in Catalog Services for the Web.
DIF /dif:DIF/dif:Summary/dif:Abstract
ECHO /*/echo:Description
ECS /*/ecs:CollectionDescription
DCAT dct:description
FGDC /fgdc:metadata/fgdc:idinfo/fgdc:descript/fgdc:abstract
ISO /*/gmd:identificationInfo/*/gmd:abstract/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:abstract/gco:CharacterString
Keyword | A word or phrase that describes some aspect of a resource. Can be one of several types.

Note: The general identification keywords usually have a type of "theme" and are referred to as "theme keywords". Other types and vocabularies are used for other information. Service Entry Resource Format (SERF) requires a Science and a Service GCMD Keyword. This concept is called "Subject" in the CSW Specification.
DIF (1) /dif:DIF/dif:Parameters/dif:Category
DIF (1) /dif:DIF/dif:Parameters/dif:Topic
DIF (1) /dif:DIF/dif:Parameters/dif:Term
DIF (1) /dif:DIF/dif:Parameters/dif:Variable_Level_1
DIF (1) /dif:DIF/dif:Parameters/dif:Variable_Level_2
DIF (1) /dif:DIF/dif:Parameters/dif:Variable_Level_3
DIF (1) /dif:DIF/dif:Parameters/dif:Detailed_Variable
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:CategoryKeyword
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:TopicKeyword
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:TermKeyword
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel1Keyword/echo:Value
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel2Keyword/echo:Value
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:VariableLevel3Keyword
ECHO (1) /*/echo:ScienceKeywords/echo:ScienceKeyword/echo:DetailedVariableKeyword
ECS (1) /*/ecs:DisciplineTopicParameters/ecs:DisciplineKeyword
ECS (1) /*/ecs:DisciplineTopicParameters/ecs:TopicKeyword
ECS (1) /*/ecs:DisciplineTopicParameters/ecs:TermKeyword
ECS (1) /*/ecs:DisciplineTopicParameters/ecs:VariableKeyword
DCAT dcat:keyword
FGDC (1) /fgdc:metadata/fgdc:idinfo/fgdc:keywords/fgdc:theme/fgdc:themekey
ISO (1) /*/gmd:identificationInfo/*/gmd:descriptiveKeywords/gmd:MD_Keywords[gmd:type/gmd:MD_KeywordTypeCode='theme']/gmd:keyword/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:descriptiveKeywords/mri:MD_Keywords[mri:type/mri:MD_KeywordTypeCode='theme']/mri:keyword/gco:CharacterString
Resource Creation/Revision Date | The date the resource was created or revised.
DIF /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Release_Date
ECHO /*/echo:InsertTime
ECHO /*/echo:LastUpdate
ECS /*/ecs:RevisionDate
DCAT dct:modified
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='creation']/gmd:date/gco:Date
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='creation']/gmd:date/gco:DateTime
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='revision']/gmd:date/gco:Date
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='revision']/gmd:date/gco:DateTime
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='publication']/gmd:date/gco:Date
ISO //gmd:CI_Citation/gmd:date/gmd:CI_Date[gmd:dateType/gmd:CI_DateTypeCode='publication']/gmd:date/gco:DateTime
ISO-1 //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='creation']/cit:date/gco:DateTime
ISO-1 //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='revision']/cit:date/gco:DateTime
ISO-1 //cit:CI_Citation/cit:date/cit:CI_Date[cit:dateType/cit:CI_DateTypeCode='publication']/cit:date/gco:DateTime
Publisher | Publisher of the cited resource.
DIF /dif:DIF/dif:Data_Set_Citation/dif:Dataset_Publisher
DIF /dif:DIF/dif:Reference/dif:Publisher
DCAT dct:publisher
ISO //gmd:CI_Citation/gmd:citedResponsibleParty/gmd:CI_ResponsibleParty[gmd:role/gmd:CI_RoleCode='publisher']/gmd:organisationName/gco:CharacterString
ISO-1 //cit:CI_Citation/cit:citedResponsibleParty/cit:CI_Responsibility[cit:role/cit:CI_RoleCode='publisher']/cit:party/cit:CI_Organisation/cit:name/gco:CharacterString
Resource Contact | The organization or person responsible for answering questions about the resource.
DIF /dif:DIF/dif:Personnel[dif:Role='INVESTIGATOR']
ECHO /*/echo:Contacts/echo:Contact
ECS /CollectionMetaDataFile/CollectionMetaDataSets/Collections/CollectionMetaData/Contact[Role='User Services']
DCAT dcat:contactPoint
ISO /*/gmd:identificationInfo/*/gmd:pointOfContact
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:pointOfContact
Metadata Identifier | A phrase or string which uniquely identifies the metadata file/record.
DIF /dif:DIF/dif:Entry_Id
ISO /*/gmd:fileIdentifier/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:metadataIdentifier/mcc:MD_Identifier
Resource Access Constraints | Information about any constraints for accessing the data set. This includes any special restrictions, legal prerequisites, limitations and/or warnings on obtaining the data set. Some words that may be used in this field include: Public, In-house, Limited. Additional detailed instructions on how to access the data can also be entered in this field.
DIF /dif:DIF/dif:Access_Constraints
ECHO /*/echo:RestrictionFlag
ECHO /*/echo:RestrictionComment
ECHO /*/echo:Visible
ECS /*/ecs:AccessConstraints
DCAT dct:accessLevel
FGDC /fgdc:metadata/fgdc:idinfo/fgdc:accconst
ISO /*/gmd:identificationInfo/*/gmd:resourceConstraints/gmd:MD_LegalConstraints/gmd:accessConstraints/gco:CharacterString
ISO /*/gmd:identificationInfo/*/gmd:resourceConstraints/gmd:MD_LegalConstraints/gmd:otherConstraints/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:resourceConstraints/mco:MD_LegalConstraints/mco:accessConstraints/gco:CharacterString
ISO-1 /mdb:MD_Metadata/mdb:identificationInfo/*/mri:resourceConstraints/mco:MD_LegalConstraints/mco:otherConstraints/gco:CharacterString
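
The table above can be read as a crosswalk from one dialect to another. The following is a minimal sketch, not a reference implementation, of copying a few DIF values into DCAT properties with Python's standard library. The namespace URI, the sample record, and the rule "take the first DIF path that yields a value" are assumptions made for illustration. Because ElementTree evaluates paths relative to the root element, the leading /dif:DIF/ of each table path is dropped.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI for the dif: prefix; adjust to match your records.
NS = {"dif": "http://gcmd.gsfc.nasa.gov/Aboutus/xml/dif/"}

# Subset of the table: DCAT property -> DIF paths, relative to the dif:DIF root.
# Trying the paths in listed order is an illustrative choice, not a rule from the table.
DIF_TO_DCAT = {
    "dct:title": ["dif:Entry_Title", "dif:Data_Set_Citation/dif:Dataset_Title"],
    "dct:description": ["dif:Summary/dif:Abstract"],
    "dct:publisher": [
        "dif:Data_Set_Citation/dif:Dataset_Publisher",
        "dif:Reference/dif:Publisher",
    ],
}

# Hypothetical DIF record used only for this example.
SAMPLE_DIF = """<dif:DIF xmlns:dif="http://gcmd.gsfc.nasa.gov/Aboutus/xml/dif/">
  <dif:Entry_Title>Hypothetical Sea Surface Temperature Collection</dif:Entry_Title>
  <dif:Data_Set_Citation>
    <dif:Dataset_Title>Alternate citation title</dif:Dataset_Title>
    <dif:Dataset_Publisher>Hypothetical Data Center</dif:Dataset_Publisher>
  </dif:Data_Set_Citation>
  <dif:Summary>
    <dif:Abstract>Daily gridded values.</dif:Abstract>
  </dif:Summary>
</dif:DIF>"""


def dif_to_dcat(xml_text):
    """Return a dict of DCAT properties filled from the first matching DIF path."""
    root = ET.fromstring(xml_text)
    record = {}
    for prop, paths in DIF_TO_DCAT.items():
        for path in paths:
            element = root.find(path, NS)
            if element is not None and element.text and element.text.strip():
                record[prop] = element.text.strip()
                break  # keep the first path that produced a value
    return record


if __name__ == "__main__":
    for prop, value in dif_to_dcat(SAMPLE_DIF).items():
        print(prop, "=", value)
```

A concept whose paths all come back empty is simply absent from the returned dict, which makes it easy to report missing mandatory attributes.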

XPath Note: The XPaths in this table use several wildcards. // means any path, so //gmd:CI_ResponsibleParty indicates a gmd:CI_ResponsibleParty anywhere in an XML file. /*/ indicates a single level with several possible elements, usually one of several concrete realizations of an abstract object. For example, /*/gmd:identificationInfo could be gmd:MD_Metadata/gmd:identificationInfo or gmi:MI_Metadata/gmd:identificationInfo, and gmd:identificationInfo/*/gmd:descriptiveKeywords could be gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords or gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:descriptiveKeywords.

Fit: The fit of the dialect path with the concept is estimated on a scale of 1 = excellent two-way fit, 2 = one-way fit or some other problem, 3 = extension required.
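
The wildcard conventions described in the note can be tried out with a short sketch. The example below uses Python's standard xml.etree.ElementTree, which supports only a subset of XPath and evaluates paths relative to the root element. For that reason the leading /*/ of a table path is dropped and // is written as .//. The sample record and its title are hypothetical. The namespace URIs are the ISO 19139 ones that the gmd, gmi and gco prefixes are assumed to stand for here.

```python
import xml.etree.ElementTree as ET

# Namespace URIs assumed for the prefixes used in the table's ISO paths.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gmi": "http://www.isotc211.org/2005/gmi",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Hypothetical minimal record; {root} becomes gmd:MD_Metadata or gmi:MI_Metadata.
TEMPLATE = """<{root} xmlns:gmd="http://www.isotc211.org/2005/gmd"
    xmlns:gmi="http://www.isotc211.org/2005/gmi"
    xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation>
        <gmd:CI_Citation>
          <gmd:title><gco:CharacterString>Example Title</gco:CharacterString></gmd:title>
        </gmd:CI_Citation>
      </gmd:citation>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</{root}>"""

# Table path: /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString
# The first "/*/" stands for the root element, so it is dropped for ElementTree.
# The remaining "*" matches any single element, here gmd:MD_DataIdentification.
TITLE_PATH = (
    "gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/"
    "gmd:title/gco:CharacterString"
)

# "//" style path: a gmd:CI_Citation at any depth, written ".//" in ElementTree.
ANYWHERE_PATH = ".//gmd:CI_Citation/gmd:title/gco:CharacterString"


def find_texts(xml_text, path):
    """Return the text of every element matched by path."""
    root = ET.fromstring(xml_text)
    return [element.text for element in root.findall(path, NS)]


def demo():
    """Evaluate both paths against records with two different root elements."""
    results = {}
    for root_name in ("gmd:MD_Metadata", "gmi:MI_Metadata"):
        doc = TEMPLATE.format(root=root_name)
        results[root_name] = (
            find_texts(doc, TITLE_PATH),
            find_texts(doc, ANYWHERE_PATH),
        )
    return results


if __name__ == "__main__":
    for root_name, (by_level, anywhere) in demo().items():
        print(root_name, by_level, anywhere)
```

Both paths find the same title whichever root element the record uses, which is the reason the table writes /*/ instead of naming gmd:MD_Metadata or gmi:MI_Metadata explicitly. A full XPath 1.0 engine would accept the table paths unchanged, including the predicates that ElementTree cannot evaluate.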


Data Discoverability /Accessibility