Difference between revisions of "Documenting Identifiers"

From Earth Science Information Partners (ESIP)

Revision as of 09:16, October 21, 2012

The importance of unique identifiers for data is well understood in terms of data citation and unambiguous attribution. As metadata are shared between national and international repositories, it is becoming increasingly important to be able to identify and refer to specific records unambiguously. Including an identifier for both the metadata record and the data in the metadata meets this requirement, provided some mechanism ensures that the identifiers are unique. That mechanism is the naming authority, or namespace, specified for the identifier: the manager of the namespace is responsible for ensuring that the identifiers within it are unique.
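As an illustration of why the naming authority matters, the following minimal Python sketch treats an identifier as unique only in combination with its namespace. The class name, the helper function, and the sample authorities and codes are all hypothetical and are not taken from any of the metadata standards discussed here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QualifiedIdentifier:
    """An identifier scoped by the naming authority that issued it."""
    naming_authority: str  # hypothetical namespace name
    code: str              # identifier, unique only within that namespace


def find_collisions(identifiers):
    """Return the qualified identifiers that occur more than once."""
    seen, duplicates = set(), set()
    for identifier in identifiers:
        if identifier in seen:
            duplicates.add(identifier)
        seen.add(identifier)
    return duplicates


# Hypothetical records: the same code issued by two different authorities,
# plus one genuine repeat within a single authority.
records = [
    QualifiedIdentifier("org.example.archive-a", "sst-monthly-v2"),
    QualifiedIdentifier("org.example.archive-b", "sst-monthly-v2"),
    QualifiedIdentifier("org.example.archive-a", "sst-monthly-v2"),
]
collisions = find_collisions(records)
```

The same code under two different authorities is not reported as a collision; only the repeat inside one namespace is, which is the case the namespace manager is expected to prevent.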

Crosswalks

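<table>
<tr>
<th>Concept</th>
<th>Description</th>
<th>Paths</th>
</tr>
<tr>
<td style="word-break:break-all;">Metadata Identifier</td>
<td style="word-break:break-all;">A unique reference to the record within the catalogue (Metadata Identifier)</td>
<td style="word-break:break-all;"><b>DIF</b> /dif:Entry_Id<br><b>ISO</b> /*/gmd:fileIdentifier/gco:CharacterString<br></td>
</tr>
<tr>
<td style="word-break:break-all;">Resource Identifier</td>
<td style="word-break:break-all;">Identifier for the cited resource</td>
<td style="word-break:break-all;"><b>DIF</b> /dif:DIF/dif:Data_Set_Citation/dif:Dataset_DOI<br><b>DIF</b> /dif:DIF/dif:Reference/dif:DOI<br><b>ECHO</b> /echo:Collection/echo:DataSetId<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:identifier/gmd:MD_Identifier/gmd:code/gco:CharacterString<br><b>THREDDS</b> /thredds:catalog/thredds:dataset/@ID<br><b>netCDF</b> /nc:netcdf/nc:attribute[@name='id']<br></td>
</tr>
<tr>
<td style="word-break:break-all;">Naming Authority</td>
<td style="word-break:break-all;">The organization responsible for the maintenance of the identifier (namespace)</td>
<td style="word-break:break-all;"><b>ISO</b> /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:identifier/gmd:MD_Identifier/gmd:authority<br><b>ISO</b> /*/gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation/gmd:identifier/gmd:MD_Identifier/gmd:codeSpace (19115-1)<br><b>THREDDS</b> /thredds:catalog/thredds:dataset/@ID<br><b>netCDF</b> /nc:netcdf/nc:attribute[@name='naming_authority']<br></td>
</tr>
<tr>
<td style="word-break:break-all;">Parent Identifier</td>
<td style="word-break:break-all;">A unique identifier for a parent dataset or collection</td>
<td style="word-break:break-all;"><b>ISO</b> /*/gmd:parentIdentifier/gco:CharacterString<br></td>
</tr>
</table>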
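
The netCDF rows of the crosswalk can be tried against an NcML document. The sketch below uses Python's standard xml.etree.ElementTree module, which supports only a subset of XPath: paths are evaluated relative to the root element, so /nc:netcdf/nc:attribute becomes ./nc:attribute, and the attribute value in the predicate has to be quoted. The sample NcML content, the attribute values, and the helper name global_attribute are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Prefix-to-namespace mapping used when evaluating the paths.
NS = {"nc": "http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2"}

# Hypothetical NcML fragment with the two global attributes of interest.
NCML = """<netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
  <attribute name="id" value="sst-monthly-v2"/>
  <attribute name="naming_authority" value="org.example.archive-a"/>
</netcdf>"""


def global_attribute(root, name):
    """Return the value of the named global attribute, or None if absent."""
    node = root.find(f"./nc:attribute[@name='{name}']", NS)
    return None if node is None else node.get("value")


root = ET.fromstring(NCML)
resource_id = global_attribute(root, "id")
naming_authority = global_attribute(root, "naming_authority")
```

Reading the two attributes together gives the identifier and the namespace in which it is expected to be unique.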

XPath Note: The XPaths included in this table use two wildcards. // matches any path, so //gmd:CI_ResponsibleParty selects a gmd:CI_ResponsibleParty element anywhere in an XML file. /*/ matches exactly one level, which may be occupied by any one of several possible elements. This usually indicates one of several concrete realizations of an abstract object. For example, /*/gmd:identificationInfo could be gmd:MD_Metadata/gmd:identificationInfo or gmi:MI_Metadata/gmd:identificationInfo, and gmd:identificationInfo/*/gmd:descriptiveKeywords could be gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords or gmd:identificationInfo/srv:SV_ServiceIdentification/gmd:descriptiveKeywords.
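
The effect of the two wildcards can be sketched with Python's standard xml.etree.ElementTree module. The records built here are heavily simplified and not schema-valid, the identifier values are invented, and make_record is a hypothetical helper. ElementTree evaluates paths relative to the root element, so the leading /*/ of the crosswalk paths is written as ./ in this sketch.

```python
import xml.etree.ElementTree as ET

NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
    "gmi": "http://www.isotc211.org/2005/gmi",
}

# Paths relative to the root element (the level matched by the leading /*/).
METADATA_ID = "./gmd:fileIdentifier/gco:CharacterString"
RESOURCE_ID = (
    "./gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation"
    "/gmd:identifier/gmd:MD_Identifier/gmd:code/gco:CharacterString"
)
ANY_IDENTIFIER = ".//gmd:MD_Identifier"  # '//' matches at any depth


def make_record(root_tag):
    """Build a simplified, hypothetical ISO record with the given root element."""
    return f"""<{root_tag} xmlns:gmd="{NS['gmd']}" xmlns:gco="{NS['gco']}" xmlns:gmi="{NS['gmi']}">
  <gmd:fileIdentifier><gco:CharacterString>meta-001</gco:CharacterString></gmd:fileIdentifier>
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation><gmd:CI_Citation>
        <gmd:identifier><gmd:MD_Identifier>
          <gmd:code><gco:CharacterString>sst-monthly-v2</gco:CharacterString></gmd:code>
        </gmd:MD_Identifier></gmd:identifier>
      </gmd:CI_Citation></gmd:citation>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</{root_tag}>"""


# The same paths work whichever concrete root element the record uses.
results = {}
for root_tag in ("gmd:MD_Metadata", "gmi:MI_Metadata"):
    root = ET.fromstring(make_record(root_tag))
    results[root_tag] = (
        root.findtext(METADATA_ID, namespaces=NS),
        root.findtext(RESOURCE_ID, namespaces=NS),
        len(root.findall(ANY_IDENTIFIER, NS)),
    )
```

Both root elements yield the same metadata identifier and resource identifier, which is the reason the crosswalk paths are written with wildcards rather than a fixed root.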