Main Page/Start here
|Line 42:||Line 42:|
== data ==
== Existing data facilities ==
== Existing data facilities ==
Revision as of 22:54, 3 September 2014
This is the test page for the Editor's Roundtable - an Initiative for Best Practices for Data Publication
What is the Editor's Roundtable?
The Editor's Roundtable is a community-based initiative, conceived by editors, publishers, and operators of data facilities at a series of meetings held in conjunction with major scientific conferences that IEDA facilitated. It is an effort to facilitate and foster communication and knowledge exchange among editors and publishers of Earth Science journals and Earth Science data facilities with the goal to develop, implement, and promote guidelines and best practices for scholarly publishing with particular emphasis on the publication of data in support of open access policies.
The Editors Roundtable builds on a successful initiative started in 2007 by EarthChem, which since 2010 is part of the IEDA data facility, to develop and promote best practices for the reporting of geochemical data in scholarly articles and data systems, and that resulted in a Policy Recommendation ‘Requirements for the Publication of Geochemical Data’ (Goldstein et al. 2014, doi:10.1594/IEDA/100426), which was endorsed by all major scientific journals that publish geochemical data and has guided policies for the disclosure and documentation of geochemical data.
The “best practices” will include standards for:
- Archiving and curating data sets;
- Setting references and identifiers;
- Linking datasets to publications;
- Integrating with emerging data citation practices and bibliometrics for data;
- Complying with interoperability standards.
- Data Accessibility and Format
Access to the complete data, upon which new scientific discovery and knowledge is based, is a fundamental requirement for the reproducibility of scientific results. All NEW geochemical data used in a publication must be made available for future use by (1) submission to an accessible, persistent source such as a public database or data archive (for example, personal web sites are not persistent data archives), if it exists for the specific data type, or by (2) listing the data explicitly in a data table associated with the publication. The data must in any case be available in downloadable format. For chemical abundance data of samples, elemental or oxide abundance data must be given unless a compelling reason can be provided; elemental abundance ratios are acceptable only if the compositional data do not exist. Isotope ratios are, of course, acceptable.
Data should be reported in tabular format. Data must always be available as a downloadable file in a format that can be easily converted into spreadsheet format (for example, .csv, .txt). The file should include units for the listed measured values. This means that if a publication contains a data table in the main text or a pdf or image version of the data table as an electronic supplement, the data in the table(s) must also be available in a downloadable form that can be easily converted to spreadsheet format.
- Data Quality Information
Recommended practices for data submission
Recommended practices for citations
The core required elements of a citation are Author(s)--the people or organizations responsible for the intellectual work to develop the data set. The data creators. Release Date--when the particular version of the data set was first made available for use (and potential citation) by others. Title--the formal title of the data set Version--the precise version of the data used. Careful version tracking is critical to accurate citation. Archive and/or Distributor--the organization distributing or caring for the data, ideally over the long term. Locator/Identifier--this could be a URL but ideally it should be a persistent service, such as a DOI, Handle or ARK, that resolves to the current location of the data in question. Access Date and Time--because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed. Additional fields can be added as necessary to credit other people and institutions, etc. Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted. An example citation: Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated 2003. CLPX-Ground: ISA snow depth transects and related measurements ver. 2.0. Edited by M. Parsons and M. J. Brodzik. National Snow and Ice Data Center. Data set accessed 2008-05-14 athttp://dx.doi.org/10.5060/D4MW2F23z
Existing data facilities
- Data cite
- EarthCube: Council of Data Facilities
- USGS publication warehouse
- NERC data centers includes the following: