Data Risk Factors

From Earth Science Information Partners (ESIP)

Data collections can face a variety of risk factors. This page provides a set of resources that will be used by members of the ESIP Data Stewardship Committee to analyze and categorize these factors. The goal of this activity is to inform and enable a) individuals and organizations who manage data collections, and b) individuals and organizations who are wanting to help to reduce the risks associated with data preservation and stewardship.

MEETING NOTES


GROUP DOCUMENTS


RELEVANT LITERATURE

  1. Anderson, William L., Faundeen, John L., Greenberg, Jane, & Taylor, Fraser. (2011). Metadata for data rescue and data at risk. In Conference on Ensuring Long-Term Preservation in Adding Value to Scientific and Technical Data. http://hdl.handle.net/2152/20056
  2. Brunet, M., & Jones, P. (2011). Data rescue initiatives: bringing historical climate data into the 21st century. Climate Research, 47(1), 29–40. https://doi.org/10.3354/cr00960
  3. Downs, Robert R. & Chen, Robert S. (2017). Curation of scientific data at risk of loss: Data rescue and dissemination. In Johnston, Lisa (Ed). Curating Research Data. Volume One, Practical Strategies for Your Digital Repository. Association of College and Research Libraries. http://dx.doi.org/10.7916/D8W09BMQ
  4. Griffin, R.E. (2015). When are old data new data? GeoResJ, 6: 92–97. http://dx.doi.org/10.1016/j.grj.2015.02.004
  5. Griffin, E. (2018). The data that get forgotten. In Data Science Landscape (pp. 183–198). Springer Singapore. https://doi.org/10.1007/978-981-10-7515-5_13
  6. Hsu, L., Lehnert, K.A., Goodwillie, A., Delano, J.W., Gill, J.B., Tivey, M.A., Ferrini, V.L., Carbotte, S.M., Arko R.A. (2015). Rescue of long-tail data from the ocean bottom to the Moon: IEDA Data Rescue Mini-Awards, GeoResJ, 6:108-114, http://dx.doi.org/10.1016/j.grj.2015.02.012
  7. Knapp, K. R., Bates, J. J., & Barkstrom, B. (2007). Scientific Data Stewardship: Lessons Learned from a Satallite–Data Rescue Effort. Bulletin of the American Meteorological Society, 88(9), 1359–1362. https://doi.org/10.1175/BAMS-88-9-1359
  8. Levitus, S., (2012). The UNESCO-IOC-IODE "Global Oceanographic Data Archeology and Rescue" (GODAR) Project and "World Ocean Database" Project. Data Science Journal, 11, pp. 46–71. http://doi.org/10.2481/dsj.012-014
  9. Michener, William K., et al. “Nongeospatial Metadata for the Ecological Sciences.” Ecological Applications, vol. 7, no. 1, Feb. 1997, pp. 330–42. https://doi.org/10.1890/1051-0761(1997)007[0330:NMFTES]2.0.CO;2
  10. Park, E. G., Burr, G., Slonosky, V., Sieber, R., & Podolsky, L. (2018). Data rescue archive weather (DRAW). Journal of Documentation, 74(4), 763–780. https://doi.org/10.1108/jd-10-2017-0150
  11. Poli, P., Dee, D. P., Saunders, R., John, V. O., Rayer, P., Schulz, J., … Bojinski, S. (2017). Recent Advances in Satellite Data Rescue. Bulletin of the American Meteorological Society, 98(7), 1471–1484. https://doi.org/10.1175/BAMS-D-15-00194.1
  12. Ryan, C., Duffy, C., Broderick, C., Thorne, P. W., Curley, M., Walsh, S., … Murphy, C. (2018). Integrating Data Rescue into the Classroom. Bulletin of the American Meteorological Society, 99(9), 1757–1764. https://doi.org/10.1175/BAMS-D-17-0147.1
  13. Ryan, H. (2014). Occam’s razor and file format endangerment factors. Proceedings of the 11th International Conference on Digital Preservation (iPres), October 6-10, 2014: Melbourne, Australia (pp. 179-188). https://www.nla.gov.au/sites/default/files/ipres2014-proceedings-version_1.pdf
  14. Thompson, C.A., Robertson, W. D., & Greenberg, J. (2014). Where have all the scientific data gone? LIS perspective on the data-at-risk predicament. College & Research Libraries, 75(6), 842-861. https://doi.org/10.5860/crl.75.6.842


ORGANIZATIONS & GUIDELINES

  1. Guidelines to the Rescue of Data At Risk, https://www.rd-alliance.org/guidelines-rescue-data-risk
  2. Research Data Alliance Data Rescue Interest Group. https://www.rd-alliance.org/groups/data-rescue.html
  3. US Geological Survey. 2016 Data at Risk Project. https://www.fort.usgs.gov/ldi/2016-data-at-risk-project
    1. Link to Risk Evaluations and Reporting criteria: https://www.fort.usgs.gov/ldi/evaluations-reports
  4. National Oceanic and Atmospheric Administration. Climate Database Modernization Program. https://www.ncdc.noaa.gov/climate-information/research-programs/climate-database-modernization-program
  5. International Environmental Data Rescue Organization (IEDRO), http://iedro.org/
  6. Tan, L. S., S. Burton, R. Crouthamel, A. van Engelen, R. Hutchinson, L. Nicodemus, T. C. Peterson, F. Rahimzadeh. (2004). Guidelines on Climate Data Rescue. WMO/TD No. 1210. Ed. by P. Llansó and H. Kontongomde. Geneva, Switzerland: World Meteorological Organization. http://www.wmo.int/pages/prog/wcp/wcdmp/documents/WCDMP-55.pdf
  7. Beaverson, Sheena. (2009). Data Deaccessioning Guidelines for the NSIDC DAAC. http://wiki.esipfed.org/images/8/87/Beaverson_DataPlan.pdf
  8. NESDIS Data Archive Board, (2002). Scientific Data Stewardship - Executive Summary.
  9. Digital Preservation Coalition - The 'Bit List' of Digitally Endangered Species


CODATA/RDA 2016 WORKSHOP

The Rescue of Data At Risk: An RDA / CODATA Workshop http://www.codata.org/task-groups/data-at-risk/dar-workshops

Agenda - http://www.codata.org/uploads/RescueofDataatRisk-WorkshopAgenda.pdf

Videos

Presentation slides - https://drive.google.com/drive/folders/0B_NJP_ik1Aj-M0Z5aG1tSFNDVmM?usp=sharing


TRUSTED DATA REPOSITORY EVALUATION AND CERTIFICATIONS

  1. Center for Research Libraries and Online Computer Library Center, Inc. (2007). Trustworthy Repositories Audit & Certification: Criteria and Checklist. https://www.crl.edu/archiving-preservation/digital-archives/metrics-assessing-and-certifying/iso16363
  2. Data Preservation Sub-Committee of the USGS Fundamental Science Practices Advisory Committee (FSPAC). (2015). FSPAC Data Preservation Criteria for a Trusted Digital Repository. https://docs.google.com/spreadsheets/d/1DQ3XDRoe7THmoFxwPoV2lcEqkn0Q7SkrPQfizxm6fLU/edit#gid=1446608780
  3. Data Seal of Approval and ICSU World Data System. (2016). Core Trustworthy Data Repositories Requirements. https://drive.google.com/file/d/0B4qnUFYMgSc-eDRSTE53bDUwd28/view
  4. McGovern, N. Y. (2016). DPM “Stack”: A Management Infrastructure Frame for Digital Preservation that Parallels Technical Infrastructure. https://www.dataone.org/sites/default/files/sites/all/documents/dws13dec2016.pdf
  5. Nestor Certification Working Group. (2013). Explanatory Notes on the Nestor Seal for Trustworthy Digital Archives. http://d-nb.info/1047613859/34
  6. Peng, G. (2015). The scientific data stewardship maturity assessment model template. Version: NCDC-CICS-SMM-0001-Rev.1 v4.0 06/23/2015. https://dx.doi.org/10.6084/m9.figshare.1211954
  7. Rans, J. and Whyte, A. (2017). Using RISE: the Research Infrastructure Self-Evaluation Framework. http://www.dcc.ac.uk/resources/how-guides/RISE
  8. Stall, S. (2016, December). So You want to Be Trustworthy: A repository’s Guide to Taking Reasonable Steps Towards Achieving ISO16363. Poster presented at the 2016 AGU Fall Meeting, San Francisco, CA.


COMMITTEE MEMBER EXPERIENCE

  1. Downs, RR. 2009. Managing Risks to Scientific Data, Presented at the NYU/IBM Workshop on Managing Data Risk: Acquisition, Processing, Retention and Governance. New York, NY, April 24, 2009. Columbia University Academic Commons, https://doi.org/10.7916/D8697G5P.
  2. Ramapriyan, H.K. 2017. NASA's EOSDIS, Trust and Certification. Presented at 2017 ESIP Summer Meeting, Bloomington, IN. https://figshare.com/articles/NASA_s_EOSDIS_Trust_and_Certification/5258047