Sustainable Data Management/20200509 telcon notes

From Earth Science Information Partners (ESIP)

To connect

  • Dial in using your phone: :United States: +1 (571) 317-3122
    • Access Code: 618-011-013

Agenda

  1. Review notes and ideas from February
  2. 3 Slides for ESIP Collaboration Area Highlights

Goal: summarize our activities to the rest of the Federation, solicit ideas or members.


Attending

  • Megan Carter
  • Erin Antognoii (with NAL, Ag Commons)
  • Ruth Duerr
  • Corinna Gries
  • Shelley Stall
  • Rebecca Koskela
  • Margaret O'Brien (scribe)
  • Bob Downs
  • Philip Tarrant

Regrets

  • Cyndy Parr

Notes

feedback from data stewardship group: Nice idea, needs some scoping.

  • Use cases
    • Primary: someone suggests starting a new repo. asks: what do I need to do?
    • Primary: one of the 300+ domain repos asks, what dos it mean for me to be a repository in this community?
    • Secondary: NSF can use this doc to review proposals for repositories.


  • We would not detail the defining services, products for any repo. Instead, offer a set of common features across all. guidelines for implementation.
  • can we guide funding-limited repos on what goals they might want to plan for.
  • differentiate from Core trust seal

TRUST Principles: transparency (T), responsibility (R), user community (U), sustainability (S) and technology (T):

similar to the FAIR principles (ie. high level). began(?) as RDA initiative (Philadelphia, WDS cert)


one approach - organize guidelines around these facets, ask what do repos have to implement to enable that?

possible format:

  • index? or
  • checklist?
  • for x see y
    • examples
      • for identifiers: point to a founding doc
      • data access: (our paper on tech interop) also something from NASA. when do you want to use an OGC service, when netcdf.

does Core Trust Seal have any recommendations for implementation? not really, unless you are working on an application.

some suggestions that they tighten requirements, did not happen.

Goal should be: a repository is "CTS, TRUSTed FAIR OAIS"

Others working in this space:

Next things to focus on

  • look over available material
  • Decide what repos aspects we want to cover, continue scoping
  • Decide on scope: probably earth/env science (not all research)

Action Items:

Chat log

(might have some additional links):

Megan Carter (to All): Nice to meet you, Erin.

1:04 PMErin Antognoli (to All): My email: Erin.Antognoli@usda.gov

1:14 PMMegan Carter (to All): http://wiki.esipfed.org/index.php/Sustainable_Data_Management/20200410_telcon_notes

1:14 PMTo All: http://wiki.esipfed.org/index.php/Sustainable_Data_Management/20200410_telcon_notes

1:29 PMPhilip Tarrant (to All): Got to drop off. It was good to see everyone.

1:30 PMRebecca Koskela (to All): Bob - is this the RDA group? https://www.rd-alliance.org/ig-rdawds-certification-digital-repositories-rda-13th-plenary-meeting-0

1:36 PMShelley Stall (to All): Here's the white paper: https://docs.google.com/document/d/1pbRw1uf-W-BRMxjj8ZNZJHQTI-ggLjRWq8jiu4Oy6Yc/edit?usp=sharing

1:36 PMShelley Stall (to All): for TRUST

1:42 PMRobert Downs (to All): Yes, that is the right group and the correct white paper

1:44 PMShelley Stall (to All): https://docs.google.com/document/d/1TIV8Q5-4uBhFhdsA3cPvndKrE2P9ZIKyGJ8XBo22ufk/edit?usp=sharing

1:49 PMRebecca Koskela (to All): https://datatogether.org/

1:50 PMShelley Stall (to All): https://www.fairsfair.eu

1:53 PMRebecca Koskela (to All): and also this - same name but they don't reference each other datatogether300.jpg

To download, add https protocol to this: www.rd-alliance.org/system/files/datatogether300.jpg

Follow up Notes

  • From Nancy:
more context to the suggestion about creating guidelines on repositories based on the discussion that happened at the Data Stewardship face to face meeting in January which gave rise to the idea. The context might change the focus or approach a bit, so wanted to provide it since not all of those who are discussing the idea were present at that meeting.
The issue came up from both a senior researcher and a data manager who said that they are often asked from other researchers (or “customers” I suppose you could say from the data manager’s POV) what is the best way/ place to store their research data (and why). While each of them can answer that question in their own way based on their experience and exposure to archives and/or repositories, they have not been able to find a reference that they can point to which would provide good, fairly quick answers to these kinds of questions. They thought a guide targeted to _researchers_ would be the most effective; we also discussed turning the information into a tutorial or learning resource that could be referenced as well (and, of course, included in registries of such learning resources!). We talked about taking this on as an activity as part of the Data Stewardship Committee, but haven’t yet come up with a plan for how to do that. So, I am very glad that the Sustainability cluster is interested in working on this. What I might suggest as an approach, though, based on the discussion at the ESIP Winter meeting is to see if we could come up with a survey or some other way to ask the folks (our colleagues) who get these kinds of questions from others what are the most frequently asked questions that they get from researchers or the areas that are most puzzling and troublesome to researchers who are [simply] trying to be responsible about putting their research data somewhere, but don’t want to get into the nitty gritty details. The survey results could help guide the areas to discuss within this broad topic, and may also help the group decide how best to represent the answers to the questions, i.e., as a manuscript è paper, or a checklist, a FAQ, a tutorial, etc.
Anyway, I will plan on attending future cluster meetings, and offer what help I can on this activity. I think this could be a good resource! I am assuming that the group would be looking for already existing resources that might provide useful, and that is part of what could / should be included in this folder or in zotero. Is that correct?
  • From Shelly:
an ESIP session related to Nancy's comment
Supporting the Earth Science Community in Repository Discovery
There remains a need for the Earth Science Community to be supported in determining the most appropriate repository for submission of their data for sharing, reuse and long term preservation. There are several criteria of importance to data creators in addition to specific funder, institutional and/or publisher guidance and multiple education, outreach and catalog resources have been developed in support of this challenge (e.g. re3data.org, FAIRsFAIR, repository finder decision tree, DataSeer). In this session we will briefly discuss existing resources in order to understand gaps and opportunities. Through a facilitated discussion we will identify opportunities for unique contribution to this space in support of the Earth science community, and next steps. The remainder of the session will be run as a working activity to begin to forward these activities.