Interagency Data Stewardship/LifeCycle/Preservation Forum/TeleconNotes/2017-02-13meetingnotes

From Earth Science Information Partners (ESIP)

Meeting Notes - Data Stewardship Committee - 2017-02-13 1 p.m. EST / 11 a.m. MST (meeting was not held at usual 2 p.m. EST meeting time)

  • Join the meeting from your computer, tablet or smartphone.
  • You can also dial in using your phone.
  • United States: +1 (408) 650-3123
  • Access Code: 453-694-565


Attendees: Matt Mayernik, Bruce Caron, Shelley Stall, Sophie Hou, Rama, Tamar Norkin, Vicky Wolf, Ruth Duerr, Madison Langseth, Heather Brown, Nancy Hoebelheinrich, Bob Downs, Jamie Collins


Notes:

1) Introduction of new chairs and student fellow (Matt, Sophie, Jamie)


2) Some general thoughts - priorities for the upcoming year:

  • Matt: Very interested in fostering collaborations across ESIP focus areas/clusters; interested in hearing ideas from cluster members about how to best achieve this
  • Ruth: ESIP is about to have its 20th anniversary, and ESIP currently does not have a preservation/stewardship policy. It is a topic that Ruth is helping to bring to ESIP leadership’s attention and an opportunity to work with other groups (e.g., the sustainability cluster).


3) Discussion re: creation of 2017 DS cluster Strategic Plan

  • Will use previous year’s plan (see link) as a blueprint
  • New document has been started on Wiki
  • Goal will be to have plan in place within ~1 month
  • Will look to formalize by next month’s telecon


4) ESIP Winter Meeting recap/updates pertinent to DS activities

  • Saw a suite of nice processing/analysis methods that various ESIP members currently use
  • Might be a logical group with which DS cluster can collaborate
  • Rama was one of the co-chairs for the Info Quality cluster meeting; he was not able to attend the meeting in person, but the other co-chairs, David Moroni and Ge Peng, were present
  • Focus of IQ cluster meeting was forging relationships with other clusters
  • Good discussion about CMIP5 data
  • IQ cluster has several members who might be great as guest speakers/presenters in our own DS telecons over course of next year
  • Madison: With Sophie, presented some usability tests (using cognitive walkthrough, remote user study, and group user study techniques)
  • Nancy H.: DMT working group will be collaborating with Usability Cluster regarding Clearinghouse testing
  • Ruth: Sustainability Cluster would be another good group to collaborate with
  • Matt: Thanks to Rama and Justin for leadership of cluster over past year!


5) Cluster Roundtable, with objective of soliciting ideas/priorities for cluster over next year

  • Matt's prompt: What are you thinking about in your own work that you think we (in DS cluster) can focus on over course of next year?
  • Much of this input will be used to help build the 2017 Strategic Plan
  • Matt:
      • How can NCAR promote connectivity in related resources across NCAR’s different repositories? (probably something other organizations/institutions are also dealing with)
          • e.g., provenance, Semantic Web
  • Rama:
      • Has made some significant progress in getting it adopted as an ISO standard
      • IQ cluster has a parallel activity in the NASA Data Quality Working Group
      • Next month: Meeting of data science working groups within NASA
      • Looking forward to forging new/better connections between IQ and DS clusters
  • Jamie:
      • Very interested in stewardship of software products
      • Many current practices in data stewardship and preservation are broadly applicable to software sustainability
      • Rama adds that NASA has a working group which has been looking at software citations; thus curation/management of software would certainly be a relevant topic for the Data Stewardship Committee
  • Bob Downs (NASA):
      • Part of NASA Earth Science Data Systems Working Group, focus on data quality/software citations (with Rama)
      • Senior Digital Archivist at Center for International Earth Science Information Network (CIESIN)
      • Interested in sustainability of repositories and the data we put in them; also has an interest in figuring out ways to justify the impact of repositories (particularly in times of tight funding)
  • Heather Brown (NCEI):
      • Interested in continuing/maintaining the cluster’s Data Management Training (see last year’s strategic plan); particularly in making it relevant to NCEI scientists and archived data. End-to-end stewardship.
      • +1 on previous comments re: sustainability
      • +1 on previous comments re: data management training
      • Citation, identifiers, software, “trusted repository”, etc.
  • Madison Langseth (USGS):
      • Like Matt, trying to ensure that data are well-connected to publications and other products
          • Technologies to make those links
      • Also interested in data management training, and pushing it out to the scientists who generate the data
      • Dynamic data citation
      • Usability of data repositories both by end users and data producers
  • Nancy Hoebelheinrich (Knowledge Motifs, LLC):
      • Also interested in how we capture the value of particular data repositories
      • Continued focus on ESIP's Data Management Training
      • How do we define the “core skills” for data professionals and data scientists?
      • Area of possible collaboration with other clusters
      • CLEAN cluster is focusing on educating potential generators and users of data as early as possible in the educational continuum; might be something DS cluster wants to explore
  • Ruth Duerr (Ronin Institute):
      • +1 on comments re: sustainability
      • +1 on comments re: new technologies to make connections between data and publications
      • Need to update ESIP citation guidelines to reflect dynamic data citation technologies
      • Good example: the RDA group, which has some good webinars
      • Could invite Andreas Rauber from RDA Data Citation Working Group
  • Shelley Stall (AGU):
      • In her job at AGU: promoting best practices for data management and accession of data into repositories
      • Will have some assessments, but currently much more in the “education” phase
      • Personal passion over past year: Focus on education of early career community:
          • What do students/early career scientists need to know, and how can we deliver that information?
      • Has been working with COPDESS: Coalition for Publishing Data in the Earth and Space Sciences
      • These are issues that many agencies/organizations are dealing with
      • Helping Ruth with dynamic data citation
  • Sophie Hou (NCAR):
      • +1 sustainability (both in terms of infrastructure of repositories themselves, and in terms of culture/best practices)
      • +1 education: need to get training out to all constituencies:
          • Data managers
          • Scientists/data generators
          • Students/early career community
      • Attribution & acknowledgment need to be part of the citation discussion
      • What technologies can we use to enhance connections among data managers, and between data managers and other stakeholders/members of the data community?
      • Need to think about carrots and sticks -- what can we do to ensure people are adhering to DS best practices (to keep stakeholders happy)
  • Tamar Norkin (USGS):
      • +1 on ways to increase/promote links between repositories, sustainability, data citations
  • Vicky Wolf (NASA):
      • Use of the Cloud for both data and citations
      • Need to think about how we sustain and keep track of provenance/origin of data and any derivative products once we move into the cloud


6) Discussion of possible DS Committee response to NSF "2030" Cyberinfrastructure Dear Colleague Letter (DCL)

  • Matt and Ruth encouraged everyone to share their thoughts on Loomio (see above)