Interagency Data Stewardship/LifeCycle/Preservation Forum/TeleconNotes/2017-02-13meetingnotes
Meeting Notes - Data Stewardship Committee - 2017-02-13 1 p.m. EST / 11 a.m. MST (meeting was not held at usual 2 p.m. EST meeting time)
- Join the meeting from your computer, tablet or smartphone.
- You can also dial in using your phone.
- United States: +1 (408) 650-3123
- Access Code: 453-694-565
Attendees: Matt Mayernik, Bruce Caron, Shelley Stall, Sophie Hou, Rama, Tamar Norkin, Vicky Wolf, Ruth Duerr, Madison Langseth, Heather Brown, Nancy Hoebelheinrich, Bob Downs, Jamie Collins
Notes:
1) Introduction of new chairs and student fellow (Matt, Sophie, Jamie)
2) Some general thoughts - priorities for the upcoming year:
- Matt: Very interested in fostering collaborations across ESIP focus areas/clusters; interested in hearing ideas from cluster members about how to best achieve this
- Ruth: ESIP is about to have its 20th anniversary, and ESIP currently does not have a preservation/stewardship policy. It is a topic that Ruth is helping to bring to ESIP leadership’s attention and an opportunity to work with other groups (e.g., the sustainability cluster).
3) Discussion re: creation of 2017 DS cluster Strategic Plan
- Plan (in development): 2017 Strategic Plan
- For reference: 2016 Strategic Plan
- Notes:
- Will use previous year’s plan (see link) as a blueprint
- New document has been started on Wiki
- Goal will be to have plan in place within ~ 1 month
- Will look to formalize by next month’s telecon
4) ESIP Winter Meeting recap/updates pertinent to DS activities
- Sophie: Attended "intro"/debut meeting for new Data Analytics cluster
- Saw a suite of nice processing/analysis methods that various ESIP members currently use
- Might be a logical group with which DS cluster can collaborate
- Rama was one of the co-chairs for the Info Quality cluster meeting; he was not able to attend the meeting in person, but the other co-chairs, David Moroni and Ge Peng, were present
- Focus of IQ cluster meeting was forging relationships with other clusters
- Good discussion about CMIP5 data
- IQ cluster has several members who might be great as guest speakers/presenters in our own DS telecons over course of next year
- Madison: With Sophie, presented some usability tests (using cognitive walkthrough, remote user study, and group user study techniques)
- Nancy H.: DMT working group will be collaborating with Usability Cluster regarding Clearinghouse testing
- Ruth: Sustainability Cluster would be another good group to collaborate with
- Matt: Thanks to Rama and Justin for leadership of cluster over past year!
5) Cluster Roundtable, with objective of soliciting ideas/priorities for cluster over next year
- Matt's prompt: What are you thinking about in your own work that you think we (in DS cluster) can focus on over course of next year?
- Much of this input will be used to help build the 2017 Strategic Plan
- Matt:
- How can NCAR promote connectivity in related resources across NCAR’s different repositories? (probably something other organizations/institutions also dealing with)
- e.g., provenance, Semantic Web
- Ruth offered some ideas, e.g., RMAP (https://test.rmap-hub.org/app/) and use of ORCID to connect people/datasets/other products. A poster about it is at https://dataconservancy.org/poster-presentation-on-linked-data-and-share-at-2016-charleston-conference/. A presentation about it happened in a session at the last ESIP meeting (attached at http://commons.esipfed.org/node/9512)
- Rama:
- Focus remains on data stewardship and preservation
- Provenance and Context Content Standard (PCCS) has been adopted by NASA
- Has made some significant progress in getting it adopted as an ISO standard
- IQ cluster has a parallel activity in the NASA Data Quality Working Group
- Next month: Meeting of data science working groups within NASA
- Looking forward to forging new/better connections between IQ and DS clusters
- Jamie:
- Very interested in stewardship of software products
- Many current practices in data stewardship and preservation are broadly applicable to software sustainability
- Rama adds that NASA has a working group which has been looking at software citations; thus curation/management of software would certainly be a relevant topic for the Data Stewardship Committee
- Bob Downs (NASA):
- Part of NASA Earth Science Data Systems Working Group, focus on data quality/software citations (with Rama)
- Senior Digital Archivist at Center for International Earth Science Information Network (CIESIN)
- Interested in sustainability of repositories and the data we put in them; also has an interest in figuring out ways to justify the impact of repositories (particularly in times of tight funding)
- Heather Brown (NCEI):
- Interested in continuing/maintaining the cluster’s Data Management Training (see last year’s strategic plan); particularly in making it relevant to NCEI scientists and archived data. End-to-end stewardship.
- +1 on previous comments re: sustainability
- +1 on previous comments re: data management training.
- Citation, Identifiers, software, “trusted repository”, etc.
- Madison Langseth (USGS):
- Like Matt, trying to ensure that data are well-connected to publications and other products
- Technologies to make those links
- Also interested in data management training, and pushing it out to the scientists who generate it
- Dynamic data citation
- Usability of data repositories both by end users and data producers.
- Nancy Hoebelheinrich (Knowledge Motifs, LLC):
- Also interested in how we capture the value of particular data repositories
- Continued focus on ESIP's Data Management Training
- How do we define the “core skills” for data professionals and data scientists?
- Area of possible collaboration with other clusters
- CLEAN cluster focusing on educating potential generators and users of data as early on as possible in the educational continuum; might be something DS cluster wants to explore
- Ruth Duerr (Ronin Institute):
- +1 comments on sustainability
- +1 comments on new technologies to make connections between data and publications
- Need to update ESIP citation guidelines to reflect dynamic data citation technologies
- Good example: RDA group, has some good webinars
- Could invite Andreas Rauber from RDA Data Citation Working Group
- Good example of citations to a dataset: http://dx.doi.org/10.7265/N55M63M1
- Shelley Stall (AGU):
- In her job at AGU: promoting best practices for data management and accession of data into repositories
- Will have some assessments, but currently much more in the “education” phase
- Personal passion over past year: Focus on education of early career community:
- What do students/early career scientists need to know, and how can we deliver that information?
- Has been working with COPDESS: Coalition for Publishing Data in the Earth and Space Sciences
- These are issues that many agencies/organizations are dealing with
- Helping Ruth with dynamic data citation
- Sophie Hou (NCAR):
- +1 sustainability (both in terms of infrastructure of repositories themselves, and in terms of culture/best practices)
- +1 education: need to get training out to all constituencies:
- Data managers
- Scientists/data generators
- Students/early career community
- Attribution & acknowledgment need to be part of the citation discussion
- What technologies can we use to enhance connections between data managers and between data managers and other stakeholders/members of data community
- Need to think about carrots and sticks -- what can we do to ensure people are adhering to DS best practices (to keep stakeholders happy)
- Tamar Norkin (USGS):
- +1 on ways to increase/promote links between repositories, sustainability, data citations
- Vicky Wolf (NASA):
- Use of the Cloud for both data and citations
- Need to think about how we sustain and keep track of provenance/origin of data and any derivative products once we move into the cloud
6) Discussion of possible DS Committee response to NSF "2030" Cyberinfrastructure Dear Colleague Letter (DCL)
- The letter itself: Request for Information on Future Needs for Advanced Cyberinfrastructure to Support Science and Engineering Research (NSF CI 2030)
- Development of an ESIP-wide response is underway on Loomio; participate & add your thoughts by responding to the RFI: https://www.loomio.org/invitations/70d1c101c699ae5c3a22
- Ruth: Right now, collecting ideas; will formalize in a few weeks - stay tuned to Monday updates for the schedule
- Because development of ESIP-wide response is in progress, the Committee decided not to mount a separate response activity
- Matt and Ruth encouraged everyone to share their thoughts on Loomio (see above)