Difference between revisions of "Interagency Data Stewardship/LifeCycle/Preservation Forum/TeleconNotes/2017-02-13meetingnotes"

From Earth Science Information Partners (ESIP)
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
''' Meeting Agenda and Notes - DS Committee - 2017-02-13 1 p.m. EST / 11 a.m. MST'''
+
''' Meeting Notes - Data Stewardship Committee - 2017-02-13 1 p.m. EST / 11 a.m. MST'''
(note difference from usual 2 p.m. EST meeting time)
+
(meeting was not held at usual 2 p.m. EST meeting time)
 
* Join the meeting from your computer, tablet or smartphone.
 
* Join the meeting from your computer, tablet or smartphone.
 
:* Click: https://www.gotomeeting.com/join/453694565
 
:* Click: https://www.gotomeeting.com/join/453694565
Line 14: Line 14:
  
 
1) Introduction of new chairs and student fellow (Matt, Sophie, Jamie)
 
1) Introduction of new chairs and student fellow (Matt, Sophie, Jamie)
2) Some general thoughts - priorities for the upcoming year:
+
* Notes:
+
 
:* Matt: Very interested in fostering collaborations across ESIP focus areas/clusters; interested in hearing ideas from cluster members about how to best achieve this
+
2) Some general thoughts - priorities for the upcoming year:  
:* Ruth: ESIP is about to have its 20th anniversary, and ESIP currently does not have a preservation/stewardship policy. It is a topic that Ruth is helping to bring to ESIP leadership’s attention and an opportunity to work with other groups (e.g., the sustainability cluster).
+
* Matt: Very interested in fostering collaborations across ESIP focus areas/clusters; interested in hearing ideas from cluster members about how to best achieve this
 +
* Ruth: ESIP is about to have its 20th anniversary, and ESIP currently does not have a preservation/stewardship policy. It is a topic that Ruth is helping to bring to ESIP leadership’s attention and an opportunity to work with other groups (e.g., the sustainability cluster).
 +
 
 +
 
 
3) Discussion re: creation of 2017 DS cluster Strategic Plan
 
3) Discussion re: creation of 2017 DS cluster Strategic Plan
* Plan in development: [http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2017 2017 Strategic Plan]
+
* Plan (in development): [http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2017 2017 Strategic Plan]
 
* For reference: [http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2016 2016 Strategic Plan]
 
* For reference: [http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2016 2016 Strategic Plan]
 
* Notes:
 
* Notes:
Line 27: Line 30:
 
:* Will look to formalize by next month’s telecon
 
:* Will look to formalize by next month’s telecon
  
3. Committee round table discussion: interests and priorities
 
  
4. Response to NSF "2030" Cyberinfrastructure Dear Colleague Letter (DCL)
+
4) ESIP Winter Meeting recap/updates pertinent to DS activities
:* View letter at NSF site: [https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf17031 Request for Information on Future Needs for Advanced Cyberinfrastructure to Support Science and Engineering Research (NSF CI 2030)]
+
* Sophie: Attended "intro"/debut meeting for new [http://wiki.esipfed.org/index.php/Earth_Science_Data_Analytics Data Analytics] cluster
:* View [https://www.loomio.org/g/L8mBsAEp/nsf-cyberinfrastructure-rfi-esip-response ESIP response] and Join the ESIP-wide Loomio group responding to this RFI: [https://www.loomio.org/invitations/70d1c101c699ae5c3a22]
+
:* Saw a suite of nice processing/analysis methods that various ESIP members currently use
 +
:* Might be a logical group with which DS cluster can collaborate
 +
* Rama was one of the co-chairs for the Info Quality cluster meeting; he was not able to attend the meeting in person, but the other co-chairs, David Moroni and Ge Peng, were present
 +
:* Focus of IQ cluster meeting was forging relationships with other clusters
 +
:* Good discussion about CMIP5 data
 +
:* IQ cluster has several members who might be great as guest speakers/presenters in our own DS telecons over course of next year
 +
* Madison: With Sophie, presented some usability tests (using cognitive walkthrough, remote user study, and group user study techniques)
 +
* Nancy H.: DMT working group will be collaborating with Usability Cluster regarding Clearinghouse testing
 +
* Ruth: Sustainability Cluster would be another good group to collaborate with
 +
* Matt: Thanks to Rama and Justin for leadership of cluster over past year!
 +
 
 +
 
 +
5) Cluster Roundtable, with objective of soliciting ideas/priorities for cluster over next year
 +
* Matt's prompt: What are you thinking about in your own work that you think we (in DS cluster) can focus on over course of next year?
 +
* Much of this input will be used to help build the [http://wiki.esipfed.org/index.php/ESIP_Data_Stewardship_Strategic_Plan_Calendar_Year_2017 2017 Strategic Plan]
 +
:* '''Matt''':
 +
::* How can NCAR promote connectivity in related resources across NCAR’s different repositories? (probably something other organizations/institutions also dealing with)
 +
:::* e.g., provenance, Semantic Web
 +
::* '''Ruth''' offered some ideas, e.g., RMAP (https://test.rmap-hub.org/app/) and use of ORCID to connect people/datasets/other products.  A poster about it is at https://dataconservancy.org/poster-presentation-on-linked-data-and-share-at-2016-charleston-conference/.  A presentation about it happened in a session at the last ESIP meeting (attached at http://commons.esipfed.org/node/9512)
 +
:* '''Rama''':
 +
::* Focus remains on data stewardship and preservation
 +
::* [http://wiki.esipfed.org/index.php/Provenance_and_Context_Content_Standard Provenance and Context Content Standard (PCCS)] has been adopted by NASA
 +
:::* Has made some significant progress in getting it adopted as an ISO standard
 +
::* IQ cluster has a parallel activity in the NASA Data Quality Working Group
 +
:::* Next month: Meeting of data science working groups within NASA
 +
:::* Looking forward to forging new/better connections between IQ and DS clusters
 +
:* '''Jamie''':
 +
::* Very interested in stewardship of software products
 +
::* Many current practices in data stewardship and preservation are broadly applicable to software sustainability
 +
::* '''Rama''' adds that NASA has a working group which has been looking at software citations; thus curation/management of software would certainly be a relevant topic for the Data Stewardship Committee
 +
:* '''Bob Downs''' (NASA):
 +
::* Part of NASA Earth Science Data Systems Working Group, focus on data quality/software citations (with Rama)
 +
::* Senior Digital Archivist at Center for International Earth Science Information Network (CIESIN)
 +
::* Interested in sustainability of repositories and the data we put in them; also has an interest in figuring out ways to justify the impact of repositories (particularly in times of tight funding)
 +
:* '''Heather Brown''' (NCEI):
 +
::* Interested in continuing/maintaining the cluster’s Data Management Training (see last year’s strategic plan); particularly in making it relevant to NCEI scientists and archived data. End-to-end stewardship.
 +
::* +1 on previous comments re: sustainability
 +
::* +1 on previous comments re: data management training.
 +
:::* Citation, Identifiers, software, “trusted repository”, etc.
 +
:* '''Madison Langseth''' (USGS):
 +
::* Like Matt, trying to ensure that data are well-connected to publications and other products
 +
:::* Technologies to make those links
 +
::* Also interested in data management training, and pushing it out to the scientists who generate it
 +
::* Dynamic data citation
 +
::* Usability of data repositories both by end users and data producers.
 +
:* '''Nancy Hoebelheinrich''' (Knowledge Motifs, LLC):
 +
::* Also interested in how we capture the value of particular data repositories
 +
::* Continued focus on ESIP's Data Management Training
 +
::* How do we define the “core skills” for data professionals and data scientists?
 +
:::* Area of possible collaboration with other clusters
 +
::* CLEAN cluster focusing on educating potential generators and users of data as early on as possible in the educational continuum; might be something DS cluster wants to explore
 +
:* '''Ruth Duerr''' (Ronin Institute):
 +
::* +1 comments on sustainability
 +
::* +1 comments on new technologies to make connections between data and publications
 +
::* Need to update ESIP citation guidelines to reflect dynamic data citation technologies
 +
:::* Good example: RDA group, has some good webinars
 +
:::* Could invite [https://www.rd-alliance.org/user/174 Andreas Rauber] from RDA Data Citation Working Group
 +
::* Good example of citations to a dataset: http://dx.doi.org/10.7265/N55M63M1
 +
:* '''Shelley Stall''' (AGU):
 +
::* In her job at AGU: promoting best practices for data management and accession of data into repositories
 +
:::* Will have some assessments, but currently much more in the “education” phase
 +
::* Personal passion over past year: Focus on education of early career community:
 +
:::* What do students/early career scientists need to know, and how can we deliver that information?
 +
::* Has been working with COPDESS: Coalition for Publishing Data in the Earth and Space Sciences
 +
:::* These are issues that many agencies/organizations are dealing with
 +
::* Helping Ruth with dynamic data citation
 +
:* '''Sophie Hou''' (NCAR):
 +
::* +1 sustainability (both in terms of infrastructure of repositories themselves, and in terms of culture/best practices)
 +
::* +1 education: need to get training out to all constituencies:
 +
:::* Data managers
 +
:::* Scientists/data generators
 +
:::* Students/early career community
 +
::* Attribution & acknowledgment need to be part of the citation discussion
 +
::* What technologies can we use to enhance connections between data managers and between data managers and other stakeholders/members of data community
 +
::* Need to think about carrots and sticks -- what can we do to ensure people are adhering to DS best practices (to keep stakeholders happy)
 +
:* '''Tamar Norkin''' (USGS):
 +
::* +1 on ways to increase/promote links between repositories, sustainability, data citations
 +
:* '''Vicky Wolf''' (NASA):
 +
::* Use of the Cloud for both data and citations
 +
:::* Need to think about how we sustain and keep track of provenance/origin of data and any derivative products once we move into the cloud
 +
 
 +
 
 +
6) Discussion of possible DS Committee response to NSF "2030" Cyberinfrastructure Dear Colleague Letter (DCL)
 +
* The letter itself: [https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf17031 Request for Information on Future Needs for Advanced Cyberinfrastructure to Support Science and Engineering Research (NSF CI 2030)]
 +
* Development of an ESIP-wide response [https://www.loomio.org/g/L8mBsAEp/nsf-cyberinfrastructure-rfi-esip-response is underway on Loomio]; participate & add your thoughts by responding to the RFI: https://www.loomio.org/invitations/70d1c101c699ae5c3a22  
 +
* Ruth: Right now, collecting ideas; will formalize in a few weeks - stay tuned to Monday updates for the schedule
 +
* Because development of ESIP-wide response is in progress, the Committee decided not to mount a separate response activity
 +
:* Matt and Ruth encouraged everyone to share their thoughts on Loomio (see above)

Latest revision as of 17:11, February 15, 2017

Meeting Notes - Data Stewardship Committee - 2017-02-13 1 p.m. EST / 11 a.m. MST (meeting was not held at usual 2 p.m. EST meeting time)

  • Join the meeting from your computer, tablet or smartphone.
  • You can also dial in using your phone.
  • United States: +1 (408) 650-3123
  • Access Code: 453-694-565


Attendees: Matt Mayernik, Bruce Caron, Shelley Stall, Sophie Hou, Rama, Tamar Norkin, Vicky Wolf, Ruth Duerr, Madison Langseth, Heather Brown, Nancy Hoebelheinrich, Bob Downs, Jamie Collins


Notes:

1) Introduction of new chairs and student fellow (Matt, Sophie, Jamie)


2) Some general thoughts - priorities for the upcoming year:

  • Matt: Very interested in fostering collaborations across ESIP focus areas/clusters; interested in hearing ideas from cluster members about how to best achieve this
  • Ruth: ESIP is about to have its 20th anniversary, and ESIP currently does not have a preservation/stewardship policy. It is a topic that Ruth is helping to bring to ESIP leadership’s attention and an opportunity to work with other groups (e.g., the sustainability cluster).


3) Discussion re: creation of 2017 DS cluster Strategic Plan

  • Will use previous year’s plan (see link) as a blueprint
  • New document has been started on Wiki
  • Goal will be to have plan in place within ~ 1 month
  • Will look to formalize by next month’s telecon


4) ESIP Winter Meeting recap/updates pertinent to DS activities

  • Saw a suite of nice processing/analysis methods that various ESIP members currently use
  • Might be a logical group with which DS cluster can collaborate
  • Rama was one of the co-chairs for the Info Quality cluster meeting; he was not able to attend the meeting in person, but the other co-chairs, David Moroni and Ge Peng, were present
  • Focus of IQ cluster meeting was forging relationships with other clusters
  • Good discussion about CMIP5 data
  • IQ cluster has several members who might be great as guest speakers/presenters in our own DS telecons over course of next year
  • Madison: With Sophie, presented some usability tests (using cognitive walkthrough, remote user study, and group user study techniques)
  • Nancy H.: DMT working group will be collaborating with Usability Cluster regarding Clearinghouse testing
  • Ruth: Sustainability Cluster would be another good group to collaborate with
  • Matt: Thanks to Rama and Justin for leadership of cluster over past year!


5) Cluster Roundtable, with objective of soliciting ideas/priorities for cluster over next year

  • Matt's prompt: What are you thinking about in your own work that you think we (in DS cluster) can focus on over course of next year?
  • Much of this input will be used to help build the 2017 Strategic Plan
  • Matt:
  • How can NCAR promote connectivity in related resources across NCAR’s different repositories? (probably something other organizations/institutions also dealing with)
  • e.g., provenance, Semantic Web
  • Rama:
  • Has made some significant progress in getting it adopted as an ISO standard
  • IQ cluster has a parallel activity in the NASA Data Quality Working Group
  • Next month: Meeting of data science working groups within NASA
  • Looking forward to forging new/better connections between IQ and DS clusters
  • Jamie:
  • Very interested in stewardship of software products
  • Many current practices in data stewardship and preservation are broadly applicable to software sustainability
  • Rama adds that NASA has a working group which has been looking at software citations; thus curation/management of software would certainly be a relevant topic for the Data Stewardship Committee
  • Bob Downs (NASA):
  • Part of NASA Earth Science Data Systems Working Group, focus on data quality/software citations (with Rama)
  • Senior Digital Archivist at Center for International Earth Science Information Network (CIESIN)
  • Interested in sustainability of repositories and the data we put in them; also has an interest in figuring out ways to justify the impact of repositories (particularly in times of tight funding)
  • Heather Brown (NCEI):
  • Interested in continuing/maintaining the cluster’s Data Management Training (see last year’s strategic plan); particularly in making it relevant to NCEI scientists and archived data. End-to-end stewardship.
  • +1 on previous comments re: sustainability
  • +1 on previous comments re: data management training.
  • Citation, Identifiers, software, “trusted repository”, etc.
  • Madison Langseth (USGS):
  • Like Matt, trying to ensure that data are well-connected to publications and other products
  • Technologies to make those links
  • Also interested in data management training, and pushing it out to the scientists who generate it
  • Dynamic data citation
  • Usability of data repositories both by end users and data producers.
  • Nancy Hoebelheinrich (Knowledge Motifs, LLC):
  • Also interested in how we capture the value of particular data repositories
  • Continued focus on ESIP's Data Management Training
  • How do we define the “core skills” for data professionals and data scientists?
  • Area of possible collaboration with other clusters
  • CLEAN cluster focusing on educating potential generators and users of data as early on as possible in the educational continuum; might be something DS cluster wants to explore
  • Ruth Duerr (Ronin Institute):
  • +1 comments on sustainability
  • +1 comments on new technologies to make connections between data and publications
  • Need to update ESIP citation guidelines to reflect dynamic data citation technologies
  • Good example: RDA group, has some good webinars
  • Could invite Andreas Rauber from RDA Data Citation Working Group
  • Shelley Stall (AGU):
  • In her job at AGU: promoting best practices for data management and accession of data into repositories
  • Will have some assessments, but currently much more in the “education” phase
  • Personal passion over past year: Focus on education of early career community:
  • What do students/early career scientists need to know, and how can we deliver that information?
  • Has been working with COPDESS: Coalition for Publishing Data in the Earth and Space Sciences
  • These are issues that many agencies/organizations are dealing with
  • Helping Ruth with dynamic data citation
  • Sophie Hou (NCAR):
  • +1 sustainability (both in terms of infrastructure of repositories themselves, and in terms of culture/best practices)
  • +1 education: need to get training out to all constituencies:
  • Data managers
  • Scientists/data generators
  • Students/early career community
  • Attribution & acknowledgment need to be part of the citation discussion
  • What technologies can we use to enhance connections between data managers and between data managers and other stakeholders/members of data community
  • Need to think about carrots and sticks -- what can we do to ensure people are adhering to DS best practices (to keep stakeholders happy)
  • Tamar Norkin (USGS):
  • +1 on ways to increase/promote links between repositories, sustainability, data citations
  • Vicky Wolf (NASA):
  • Use of the Cloud for both data and citations
  • Need to think about how we sustain and keep track of provenance/origin of data and any derivative products once we move into the cloud


6) Discussion of possible DS Committee response to NSF "2030" Cyberinfrastructure Dear Colleague Letter (DCL)

  • Matt and Ruth encouraged everyone to share their thoughts on Loomio (see above)