Difference between revisions of "Earth Science Data Analytics/2015-5-21 Telecon"

From Earth Science Information Partners (ESIP)
 
(4 intermediate revisions by the same user not shown)
Line 3: Line 3:
 
===Known Attendees:===
 
===Known Attendees:===
  
ESIP Host (Erin Robinson), Steve Kempler, Chung-lin Shie, Tiffany Mathews, Brand Niemann, Joan Aron, Soren(?), Suhung Shen
+
ESIP Host (Erin Robinson), Steve Kempler, Robert Downs, Chung-lin Shie, Tiffany Mathews, Brand Niemann, Joan Aron, Soren(?), Suhung Shen
  
 
===Agenda:===
 
===Agenda:===
Line 12: Line 12:
  
 
2.  Summer Meeting Planning – 2 Sessions scheduled:
 
2.  Summer Meeting Planning – 2 Sessions scheduled:
 +
 
-  Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)
 
-  Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)
 +
 
-  The Need for Earth Science Data Analytics to Facilitate Community Resilience (and other applications) (http://commons.esipfed.org/node/7998)
 
-  The Need for Earth Science Data Analytics to Facilitate Community Resilience (and other applications) (http://commons.esipfed.org/node/7998)
  
Line 20: Line 22:
 
Presentations:
 
Presentations:
  
None, this time.  Worked off Google Doc:  Use Case Information:  https://docs.google.com/document/d/1U1mAt4ZjJqXeNmtRoE4VbI1nBgS1v7DzeHib_7mzOF8/edit
+
None, this time.   
 +
 
 +
Use Case Information:  https://docs.google.com/document/d/1U1mAt4ZjJqXeNmtRoE4VbI1nBgS1v7DzeHib_7mzOF8/edit
  
 
===Notes:===
 
===Notes:===
  
  
Thank you all for attending. Making good progress...
+
Thank you all for attending. Making progress...
  
At this telecon, we discussed the Use Case Google Doc that we generated from our Use Case Google Spreadsheet, depicting the Use Case template in text form, for populationThe document also contains it's first Use CaseThe template consists of:
+
At this telecon, we continued our discussion on locating ESDA Use Cases.  Brand provided several links from which we can glean use cases specific to Earth science.  This led to a discuss regarding the various ways 'Data Analytics' is used.  On one level it is used as an abstract term that encompasses situations that demonstrate concepts and thus, needed capabilities to examine/analyze datasets, and/or co-analyze heterogeneous datasets.  On another level, the term data analytics is refers to the actual implementation of specific tools (logic) to solve a particular problem to provides results that yields a better understanding towards solving the problemWhereas the former interpretation addresses more strategic questions like 'Return of Investment', and whip gets the most value for efforts expended on Big Data Initiatives, the latter interpretation, in terms of Earth science data analytics, would be one of greater interest to information tool developers, providers, and users (users=researchers)Other discussion inputs include: Need to find best ways to use data; Need ways to create datasets tools that work with all data; Need to communicate and share available tools.
  
* Items 'a' through 'c' that provides Use Case identifying information
 
* Items 'd' through 'f' that describes the data analytics being used and technical reasons for needing the used data analytics
 
* Items 'g' through 'l' that, along with 'd', 'e', and 'f', allow us to categorize the Use Cases for further analysis
 
  
Sources for additional Use Cases were provided:
+
Summer Cluster Planning
  
* Erin suggested we look at other ESIP clusters for potential Use Cases:  Agriculture, Disasters
 
* Brand provided the following links:  http://bigdatawg.nist.gov/usecases.php, https://github.com/demand-driven-open-data/ddod-intake/issues
 
* Precision farming scenarion (hmm... something the Federation addressed many years ago)
 
  
See Actions below…
+
We will have two cluster session at the summer meeting:
  
 
Next we discussed the possibility of sponsoring a data analytics workshop at the summer meeting.  Recapping some questions from our March telecon:
 
 
* Who is our audience? ESIP technologists, who, once hearing problem/needs of at a analytics users, can go solve it.
 
* Who are our experts? Suggestions made: Power users, who deal with lots of distributed heterogeneous data; Decision makers who can express what their needs; Google; Chris Mattmann (already in the Federation); Larrabee Strow (AIRS Scientist); of course, Peter Fox
 
 
At this telecon, potential themes focused on one or more of the following:
 
 
* Applying analytics to environmental issues
 
* The Progression of data analytics tool capabilities currently available to potential capabilities that will be needed
 
* Matching data analytics needs with available tools
 
 
Commentary: All look good and we probably have time for all.  The second and third bullets can be tied together.  For the environmental issues, I look to Ethan who not only works in an agency that has a the word 'Environmental' in it's title, but has been focused on applying data analytics.  Is there a way to present your findings:  how and where data analytics are or could be applied in addressing environmental issues.  Or suggest a speaker.  For the tools oriented bullets, I will do some searching for west coast 'experts'
 
 
 
Summer Cluster Planning
 
 
We will have two cluster session at the summer meeting:
 
  
 
1.  Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)
 
1.  Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)
Line 67: Line 47:
  
 
Wade Bishop, School of Information Sciences, University of Tennessee
 
Wade Bishop, School of Information Sciences, University of Tennessee
Peter Fox
+
 
 +
Peter Fox, Earth & Environmental Sciences, Tetherless World Constellation, Rensselaer Polytechnic Institute
 +
 
 
Lewis McGibbney, Computer Science for Data Intensive Systems Group, Jet Propulsion Laboratory
 
Lewis McGibbney, Computer Science for Data Intensive Systems Group, Jet Propulsion Laboratory
 +
 
Karen Stocks, Director, Geological Data Center, Scripps Institution of Oceanography  
 
Karen Stocks, Director, Geological Data Center, Scripps Institution of Oceanography  
  
1.  ESDA activities related:  Use Case discussions; Surveying and documenting technologies; How to move forward on gap analysis; other items?
 
  
2.  Knowledge, skills needed to be an Earth Science Data Scientist (gotta be a better way to say this). Candidate speakers:  Peter Fox, Chris Mattmann.
+
2.  The Need for Earth Science Data Analytics to Facilitate Community Resilience (and other applications) (http://commons.esipfed.org/node/7998)
  
Commentary on #2: Peter gave a real good talk to us at a previous ESIP cluster session on Data Analytics.  And there are several AGU presentations and articles on being a Data Scientist.  Next, I think we should include experiences.  How about something like:  '''Practical Experiences in Data Science'''If this is good, I will contact Peter and Chris.  It would be good to get 2 more speakers.  Maybe Karen Stocks (see last AGU abstract submission below)
+
This cluster session will review our current work (for new participants), followed by discussion on the extent of social, economic, and environmental issues, as well as science research, in which the advancement of Earth science data analytics have had an impactThe goal of this discussion is to gain sufficient information to categorize how Earth science data analytics has come to be used in our society, and identify use cases that exemplify this
  
  
FYI
+
FYI:
  
 
As of this moment, the following Data Analytics related session abstract have been proposed to the AGU:
 
As of this moment, the following Data Analytics related session abstract have been proposed to the AGU:
Line 104: Line 86:
  
 
===Next Telecon:===
 
===Next Telecon:===
Thursday, May 14, 2015, 3:00 EST  
+
Thursday, June 18, 2015, 3:00 EST  
  
 
Agenda:   
 
Agenda:   
  
1.  Discuss Use Cases inserted in document
+
1.  Summer Cluster Planning - Discuss outline (to soon be provided)
  
2.  Potential ESIP ESDA workshop: Fish or cut bait
+
2.  Open Mic
 
 
3.  Summer Cluster Planning
 
 
 
4.  Open Mic
 
  
  
Line 122: Line 100:
  
 
All - Please e-mail Steve (Steven.J.Kempler@nasa.gov) with comments/suggestions on cluster topics by next telecon
 
All - Please e-mail Steve (Steven.J.Kempler@nasa.gov) with comments/suggestions on cluster topics by next telecon
 
All, Steve - Please respond: Does Cluster Session 2 above look reasonable (Steven.J.Kempler@nasa.gov)?  If so, Steve will contact potential presenters
 
  
 
All Other Participants - E-mail Steve (Steven.J.Kempler@nasa.gov) so you can be an Active Participant  Soon.  But don't need to be an Active Participant to review use cases...so please do  
 
All Other Participants - E-mail Steve (Steven.J.Kempler@nasa.gov) so you can be an Active Participant  Soon.  But don't need to be an Active Participant to review use cases...so please do  
  
 
BTW, my e-mail is: Steven.J.Kempler@nasa.gov
 
BTW, my e-mail is: Steven.J.Kempler@nasa.gov

Latest revision as of 07:18, June 1, 2015

ESDA Telecon notes – 5/21/15

Known Attendees:

ESIP Host (Erin Robinson), Steve Kempler, Robert Downs, Chung-lin Shie, Tiffany Mathews, Brand Niemann, Joan Aron, Soren(?), Suhung Shen

Agenda:

Agenda:

1. ESDA Use Cases

2. Summer Meeting Planning – 2 Sessions scheduled:

- Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)

- The Need for Earth Science Data Analytics to Facilitate Community Resilience (and other applications) (http://commons.esipfed.org/node/7998)

3. Open Mic


Presentations:

None, this time.

Use Case Information: https://docs.google.com/document/d/1U1mAt4ZjJqXeNmtRoE4VbI1nBgS1v7DzeHib_7mzOF8/edit

Notes:

Thank you all for attending. Making progress...

At this telecon, we continued our discussion on locating ESDA Use Cases. Brand provided several links from which we can glean use cases specific to Earth science. This led to a discuss regarding the various ways 'Data Analytics' is used. On one level it is used as an abstract term that encompasses situations that demonstrate concepts and thus, needed capabilities to examine/analyze datasets, and/or co-analyze heterogeneous datasets. On another level, the term data analytics is refers to the actual implementation of specific tools (logic) to solve a particular problem to provides results that yields a better understanding towards solving the problem. Whereas the former interpretation addresses more strategic questions like 'Return of Investment', and whip gets the most value for efforts expended on Big Data Initiatives, the latter interpretation, in terms of Earth science data analytics, would be one of greater interest to information tool developers, providers, and users (users=researchers). Other discussion inputs include: Need to find best ways to use data; Need ways to create datasets tools that work with all data; Need to communicate and share available tools.


Summer Cluster Planning


We will have two cluster session at the summer meeting:


1. Teaching Science Data Analytics Skills, and the Earth Science Data Scientist (http://commons.esipfed.org/node/7999)

We will have 4 speakers, who will provide their experiences in being, or needing, a Data Scientist in their work. The goal of this session is to discuss and extract real project data scientist/analytics experience needs, initiated by presentation and discussed by session participants. Of special interest is bringing together people who have needs for data scientists (data analytics) and will be able to articulate those needs by the end of the session, and/or; stir ideas: for the use of data analytics in their research or to build tools/services for others.

Presenters:

Wade Bishop, School of Information Sciences, University of Tennessee

Peter Fox, Earth & Environmental Sciences, Tetherless World Constellation, Rensselaer Polytechnic Institute

Lewis McGibbney, Computer Science for Data Intensive Systems Group, Jet Propulsion Laboratory

Karen Stocks, Director, Geological Data Center, Scripps Institution of Oceanography


2. The Need for Earth Science Data Analytics to Facilitate Community Resilience (and other applications) (http://commons.esipfed.org/node/7998)

This cluster session will review our current work (for new participants), followed by discussion on the extent of social, economic, and environmental issues, as well as science research, in which the advancement of Earth science data analytics have had an impact. The goal of this discussion is to gain sufficient information to categorize how Earth science data analytics has come to be used in our society, and identify use cases that exemplify this


FYI:

As of this moment, the following Data Analytics related session abstract have been proposed to the AGU:

  • Advanced Information Systems to Support Climate Projection Data Analysis

Gerald L Potter, Tsengdar J Lee, Dean Norman Williams, and Chris A Mattmann

  • Big Data Analytics for Scientific Data

Emily Law, Michael M Little, Daniel J Crichton, and Padma A Yanamandra-Fisher

  • Big Data in Earth Science – From Hype to Reality

Kwo-Sen Kuo, Rahul Ramachandran, Ben James Kingston Evans. and Mike M Little

  • Big Data in the Geosciences: New Analytics Methods and Parallel Algorithms

Jitendra Kumar and Forrest M Hoffman

  • Geophysical Science Data Analytics Use Case Scenarios

Steven J Kempler, Robert R Downs, Tiffany Joi Mathews, and John S Hughes

  • Partnerships and Big Data Facilities in a Big Data World

Kenneth S Casey and Danie Kinkade

  • Towards a Career in Data Science: Pathways and Perspectives

Karen I Stocks, Lesley A Wyborn, Ruth Duerr, and Lynn Yarmey

Next Telecon:

Thursday, June 18, 2015, 3:00 EST

Agenda:

1. Summer Cluster Planning - Discuss outline (to soon be provided)

2. Open Mic


Actions:

Active Participants - Review Use Case document and provide feedback. Insert 3 use cases each. Due: By next telecon: May 14.

All - Please e-mail Steve (Steven.J.Kempler@nasa.gov) with comments/suggestions on cluster topics by next telecon

All Other Participants - E-mail Steve (Steven.J.Kempler@nasa.gov) so you can be an Active Participant Soon. But don't need to be an Active Participant to review use cases...so please do

BTW, my e-mail is: Steven.J.Kempler@nasa.gov