Difference between revisions of "Earth Science Data Analytics/2014-04-17 Telecon"

From Earth Science Information Partners (ESIP)
Line 3: Line 3:
 
===Known Attendees:===
 
===Known Attendees:===
  
 +
 +
Will be provided soon
  
  
 
===Agenda:===
 
===Agenda:===
  
10 minutes – Steve
+
1 Present new Cluster Information Sharing Webasites -Steve
  
 
Introduction to the Earth Science Data Analytics Discussion Forum - http://wiki.esipfed.org/index.php/Earth_Science_Data_Analytics/Discussion_Forum
 
Introduction to the Earth Science Data Analytics Discussion Forum - http://wiki.esipfed.org/index.php/Earth_Science_Data_Analytics/Discussion_Forum
Line 13: Line 15:
  
  
10 minutes – Joan Aron – To Present:
+
2 – Joan Aron – To Present: Data Analytics Needs Scenario
 
 
Data Analytics Needs Scenario
 
 
 
  
10 minutes – Rudy Husar – To present:
 
  
User-Oriented Data Analytics and Tools in DataFed
+
3 – Rudy Husar – To present:  User-Oriented Data Analytics and Tools in DataFed
  
  
20 minutes – Tiffany Matthews – To lead discussion:
+
4 – Tiffany Matthews – To lead discussion:
  
 
'enabling users to leverage data to observe more phenomena than what can be identified when studying an average'.
 
'enabling users to leverage data to observe more phenomena than what can be identified when studying an average'.
Line 33: Line 31:
 
Presentations:
 
Presentations:
 
* [[Media: Aron-Data Analytics Needs Scenario.pptx|  Joan Aron: Data Analytics Needs Scenario - 4/17/14]]
 
* [[Media: Aron-Data Analytics Needs Scenario.pptx|  Joan Aron: Data Analytics Needs Scenario - 4/17/14]]
 +
* [[Media: Rudy140417_ESIP_DataAnalytics2.pptx |  Tiffany Mathews: User-Oriented Data Analytics and Tools using the Federated Data System� DataFed� - 4/17/14]]
 
* [[Media: ASDC Analytics Discussion.pdf|  Tiffany Mathews: Atmospheric Science Data Center Sample Analytics Use Cases - 4/17/14]]
 
* [[Media: ASDC Analytics Discussion.pdf|  Tiffany Mathews: Atmospheric Science Data Center Sample Analytics Use Cases - 4/17/14]]
  
Line 39: Line 38:
 
From March 20th Telecom:
 
From March 20th Telecom:
  
Good News:  We had 3 excellent speakers to discuss: Data Science/Data Analytics (Brand Niemann); An application of Data Analytics on the MERRA (data assimilation) dataset (John), and; Data Analytics Master Program Approach/Overview at DePaul University (Bamshad)
+
Today, from Joan, Rudy, and Tiffany, we received three excellent, insightful presentations regarding the need of data analytics from a user perspective, and a data discovery perspective, as well as useful tools that can help the data user.
 
 
Bad News: The Telecom Convener did not plan enough time for what the presentations deservingly used, thus we did not get to Agenda Item 3. (More on item 3 later)
 
 
 
The cluster started with some thoughts for Cluster objectives and direction based on February’s telecom ideas (see notes from February’s telecom).  Basically, It seems that this Cluster can serve multiple purposes to address the various levels of members understanding and interests regarding Data Analytics.  This includes:
 
 
 
-  ‘Academic’ discussions that allow all of us to be better educated and on the same page in understanding the various aspects of Data Analytics
 
 
 
-  Bringing in guest speakers to describe overviews of external efforts and further teach us about the broader use of Data Analytics.  (We can always invite speakers back to learn more)
 
 
 
-  Activities that ESIP members can actually address and tackle
 
 
 
As a start, this will lay groundwork for our understanding, as the field evolves, and the individual and collective interests of this cluster evolve, in turn, the cluster objectives can evolve.
 
This will be put out as the basis of the ESDA cluster mission/objectives.  Please take a look at tit at the top of our Wiki ‘ESDA Home Page’. 
 
Please provide comments on what you think of it, does it address your expectations, and/or what else we should include.
 
 
 
 
 
Take a look at Brand’s presentation.  It provides a real breadth of information regarding Data Science, Data Analytics, what Data Scientists do, current activities in the field, more.  Remember: ‘…try to make a story out of the data’.
 
 
 
John’s presentation was equally interesting, describing how he applies analytics (MapReduce) to the MERRA datasets.
 
 
 
Not to be outdone, Bamshad gave a great overview of DePaul University’s Data Analytics program, the types of course taught, a little philosophy behind the program, and the domain areas on which the program focus.
 
 
 
BTW, here is my new favorite predictive analytics figure describing the CRISP-DM process found in both, Brand and Bamshad’s presentations.  Only I would substitute ‘Business Understanding’ with ‘Domain Expertise’, to make it more generic.
 
  
 +
Figure (I believe) Rudy was alluding to:
  
 
[[Image:predanal.png|500px]]
 
[[Image:predanal.png|500px]]

Revision as of 08:26, April 18, 2014

SDA Telecom notes – 4/17/14

Known Attendees:

Will be provided soon


Agenda:

1 – Present new Cluster Information Sharing Webasites -Steve

Introduction to the Earth Science Data Analytics Discussion Forum - http://wiki.esipfed.org/index.php/Earth_Science_Data_Analytics/Discussion_Forum Introduction to the Use Case Collection webpage - http://wiki.esipfed.org/index.php/Use_Case_Collection


2 – Joan Aron – To Present: Data Analytics Needs Scenario


3 – Rudy Husar – To present: User-Oriented Data Analytics and Tools in DataFed


4 – Tiffany Matthews – To lead discussion:

'enabling users to leverage data to observe more phenomena than what can be identified when studying an average'.

Tiffany will initiate discussion with her presentation entitled: " Atmospheric Science Data Center Sample Data Analytics Use Cases."


Presentations:

Notes:

From March 20th Telecom:

Today, from Joan, Rudy, and Tiffany, we received three excellent, insightful presentations regarding the need of data analytics from a user perspective, and a data discovery perspective, as well as useful tools that can help the data user.

Figure (I believe) Rudy was alluding to:

Predanal.png



Time ran out to discuss the third agenda item. This will be discussed at the next telecom (April 17), and provided here for your contemplation: ESDA Activity - Compile use cases (include producer/supplier and data user analytics utilization) - Need 2 to 4 owners - Compile analytics tools (internal and external to ESIP) – Need 2 to 4 owners (preferably different) - Do gap analysis – Need to 2 to 4 owners (different or some from above groups)

And Potential Future Activities (as of today) - Examine project long case studies to determine successfulness of using data analytics in the project (i.e., lessons learned) - Oh yeah: Create a Cluster Mission Statement and Objectives - Report out to the Federation All


Next Telecon:

  • May 15, 3:00 EST (third Thursday of each month)
  • Agenda (as of now)

- Analytics related topic to better understand. DOES ANYBODY HAVE A TOPIC THEY WISH TO BETTER UNDERSTAND

- Listen and Learn - We will have 2 guest speakers to discuss their Analytics activities

- ESDA Activities