Difference between revisions of "Interoperability and Technology/Tech Dive Webinar Series"

From Earth Science Information Partners (ESIP)
(37 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
'''Bold text'''= Tech Dive Webinars =
 
'''Bold text'''= Tech Dive Webinars =
  
== 10 Jan 2019: "Cloud Native Geoprocessing of Earth Observation Satellite Data with Pangeo": Scott Henderson (University of Washington) ==
+
== 14 November 2019: "TBD":  ==
  
'''Time''': Thursday, 10 Jan, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)  
+
'''Time''': Thursday, 14, November, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)
 +
https://www.gotomeeting.com/join/533510693
 +
 
 +
== 10 October 2019: "ELFIE: Environmental Linked Features Interoperability Experiment":  ==
 +
 
 +
'''Summary'''
 +
This webinar will provide an update on activities of the OGC Environmental Linked Features Interoperability Experiment (ELFIE) and a preview of activities of the Second ELFIE (SELFIE). This will be an interactive demonstration-based session focused on practical application of JSON-LD to link features and observations.
 +
 
 +
Presenter: [https://www.usgs.gov/staff-profiles/david-l-blodgett David Blodgett, Civil Engineer, U.S. Geological Survey Water Mission Area]
 +
 
 +
[https://opengeospatial.github.io/ELFIE/ More info on ELFIE here.]
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/lvy9vdv96-s" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
== 12 September 2019: "STARE: SpatioTemporal Adaptive Resolution Encoding for Scalable Integrative Analysis": Michael Rilee ==
 +
 
 +
'''Summary'''
 +
Title: STARE: SpatioTemporal Adaptive Resolution Encoding for Scalable Integrative Analysis
 +
 
 +
Abstract: The Earth Science enterprise has generated a great volume and variety of high-quality, high-spatiotemporal resolution data (e.g. level 1 and 2 swath data), that could be integrated to open wide arenas of new, improved scientific advances. Yet aligning and integrating different kinds of Earth Science data is a laborious process, leading most researchers to focus on more generic, high level data products that are more easily compared. Dealing with the great volume and variety is the goal of the NASA/ACCESS-17 STARE project. STARE is a unifying indexing scheme addressing variety and is well suited for applying distributed storage and computing resources to address volume. We will describe the STARE approach and how it relates to other geolocation schemes, e.g. DGGS or simulation grids.
 +
 
 +
Presenter: Michael Rilee, Ph.D., Rilee Systems Technologies LLC.
 +
 
 +
Bio: Michael Rilee, PI of the STARE project, has been involved with high-end computing and modeling and simulation for about 25 years. For about the past 5 years or so he has been researching advanced computing techniques for a range of Earth Science applications including parallel computing, array databases, regridding, kriging, and atmospheric chemistry. Before that he was involved with several NASA efforts involving spacecraft autonomy, on-board data processing, and high-end computing. He was initially trained in plasma astrophysics, receiving his Ph.D. from Cornell University.
 +
 
 +
[https://esip.figshare.com/articles/STARE_Spatiotemporal_Adaptive_Resolution_Encoding_for_Integrative_Compatibility_Across_Variety_of_Volume/8942291 See poster about STARE here.]
 +
[http://odl.unl.edu/index.php/map_interface_user_guide/ See demo here.]
 +
[https://www.researchgate.net/profile/Kwo-Sen_Kuo See related research here.]
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/uodeCiYc6Mg" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
== 8 August 2019: "The Challenge of Location and How Discrete Global Grid Systems can enable Spatial Data Integration.": Matthew Purss ==
 +
 
 +
'''Summary'''
 +
Everything has a location. Location can be defined using descriptive terms (e.g. place names, suburb names, and rivers), position and geometry (e.g. geographic coordinates like latitude and longitude) and/or index notations (e.g. mesh block or parcel identifiers). A street address is useful for service delivery; land parcels for revenue raising and investment; coordinates and grids for positioning and monitoring changes in the landscape; various administrative boundaries for law enforcement and service management; and statistical (e.g. health, society or economy) information to analyse and improve life in Australian. However, existing approaches and disconnected infrastructures coupled with the myriad of ways to describe and store location information limit our ability to discover, access and integrate spatial data across organisation and jurisdiction boundaries to produce reliable and actionable information. The Location Index project (LOC-I) aims to introduce a consistent way to access, analyse and use location data to support the effective integration of socio-economic, statistical and environmental data from multiple data providers to support the spatially enabled delivery of Government policies and initiatives.
 +
 +
The flexible, and standards based, spatial data infrastructures being developed by Geoscience Australia and its partners under LOC-I include Internet of Things (IoT), Linked Data and Discrete Global Grid Systems (DGGS) technologies integrated with cloud-based data discovery and access tools. This will enable LOC-I to democratise spatial data applications, where people will be able to do spatial data integration operations without needing GIS specialists. This presentation will provide an overview of LOC-I, DGGS and how DGGS technologies can be used as a tool to spatially enable and integrate socio-economic and geospatial data together.
 +
 
 +
Dr. Matthew Brian John Purss is a Senior Advisor on Geospatial Standards at Geoscience Australia. He is a founding co-chair of the Open Geospatial Consortium's (OGC) Discrete Global Grid Systems (DGGS) Domain and Standards Working Groups. He is a geophysicist with over 20 years' experience in the exploration, research and government sectors and holds a PhD in Exploration Geophysics from Monash University where he studied grid based approaches to magnetic and electromagnetic modelling applications. Prior to initiating and leading the international standardisation of DGGS infrastructures.
 +
 
 +
http://locationindex.org
 +
 
 +
[[http://wiki.esipfed.org/images/c/c4/ESIP_DGGS_Webinar_presentation_2019_08_08.pptx|Download the slides here.]]
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/Gbe6-N_Zisw" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
== 11 July 2019: "ADIwg Open Source Metadata Toolkit": Josh Bradley, Dennis Walworth ==
 +
 
 +
'''Summary'''
 +
Josh Bradley is a data manager for Fish & Wildlife Service, Science Data Applications and project manager for the ADIwg mdTools project.
 +
Dennis Walworth is the data manager for the USGS Alaska Science Center and collaborator on the ADIwg mdTools project.
 +
 
 +
* [https://docs.google.com/presentation/d/e/2PACX-1vTAXbGlKL8QZ4G8pSHDOoeOLrfH3xWHNmspLeaMAocJI2zl0fKn5g-x7PAirekLAi3vImIL6752T0k7/pub?start=false&loop=false&delayms=3000&slide=id.p1 See slides here.]   
 +
* mdEditor website: https://www.mdeditor.org/   
 +
* Development version: https://dev.mdeditor.org/   
 +
* mdEditor User Manual: https://adiwg.gitbooks.io/mdeditor/content/   
 +
* GitHub Issues: https://github.com/adiwg/mdEditor/issues 
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/ClRmCU2Ae5c" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
== 6 June 2019: "Google Dataset Search: Facilitating data discovery in an open ecosystem.": Chris Gorgolewski ==
 +
 
 +
'''Summary'''
 +
 
 +
There are thousands of data repositories on the Web, providing access to millions of datasets. National and regional governments, scientific publishers and consortia, commercial data providers, and others publish data for fields ranging from social science to life science to high-energy physics to climate science and more. Access to this data is critical to facilitating reproducibility of research results, enabling scientists to build on others’ work, and providing data journalists easier access to information and its provenance. In this talk, I will discuss recently launched Google Dataset Search, which provides search capabilities over potentially all dataset repositories on the Web. I will talk about the open ecosystem for describing and citing datasets that we hope to encourage and the technical details on how we went about building Dataset Search. Finally, I will highlight research challenges in building a vibrant, heterogeneous, and open ecosystem where data becomes a first-class citizen.
 +
 
 +
https://toolbox.google.com/datasetsearch
 +
https://www.blog.google/products/search/making-it-easier-discover-datasets/
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/1PSE2hQ7mBo?start=3" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
== 9 May 2019: "The SpatioTemporal Asset Catalog (STAC) specification": Chris Holmes ==
 +
 
 +
'''Summary'''
 +
 
 +
The SpatioTemporal Asset Catalog (STAC) specification is an emerging standard to make it easier to find geospatial information. It aims to enable a cloud-native geospatial future by providing a common layer of metadata for search and discovery, while playing well with the web and existing geospatial standards.
 +
 
 +
See the the [http://stacspec.org/ SpatioTemporal Asset Catalog (STAC) specification] web site here.
 +
 
 +
'''Time''': Thursday, 9, May, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)
 +
 
 +
'''Recording'''
 +
 
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/emXgkNutUTo" frameborder="0" allowfullscreen></iframe></html>
 +
 
 +
 
 +
 
 +
== 11 Apr 2019: "Pachyderm": John Karabaic ==
 +
 
 +
'''Summary'''
 +
 
 +
See the Pachyderm one pager here: https://cdn2.hubspot.net/hubfs/4751021/Business_One_Pager.pdf
 +
 
 +
"Pachyderm lets you deploy and manage multi-stage, language-agnostic data pipelines while maintaining complete reproducibility and provenance."
 +
 
 +
'''Time''': Thursday, 11, April, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)  
  
 
'''Join meeting''':  
 
'''Join meeting''':  
Line 9: Line 114:
 
* regular phone: United States: +1 (408) 650-3123, Access Code: 533-510-693
 
* regular phone: United States: +1 (408) 650-3123, Access Code: 533-510-693
  
'''Speaker''': Scott Henderson is a postdoctoral fellow at the University of Washington.
+
'''Speaker''': Michael Masters (Pachyderm)
 +
See Docs Here:
 +
http://docs.pachyderm.io/en/latest/getting_started/local_installation.html 
  
 +
Also see intro talk here: https://changelog.com/practicalai/23
 +
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/za0JNgL4JGA" frameborder="0" allow="accelerometer; autoplay; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe></html>
 +
 +
== 14 Mar 2019: "Integrating SciServer and OceanSpy to enable easy access to oceanographic model output": Mattia Almansi (Johns Hopkins University)  ==
 +
 +
'''Time''': Thursday, 14, March, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)
 +
 +
'''Summary''':  OceanSpy is an open-source and user-friendly Python package that enables scientists and interested amateurs to use ocean model data sets with out-of-the-box analysis tools.  OceanSpy builds on software packages developed by the Pangeo community (in particular xarray, dask, and xgcm). I will show how OceanSpy can be used on model outputs stored on the Johns Hopkins SciServer system, negating the need for the user to own a computing cluster or even download the data. OceanSpy accelerates and facilitates exploration (including visualization) of terascale data. It is designed to operate on petascale ocean simulations hosted by SciServer in the coming months.
 +
 +
'''Join meeting''':
 +
* computer, tablet or smartphone: https://www.gotomeeting.com/join/533510693
 +
* regular phone: United States: +1 (408) 650-3123, Access Code: 533-510-693
 +
 +
'''Speaker''': Mattia is a PhD student at Johns Hopkins University under Prof. Tom Haine.  He's a physical oceanographer, primarily interested in the dynamics governing the circulation of the Subpolar North Atlantic. To address the targets of his research he uses high-resolution numerical simulations and available observations.
 +
 +
=== Links ===
 +
* [https://github.com/malmans2/oceanspy OceanSpy Github Page]
 +
* [http://www.sciserver.org/ SciServer web page]
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/GO_hNIcFw8o" frameborder="0" allowfullscreen></iframe></html>
 +
 +
== 14 Feb 2019: "Cloud Native Geoprocessing of Earth Observation Satellite Data with Pangeo": Scott Henderson (University of Washington)  ==
 +
 +
'''Summary''': NASA has estimated that by 2025, it will be storing upwards of 250 Petabytes (PB) of its data using commercial Cloud services (e.g. Amazon Web Services [AWS]). This presentation will focus on efforts funded by a NASA ACCESS 2017 grant to transition the Earth Science community into Cloud computing by developing technologies that build on top of the growing Pangeo ecosystem. In particular, the integration of JupyterHub with Kubernetes and several high-level Python packages (i.e. Xarray, Dask, Rasterio, Intake, PyViz), are enabling Cloud-native workflows that circumvent the bottleneck of downloading large amounts of data. These tools work best with emerging Cloud-native storage solutions for satellite imagery (i.e. NASA’s CMR, STAC, COGs). In this presentation, Scott gives update on the Pangeo project and showcases a few example workflows using large public archives of optical and radar satellite data.
 +
 +
'''Speaker''': Scott Henderson has a PhD in Geological Sciences from Cornell University and is currently a postdoctoral fellow at the University of Washington eScience Institute. Scott studies geologic hazards with satellite-based synthetic aperture radar.
 +
 +
=== Links ===
 +
* [https://docs.google.com/presentation/d/1evNXCddIllXUt4a5jKfmlO2197sp8T6I9L650cYZcsk/edit?usp=sharing Google Slides]
 +
* [https://github.com/scottyhq/esip-tech-dive Landsat NDVI (data on AWS, compute on Google)]
 +
* [https://github.com/pangeo-data/pangeo-tutorial-agu-2018 AGU 2018 Tutorial material (various examples)]
 +
* [https://github.com/scottyhq/grfn_pangeo_demo Getting ready for NISAR data]
 +
* [https://github.com/scottyhq/stac-intake-landsat STAC catalogs, Intake, mosaics]
 +
* [https://medium.com/pangeo/cloud-native-geoprocessing-of-earth-observation-satellite-data-with-pangeo-997692d91ca2 blog post for context]
 +
=== Recording ===
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/yQon6Wh-lN4" frameborder="0" allowfullscreen></iframe></html>
 +
 +
== [CANCELLED DUE TO US GOV SHUTDOWN] 10 Jan 2019 ==
  
 
== 13 Dec 2018: "Developing JupyterLab Extensions": Ian Rose (Berkeley)  ==
 
== 13 Dec 2018: "Developing JupyterLab Extensions": Ian Rose (Berkeley)  ==
Line 28: Line 173:
 
* https://jupyterlab.readthedocs.io/en/stable/user/extensions.html
 
* https://jupyterlab.readthedocs.io/en/stable/user/extensions.html
 
* https://jupyterlab.readthedocs.io/en/stable/developer/extension_dev.html
 
* https://jupyterlab.readthedocs.io/en/stable/developer/extension_dev.html
 +
 +
=== Recording ===
 +
<html><iframe width="560" height="315" src="https://www.youtube.com/embed/cUEutudPmAo" frameborder="0" allowfullscreen></iframe></html>
 +
  
 
== 8 November 2018: "Intake: Lightweight tools for loading and sharing data in data science projects": Martin Durant (Anaconda) ==
 
== 8 November 2018: "Intake: Lightweight tools for loading and sharing data in data science projects": Martin Durant (Anaconda) ==
Line 687: Line 836:
  
 
[[Media:2015-10-13_ESIP_RantRave_RajPandya.pdf | PDF]]
 
[[Media:2015-10-13_ESIP_RantRave_RajPandya.pdf | PDF]]
 
=== WebEx Recordings ===
 
 
[https://esipfed.webex.com/esipfed/ldr.php?RCID=221cb6674dbff96604009d20e182c637 Streaming] | [https://esipfed.webex.com/esipfed/lsr.php?RCID=0ade6c77a86111ca979174ef312aca30 Download] (The talk starts at about 12:15 into the recording.)
 
  
 
== 13 Aug 2015: Rich Signell on Catalog-driven Workflows for Science ==
 
== 13 Aug 2015: Rich Signell on Catalog-driven Workflows for Science ==
Line 716: Line 861:
  
 
[https://speakerdeck.com/rsignell/catalog-driven-reproducible-workflows-for-ocean-science Speaker Deck] | [[Media:2015-08-13 ESIP RantRave.pdf | PDF]]
 
[https://speakerdeck.com/rsignell/catalog-driven-reproducible-workflows-for-ocean-science Speaker Deck] | [[Media:2015-08-13 ESIP RantRave.pdf | PDF]]
 
=== WebEx Recordings ===
 
 
[https://esipfed.webex.com/esipfed/ldr.php?RCID=c3ece329915705e77f48c6da0ecc2204 Streaming] | [https://esipfed.webex.com/esipfed/lsr.php?RCID=ffcf193fd739d057d72217cdd2ff8f3e Download] (The talk starts at about 12:30 into the recording.)
 
  
 
== 11 June 2015: [http://www.nationaldataservice.org/projects/labs.html NDS Labs], Matt Turk ==
 
== 11 June 2015: [http://www.nationaldataservice.org/projects/labs.html NDS Labs], Matt Turk ==
Line 727: Line 868:
 
=== Slides ===
 
=== Slides ===
 
[[Media:2015-06-11_ESIP_RantRave_NDSLabs.pdf | PDF]]
 
[[Media:2015-06-11_ESIP_RantRave_NDSLabs.pdf | PDF]]
 
=== WebEx Recordings ===
 
 
[https://esipfed.webex.com/esipfed/ldr.php?RCID=dd5f5320aed8be082110abbf107b27db Streaming] | [https://esipfed.webex.com/esipfed/lsr.php?RCID=dc5fba6bcc82a51951041cfcea8e9a98 Download] (The talk starts at about 21:00 into the recording.)
 

Revision as of 07:29, October 16, 2019

Bold text= Tech Dive Webinars =

14 November 2019: "TBD":

Time: Thursday, 14, November, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific) https://www.gotomeeting.com/join/533510693

10 October 2019: "ELFIE: Environmental Linked Features Interoperability Experiment":

Summary This webinar will provide an update on activities of the OGC Environmental Linked Features Interoperability Experiment (ELFIE) and a preview of activities of the Second ELFIE (SELFIE). This will be an interactive demonstration-based session focused on practical application of JSON-LD to link features and observations.

Presenter: David Blodgett, Civil Engineer, U.S. Geological Survey Water Mission Area

More info on ELFIE here.

Recording

12 September 2019: "STARE: SpatioTemporal Adaptive Resolution Encoding for Scalable Integrative Analysis": Michael Rilee

Summary Title: STARE: SpatioTemporal Adaptive Resolution Encoding for Scalable Integrative Analysis

Abstract: The Earth Science enterprise has generated a great volume and variety of high-quality, high-spatiotemporal resolution data (e.g. level 1 and 2 swath data), that could be integrated to open wide arenas of new, improved scientific advances. Yet aligning and integrating different kinds of Earth Science data is a laborious process, leading most researchers to focus on more generic, high level data products that are more easily compared. Dealing with the great volume and variety is the goal of the NASA/ACCESS-17 STARE project. STARE is a unifying indexing scheme addressing variety and is well suited for applying distributed storage and computing resources to address volume. We will describe the STARE approach and how it relates to other geolocation schemes, e.g. DGGS or simulation grids.

Presenter: Michael Rilee, Ph.D., Rilee Systems Technologies LLC.

Bio: Michael Rilee, PI of the STARE project, has been involved with high-end computing and modeling and simulation for about 25 years. For about the past 5 years or so he has been researching advanced computing techniques for a range of Earth Science applications including parallel computing, array databases, regridding, kriging, and atmospheric chemistry. Before that he was involved with several NASA efforts involving spacecraft autonomy, on-board data processing, and high-end computing. He was initially trained in plasma astrophysics, receiving his Ph.D. from Cornell University.

See poster about STARE here. See demo here. See related research here.

Recording

8 August 2019: "The Challenge of Location and How Discrete Global Grid Systems can enable Spatial Data Integration.": Matthew Purss

Summary Everything has a location. Location can be defined using descriptive terms (e.g. place names, suburb names, and rivers), position and geometry (e.g. geographic coordinates like latitude and longitude) and/or index notations (e.g. mesh block or parcel identifiers). A street address is useful for service delivery; land parcels for revenue raising and investment; coordinates and grids for positioning and monitoring changes in the landscape; various administrative boundaries for law enforcement and service management; and statistical (e.g. health, society or economy) information to analyse and improve life in Australian. However, existing approaches and disconnected infrastructures coupled with the myriad of ways to describe and store location information limit our ability to discover, access and integrate spatial data across organisation and jurisdiction boundaries to produce reliable and actionable information. The Location Index project (LOC-I) aims to introduce a consistent way to access, analyse and use location data to support the effective integration of socio-economic, statistical and environmental data from multiple data providers to support the spatially enabled delivery of Government policies and initiatives.

The flexible, and standards based, spatial data infrastructures being developed by Geoscience Australia and its partners under LOC-I include Internet of Things (IoT), Linked Data and Discrete Global Grid Systems (DGGS) technologies integrated with cloud-based data discovery and access tools. This will enable LOC-I to democratise spatial data applications, where people will be able to do spatial data integration operations without needing GIS specialists. This presentation will provide an overview of LOC-I, DGGS and how DGGS technologies can be used as a tool to spatially enable and integrate socio-economic and geospatial data together.

Dr. Matthew Brian John Purss is a Senior Advisor on Geospatial Standards at Geoscience Australia. He is a founding co-chair of the Open Geospatial Consortium's (OGC) Discrete Global Grid Systems (DGGS) Domain and Standards Working Groups. He is a geophysicist with over 20 years' experience in the exploration, research and government sectors and holds a PhD in Exploration Geophysics from Monash University where he studied grid based approaches to magnetic and electromagnetic modelling applications. Prior to initiating and leading the international standardisation of DGGS infrastructures.

http://locationindex.org

[the slides here.]

Recording

11 July 2019: "ADIwg Open Source Metadata Toolkit": Josh Bradley, Dennis Walworth

Summary Josh Bradley is a data manager for Fish & Wildlife Service, Science Data Applications and project manager for the ADIwg mdTools project. Dennis Walworth is the data manager for the USGS Alaska Science Center and collaborator on the ADIwg mdTools project.

Recording

6 June 2019: "Google Dataset Search: Facilitating data discovery in an open ecosystem.": Chris Gorgolewski

Summary

There are thousands of data repositories on the Web, providing access to millions of datasets. National and regional governments, scientific publishers and consortia, commercial data providers, and others publish data for fields ranging from social science to life science to high-energy physics to climate science and more. Access to this data is critical to facilitating reproducibility of research results, enabling scientists to build on others’ work, and providing data journalists easier access to information and its provenance. In this talk, I will discuss recently launched Google Dataset Search, which provides search capabilities over potentially all dataset repositories on the Web. I will talk about the open ecosystem for describing and citing datasets that we hope to encourage and the technical details on how we went about building Dataset Search. Finally, I will highlight research challenges in building a vibrant, heterogeneous, and open ecosystem where data becomes a first-class citizen.

https://toolbox.google.com/datasetsearch https://www.blog.google/products/search/making-it-easier-discover-datasets/

Recording

9 May 2019: "The SpatioTemporal Asset Catalog (STAC) specification": Chris Holmes

Summary

The SpatioTemporal Asset Catalog (STAC) specification is an emerging standard to make it easier to find geospatial information. It aims to enable a cloud-native geospatial future by providing a common layer of metadata for search and discovery, while playing well with the web and existing geospatial standards.

See the the SpatioTemporal Asset Catalog (STAC) specification web site here.

Time: Thursday, 9, May, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Recording


11 Apr 2019: "Pachyderm": John Karabaic

Summary

See the Pachyderm one pager here: https://cdn2.hubspot.net/hubfs/4751021/Business_One_Pager.pdf

"Pachyderm lets you deploy and manage multi-stage, language-agnostic data pipelines while maintaining complete reproducibility and provenance."

Time: Thursday, 11, April, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker: Michael Masters (Pachyderm) See Docs Here: http://docs.pachyderm.io/en/latest/getting_started/local_installation.html

Also see intro talk here: https://changelog.com/practicalai/23

14 Mar 2019: "Integrating SciServer and OceanSpy to enable easy access to oceanographic model output": Mattia Almansi (Johns Hopkins University)

Time: Thursday, 14, March, 2019 (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Summary: OceanSpy is an open-source and user-friendly Python package that enables scientists and interested amateurs to use ocean model data sets with out-of-the-box analysis tools. OceanSpy builds on software packages developed by the Pangeo community (in particular xarray, dask, and xgcm). I will show how OceanSpy can be used on model outputs stored on the Johns Hopkins SciServer system, negating the need for the user to own a computing cluster or even download the data. OceanSpy accelerates and facilitates exploration (including visualization) of terascale data. It is designed to operate on petascale ocean simulations hosted by SciServer in the coming months.

Join meeting:

Speaker: Mattia is a PhD student at Johns Hopkins University under Prof. Tom Haine. He's a physical oceanographer, primarily interested in the dynamics governing the circulation of the Subpolar North Atlantic. To address the targets of his research he uses high-resolution numerical simulations and available observations.

Links

14 Feb 2019: "Cloud Native Geoprocessing of Earth Observation Satellite Data with Pangeo": Scott Henderson (University of Washington)

Summary: NASA has estimated that by 2025, it will be storing upwards of 250 Petabytes (PB) of its data using commercial Cloud services (e.g. Amazon Web Services [AWS]). This presentation will focus on efforts funded by a NASA ACCESS 2017 grant to transition the Earth Science community into Cloud computing by developing technologies that build on top of the growing Pangeo ecosystem. In particular, the integration of JupyterHub with Kubernetes and several high-level Python packages (i.e. Xarray, Dask, Rasterio, Intake, PyViz), are enabling Cloud-native workflows that circumvent the bottleneck of downloading large amounts of data. These tools work best with emerging Cloud-native storage solutions for satellite imagery (i.e. NASA’s CMR, STAC, COGs). In this presentation, Scott gives update on the Pangeo project and showcases a few example workflows using large public archives of optical and radar satellite data.

Speaker: Scott Henderson has a PhD in Geological Sciences from Cornell University and is currently a postdoctoral fellow at the University of Washington eScience Institute. Scott studies geologic hazards with satellite-based synthetic aperture radar.

Links

Recording

[CANCELLED DUE TO US GOV SHUTDOWN] 10 Jan 2019

13 Dec 2018: "Developing JupyterLab Extensions": Ian Rose (Berkeley)

Summary: JupyterLab extensions can customize or enhance any part of JupyterLab. They can provide new themes, file viewers and editors, or renderers for rich outputs in notebooks. Extensions can add items to the menu or command palette, keyboard shortcuts, or settings in the settings system. Extensions can provide an API for other extensions to use and can depend on other extensions. In fact, the whole of JupyterLab itself is simply a collection of extensions that are no more powerful or privileged than any custom extension. In this talk Ian will demonstrate how to build a JuptyerLab extension.

Time: Thursday, 13 Dec, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker: Ian Rose is a postdoctoral fellow at the Berkeley Institute for Data Science, where he is a core developer on Project Jupyter, working on JupyterLab. He has a PhD in geophysics from Berkeley.

Links:

Recording


8 November 2018: "Intake: Lightweight tools for loading and sharing data in data science projects": Martin Durant (Anaconda)

Summary: Intake is a set of free open-source Python tools that help load data from a variety of formats into familiar containers like Pandas dataframes, Xarray datasets, and more. Boilerplate data loading code can be transformed into reusable Intake plugins. Datasets can be described for easy reuse and sharing using Intake catalog files. Martin will give an overview of Intake and demonstrate use via Jupyter Notebooks.

Time: Thursday, 8 November, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker: Martin Durant is a data scientist and software developer at Anaconda who specializes in Python data engineering tools, instruction and solutions. He also has a PhD in Astrophysics.

Links:

Recording


11 October 2018: "SpatioTemporal Feature Registry: ESIP idea campaign and working example in USGS": Sky Bristol (USGS)

Summary: Building a National Biogeographic Map, an analysis platform for exploring biodiversity conservation measures and stressors, sparked the need for a reliable data source with a wide variety of identified places. These need to be assembled in a sustainable and robust way that keeps track of provenance and processing steps so we can build reports for decisionmakers that are trustworthy and consistent. We started an idea campaign in the ESIP Lab with some questions we have about how best to do this work. This talk will share what we have in place so far, including a registry of sources, a data processing pipeline, an integrated index, a REST API, and a working web application. Technologies include USGS ScienceBase, PostgreSQL/PostGIS, ElasticSearch, Python Flask, and other Python processing codes using GDAL and other libraries.

Time: Thursday, 11 October, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting: No live meeting, as something came up at the last minute and Sky can't make it. He did record a 30 minute presentation, however. See "Recording" below. See also the sample Jupyter notebooks in the "Links" section below.

Speaker: Sky Bristol is the branch chief for Biogeographic Characterization in the USGS Core Science Analytics, Synthesis, and Library Program. Unless he can be skiing bumps, he likes doing cool things to help people make better decisions using data.

Links: https://github.com/skybristol/notebooks/blob/master/SFR%20Exploration.ipynb


Recording

9 August 2018: "EarthSim: Flexible Environmental Simulation Workflows Entirely Within Jupyter Notebooks": Dharhas Pothina (ERDC)

Summary: Building environmental simulation workflows is typically a slow process involving multiple proprietary desktop tools that do not interoperate well. In this work, we demonstrate building flexible, lightweight workflows entirely in Jupyter notebooks. The goal is to provide a set of tools that can easily be reconfigured and repurposed as needed to rapidly solve specific emerging issues. As part of this work, extensive improvements were made to several general-purpose open source packages, including support for annotating and editing plots and maps in Bokeh and HoloViews, rendering large triangular meshes and regridding large raster data in HoloViews, GeoViews, and Datashader, and widget libraries for Param.

Time: Thursday, August 9, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker: Dharhas Pothina is the Associate Technical Director of the Information Technology Laboratory, US Army Engineer Research and Development Center, in Vicksburg, MS. He was formerly the Water Informatics Lead at the Texas Water Development Board. He holds a Ph.D in Civil Engineering from the University of Texas at Austin.

Links: https://pyviz.github.io/EarthSim/

Recording


14 June 2018: "Analysis of Massive Underwater Video Data in the Cloud using Pangeo": Tim Crone (Lamont)

Summary: An open-source environment for parallel analysis of massive (100TB) image data in the Cloud is now available via the Pangeo environment, which allows you to apply the power of the Python ecosystem from your browser. Technologies include JupyterHub, Kubernetes, Docker, and Dask distributed.

Time: Thursday, June 14, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Tim Crone is a marine geophysicist at Lamont-Doherty Earth Observatory studying spatial variations in the tidal triggering of microearthquakes within ridge systems, and problems in acoustics associated with high-temperature hydrothermal vents and seafloor seismic networks. He has recently deployed the Pangeo framework on the Microsoft Azure Cloud.

Links:

Recording



10 May 2018: "NetCDF-CF Advances - Simple Geometries, Swaths, and Groups": Dave Blodgett (USGS), Tim Whiteaker (UT Austin), Aleksander Jelanek (HDF Group) and Daniel Lee (EUMETSAT)

Summary: Simple geometry (points, lines, and polygons) has now been accepted as part of the Open Geospatial Consortium’s NetCDF-CF specification. This a major enhancement to a widely used standard whose utility has previously been limited to time-series of point or (raster) coverage data only. Advances on Groups and Swaths will also be presented.

Time: Thursday, May 10, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Dave Blodgett is a project coordinator with the USGS Office of Water, Geointelligence Branch. He holds a B.S. in civil and environmental engineering and an M.S. in water resources engineering form the UW-Madison. Tim Whiteaker is a research scientist at the University of Texas at Austin. Tim develops innovative cyberinfrastructure for solving water resource engineering challenges. Aleksander Jelanek is a Senior Informatics Architect with the HDF Group, and Daniel Lee is a Software Engineer at EUMETSAT.

Links:

Recording


12 April, 2018: "Jetstream: A free national science and engineering cloud environment on XSEDE": Jeremy Fischer, Indiana University

Summary: Jetstream, a national science and engineering cloud, adds cloud-based, on-demand computing and data analysis resources to the national XSEDE cyberinfrastructure. A description of Jetstream current and planned capabilities, and how to gain access, will be presented.

Time: Thursday, April 12, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Jeremy Fischer is a Senior Technical Advisor at Indiana University. He works primarily on the Jetstream project, as the technical evangelist getting researchers and educators on the system. In this role, he is the jack of all trades doing unix sys admin work, cloud image maintenance, support, training, documentation, and anything else that needs to happen.

Links:

Recording



8 March, 2018: "Zarr: A simple, open, scalable solution for big NetCDF/HDF data on the Cloud": Alistair Miles, University of Oxford.

Summary: The motivation, current status and future plans for Zarr will be discussed, along with a demo of basic functionality.

Time: Thursday, March 8, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Alistair Miles is the Head of Epidemiological Informatics for the Kwiatkowski group at the University of Oxford. Before joining the University of Oxford, Alistair was a research scientist at the e-Science Centre at the UK Science and Technology Facilities Council, where he was involved in a range of computing research projects, primarily in the areas of Web and semantic technology, and also in the engineering of production software systems. He is the lead developer for Zarr.

Links:

Recording


8 February, 2018: "The National Data Service Labs Workbench": Craig Willis, NCSA

Summary: The National Data Service Labs Workbench is a platform designed to share, discover, evaluate, develop, and test research data management and analysis tools. Community members can recommend or contribute tools as well as drive the direction, and the Workbench is evolving into a platform for data access, education and training.

Time: Thursday, February 8, 2018, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Craig Willis is the Technical Coordinator for the National Data Service and a senior research programmer at the National Center for Supercomputer Applications (NCSA) at the University of Illinois.

Links:

Recording


11 January 2018: "The Pangeo Project": Ryan Abernathy (Lamont) and Matthew Rocklin (Anaconda)

Summary: Pangeo is a scalable, low-barrier-for-entry science platform, with cloud-optimized storage for large multidimensional datasets, such as simulation (met, ocean, hydrologic, climate) model output. Technologies include JupyterHub, Kubernetes, Xarray, Dask, and Zarr. The Pangeo environment has been deployed on the NCAR Cheyenne supercomputer, on Google Cloud and on AWS.

Speaker(s): Ryan Abernathy is a physical oceanographer at Lamont/Columbia, and Matthew Rocklin is a open-source developer for Anaconda.

Links:

Recording

Note: This talk was given live at the ESIP winter meeting in North Bethesday, MD, USA.

14 December 2017: "Mini-Hack-Session: Developing and extending Jupyter Widgets": Jason Grout, Bloomberg

Summary: Jupyter widgets (aka ipywidgets) enables building interactive GUIs for Python code using standard form controls (sliders, dropdowns, textboxes, etc.), as well providing a framework for building complex interactive controls such as interactive 2d graphs, 3d graphics, maps, and more. Jason will walk through the thought and technical processes involved with developing new widget capability.

Time: Thursday, Dec 14, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Jason Grout is scientific software developer at Bloomberg. He has been a member of the Project Jupyter team since it's inception in 2014 and a core developer for the Jupyter widgets project. He has a PhD in mathematics from Brigham Young University.

Recording


9 November 2017: "Jupyter Widgets": Jason Grout, Bloomberg

Summary: Jupyter widgets (aka ipywidgets) enables building interactive GUIs for Python code using standard form controls (sliders, dropdowns, textboxes, etc.), as well providing a framework for building complex interactive controls such as interactive 2d graphs, 3d graphics, maps, and more. The latest developments in Jupyter widgets will be discussed as well as plans for the future.

Time: Thursday, November 9, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Jason Grout is scientific software developer at Bloomberg. He has been a member of the Project Jupyter team since it's inception in 2014 and a core developer for the Jupyter widgets project. He has a PhD in mathematics from Brigham Young University.

Recording


12 October 2017: "Research Workspace: A web-based tool for data sharing, documentation, analysis, and publication": Rob Bochenek, Axiom Data Science

Summary: The Research Workspace (RW) is a web-based tool designed to support collaborative science and data management tasks throughout the data lifecycle. The RW provides a secure environment for organizing, sharing, documenting, and analyzing scientific datasets, and for publishing datasets through a DataONE member node. As a shared, cloud-based storage environment, the RW is designed for collaborative organization and management of project content. Multiple levels of access and read/write permissions provide transparent, controlled access and oversight to collaborators, funders, and project managers. A custom metadata editor exports standards-compliant metadata (ISO 19115-2 and 19110) and includes tools for easily adding keywords, taxonomic information, keywords, spatial boundaries, and contact information to metadata records. An integrated Jupyter notebooks environment allows R- and Python-based analysis scripts to be written in and run on the RW and to access data in the RW or any data set resource that is hosted within the Axiom Data Science cyber-infrastructure stack. These notebooks serve as transparent, reproducible, and easily-shareable computational analysis and processing tools. Finally, datasets in the RW that have undergone sufficient curation and documentation can be exported to the Research Workspace DataONE Member Node for long-term preservation and broader discoverability and accessibility.  More information about the RW and it's capabilities can be found in the help documents -

Time: Thursday, October 12, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Rob Bochenek is an information architect at Axiom Data Science. Rob has been developing data management and cyber infrastructure solutions for research programs and organizations for the past fifteen years. He is a graduate of the University of Michigan with a degrees in aerospace engineering and mathematics. Early in his career Rob spent five years at the Exxon Valdez Oil Spill Trustee Council leading the data management team in processing, documenting and organizing the informational products produced from the scientific research funded to understand and monitor the ecological effects of the oil spill. Based upon that experience, Rob founded Axiom in 2006 to develop more generalized and holistic solutions for data management. He specializes in scientific geospatial information management with applications to physical/biological modeling and decision support data warehouse knowledge systems.

Links:

Recording

14 September 2017: "JupyterHub and JupyterLab Developments": Brian Granger, Cal Poly

Summary: The latest developments in JupyterHub and JupyterLab will be discussed as well as the roadmap for the future.

Time: Thursday, September 14, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Brian Granger is an Associate Professor of Physics at Cal Poly State University in San Luis Obispo, CA. He has a background in theoretical atomic, molecular and optical physics, with a PhD from the University of Colorado. His current research interests include quantum computing, parallel and distributed computing and interactive computing environments for scientific and technical computing. He is a core developer of the Jupyter project and is an active contributor to a number of other open source projects focused on scientific computing in Python.


Links:

Recording

[BONUS] 31 August 2017: "ERDDAP 5 min Lightning Talks", Jenn Sevadjian, Jim Potemra, Conor Delaney, Kevin O'Brien, John Kerfoot, Stephanie Petillo, Charles Carleton, Eli Hunter

Summary: A series of 5 minute lightning talks on how people are using ERDDAP to solve environmental data problems.

Time: Thursday, August 31, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Jenn Sevadjian, Jim Potemra, Conor Delaney, Kevin O'Brien, John Kerfoot, Stephanie Petillo, Charles Carleton, Eli Hunter


Recording


10 August 2017: "ERDDAP: Easier access to scientific data": Bob Simons, NOAA

Summary: ERDDAP is a free, open source data server that gives you a simple, consistent way to download subsets of gridded and tabular scientific datasets in common file formats and make graphs and maps.

Time: Thursday, August 10, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Speaker(s): Bob Simons is an IT Specialist with NOAA's Environmental Research Division.

Links:

Recording

13 July 2017: "GeoServer Developments": Jody Garnett and Kevin Smith, Boundless

Summary: The latest developments in GeoServer will be discussed as well as plans for the future.

Time: Thursday, July 13, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Jody Garnett is the Community Lead and Kevin Smith is the GeoWebCache Lead at Boundless.

Links:

Recording

6 June 2017: "Installing JupyterHub in the Cloud using Kubernetes Helm": Yuvi Panda

Summary: Yuvi Panda will show how to deploy JupyterHub in the Cloud using Kubernetes Helm.

Time: Tuesday, June 6, 2017, (Time: 3PM Eastern, 2PM Central, 1PM Mountain, 12PM Pacific)

Join meeting:

Speaker(s): Yuvi Panda is a developer with 15 years of experience and 400+ followers on GitHub. He worked formerly with Wikimedia, and is currently working with the Data Science Education Program at UC Berkeley to make it easier for people who don't consider themselves programmers to write code. He has been very involved with creating the Helm Chart for JupyterHub.

Recording


Links:

11 May 2017: "TerriaJS: A Free, Open-Source Library for Building Web-based Geospatial Data Explorers": Kevin Ring, CSIRO/Data61, Australia

Summary: The library behind the Australian National Map. 3D and 2D geospatial visualization based on Cesium and Leaflet. Visualise WMS, WMTS, WFS, KML, GeoJSON, CSV, CZML, GPX, and many more spatial formats out of the box, or easily add your own. Present a dynamic catalog from your existing WMS, ArcGIS, CKAN, CSW, Socrata, WMTS or WFS server, curate your catalog by hand, or use any combination thereof. Explore time-varying WMS layers, watch vehicles move smoothly across the map, and observe your CSV data change over time.

Time: Thursday, May 11, 2017, (5:00pm ET | 4:00pm CT | 3:00pm MT | 2:00pm PT | 07:00am Sydney Time)

Join meeting:

Speaker(s):

Kevin Ring is a Principal Software Engineer at CSIRO's Data61, and is the lead developer for TerriaJS. Previously, he helped found the Cesium project while working at Analytical Graphics, Inc. (AGI) and developed its streaming terrain and imagery engine.

Links:

Recording

13 April 2017: "Processing Planetary-Scale Data in the Cloud": Drew Bollinger, Development Seed

Summary: Modern cloud-based infrastructure has had a huge effect on our ability to process, manipulate, and publish satellite imagery at scale. We'll discuss current methods of making imagery available across different platforms and how this is supported by the efforts of groups like AWS to publish open satellite data including MODIS, Landsat and more.

Time: Thursday, April 13, 2017, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Presenter:

Drew Bollinger is a data analyst and software developer, with experience running advanced statistical and spatial analysis on large and small data sets, as well as building visualizations for data storytelling.

Links:

Slides

http://drewbo.com/talks/esip-2017/#0

Recording

9 March 2017: "Introduction to Esri Story Maps": Christine White, Esri

Summary: Today, multi-media communication plays a pivotal role in how an audience experiences, understands, and shares your message. Story Maps bring a narrative to life by weaving maps, text, images, video, and other content into a creative and memorable story. Christine will share several examples of effective Story Maps and then walk through how you can create and configure your own.

Time: Thursday, March 9, 2017, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Speaker(s):

Christine is a Technical Advisor and science team member at Esri. She loves using art and technology to communicate about the challenges and opportunities for our future. Christine also serves as the Vice President of ESIP. One of her favorite things about ESIP is how its members offer their unique perspectives (stories) and shared knowledge to collaborate.

GoToMeeting Recording

Slides

Christine gave her presentation as a live StoryMap, available here:

https://www.arcgis.com/apps/MapSeries/index.html?appid=5a99a82a19c84dbab641a22ddd3d329b

9 February 2017: "Web AppBuilder for ArcGIS": Derek Law, ESRI

Summary: Web AppBuilder for ArcGIS is a pure HTML5/JavaScript-based application that allows you to create your own intuitive, fast, and beautiful web apps without writing a single line of code. The app uses new ArcGIS platform features and modern browser technology to provide both flexible and powerful capabilities such as 3D visualization of data. In addition, developers have an opportunity to create custom tools and themes through the extensibility framework.

Time: Thursday, February 9, 2017, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Speaker(s):

Derek Law is an Product Manager at ESRI. He has over 15 years experience with geospatial software and web application development.

GoToMeeting Recording

Slides

https://speakerdeck.com/esipfed/esri-webapp-builder-derek-law-esri

19 January 2017: "Introduction to Google Earth Engine": Jess Walker, USGS

Summary: Google Earth Engine is a cloud-based geospatial processing platform that unites multiple petabytes of publicly accessible imagery and a massive computational infrastructure with a web-based integrated development environment (IDE). Users can harness the unprecedented combination of data and computing resources to conduct complex geospatial analyses on planetary scales.

Time: Thursday, January 19, 2017, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Speaker(s):

Jessica Walker is a postdoctoral researcher with the USGS Western Geographic Science Center in Tucson, AZ. Her research investigates the recovery of post-wildfire landscapes in Alaska and across the southwestern US using time series of remote sensing imagery.


GoToMeeting Recording

Slides

https://speakerdeck.com/esipfed/introduction-to-google-earth-engine-jessica-walker-usgs

8 December 2016: "Vector Tile Maps": Sam Matthews, Mapbox

Summary: Vector tiles make huge maps fast while offering full design flexibility. They are the vector data equivalent of image tiles for web mapping, applying the strengths of tiling – developed for caching, scaling and serving map imagery rapidly – to vector data. A general overview of vector tiles will be presented.

Speaker(s):

Sam Matthews is a Mapbox engineer focused on improving the speed and reliability of maps. He works with the Mapnik team to generate vector tiles and maintains the upload pipeline behind Mapbox Studio. He is passionate about making open source tools as welcoming as possible through clear docs and zero assumptions.

Time: Thursday, December 8, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://www.youtube.com/watch?v=wN2-ms2PwBs

Slides

https://speakerdeck.com/esipfed/vector-tile-maps-sam-matthews-mapbox

10 November 2016: "Introducing 3D Tiles": Todd Smith, AGI

Summary: 3D Tiles are an open specification for streaming massive heterogeneous 3D geospatial datasets. To expand on Cesium’s terrain and imagery streaming, 3D Tiles will be used to stream 3D content, including buildings, trees, point clouds, and vector data.

Speaker(s):

Todd Smith is the Cesium Product Manager, and helps define and manage the Cesium product line. Todd has been with the AGI team from the beginning and has been in the web mapping world for over 15 years. He is a Penn State GIS graduate.


Time: Thursday, November 10, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://www.youtube.com/watch?v=0upb4E12CPE

Slides

https://speakerdeck.com/esipfed

13 October 2016: "EarthCube Integration and Test Environment (ECITE)": Phil Yang, GMU

Summary: An outgrowth of activities of the EarthCube Technology Architecture Committee (TAC)'s Testbed Working Group (TWG), ECITE provides an integration test-bed for technology and science projects for both EarthCube funded projects and community technology demonstrations. ECITE consists of a seamless federated system of scalable and location independent distributed computational resources (nodes) across the US. The hybrid federated system provides a robust set of distributed resources utilizing including both public and private cloud capabilities.

Speaker(s): Chaowei Phil Yang is a Professor at George Mason University where he founded the NSF Spatiotemporal Innovation Center with colleagues from Harvard and UC-Santa Barbara. He advised over 30 graduate students and has placed over 20 geoinformatics professors around the world. His research interest are utilizing spatiotemporal principles to optimize computing infrastructure for geospatial science applications of national and international significance. (http://cpgis.gmu.edu/homepage/)

Time: Thursday, October 13, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://www.youtube.com/watch?v=kYi-22hXY6k

Slides

https://speakerdeck.com/esipfed


8 September 2016: "Apache Open Climate Workbench": Lewis McGibbney and Kyo Lee, NASA JPL/Apache OCW

Summary: Apache Open Climate Workbench (OCW) is an effort to develop software that performs climate model evaluation using model outputs from a variety of different sources the Earth System Grid Federation, the Coordinated Regional Climate Downscaling Experiment, the U.S. National Climate Assessment and the North American Regional Climate Change Assessment Program and temporal/spatial scales with remote sensing data from NASA, NOAA and other agencies. The toolkit includes capabilities for rebinning, metrics computation and visualization.

Speaker(s): Lewis McGibbney, NASA JPL/Apache OCW; currently a Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California, Lewis works in the Computer Science and Data Intensive Applications Group (398M). He enjoys floating up and down the tide of technologies at the Apache Software Foundation having a real enthusiasm for Web Search and Information Retrieval in particular. You'll find him on community mailing lists including Nutch, Gora, Any23, OODT, Open Climate Workbench, Tika, Usergrid and a number of incubating mailing lists including CommonsRDF, HTrace and Joshua. Lewis is currently a Project Management Committee member and Committer on OCW.

Speaker(s): Huikyo Lee, NASA JPL/Apache OCW; currently a Climate Data Scientist at the NASA Jet Propulsion Laboratory in Pasadena, California, Huikyo has lead development of Regional Climate Model Evaluation System (http://rcmes.jpl.nasa.gov), an open-source software toolkit based on Open Climate Workbench to facilitate systematic evaluation of climate models using observational datasets from a variety of sources.

Time: Thursday, September 8, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://www.youtube.com/watch?v=YA8SZiG9JZk

Slides

https://speakerdeck.com/esipfed/apache-ocw

11 August 2016: "Community Data Analysis Tools (CDAT)": Charles Doutriaux, LLNL

Summary: CDAT is a rich set of visual-data exploration and analysis capabilities well-suited for earth science data analysis problems. It integrates many tools and technology to offer scientist a start-to-finish environment for their work. From reading in various data format, to publication-quality output of their analysis.

Speaker: Charles Doutriaux is a senior Lawrence Livermore National Laboratory research computer scientist, where he is known for his work in climate analytics, informatics, and management systems supporting model intercomparison projects. He works closely with many international climate scientists and shares in the recognition of the Intergovernmental Panel on Climate Change 2007 Nobel Peace Prize. He has co­-authored over 30 peer­-reviewed articles. He presented his work to many scientific conferences. Aside from everything Python-related, his research interests include climate attribution and detection, visualization, and data analysis. Doutriaux has a master's degree in "Climate and Physico-­Chemistry of the Atmosphere" from the University Joseph Fourier in Grenoble. He’s a member of the AGU and AMS. You can contact him at doutriaux1@llnl.gov.

Time: Thursday, August 11, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://www.youtube.com/watch?v=nh2dqAHt5jY

Slides

13 July 2016: "The NOAA OneStop Data Discovery and Access Framework Project": Ken Casey, NOAA/NCEI

Summary: The OneStop Project is designed to improve NOAA's data discovery and access framework. Focusing on all layers of the framework and not just the user interface, OneStop is addressing data format and metadata best practices, ensuring more data are available through modern web services, working to improve the relevance of dataset searches, and improving both collection-level metadata management and granule level metadata systems to accommodate the wide variety and vast scale of NOAA's data.

Speaker: Ken Casey is the Deputy Director of the Data Stewardship Division in the NOAA National Centers for Environmental Information (NCEI). He leads the OneStop project, is active within NOAA's Big Earth Data Initiative and Big Data Project. Ken serves on a variety of national and international science and data management panels including the US Group on Earth Observations Data Management Working Group and the Group for High Resolution Sea Surface Temperature (GHRSST) Science Team. He co-chairs the Committee on Earth Observing Satellites SST Virtual Constellation and represents NCEI in the Federation of Earth Science Information Partners (ESIP). He holds a PhD in Physical Oceanography from the University of Rhode Island.

Time: Wednesday, July 13, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

GoToMeeting Recording

https://youtu.be/wp7trIRFDOs

Slides

https://speakerdeck.com/esipfed/noaa-one-stop-ken-casey-ncei

9 June 2016: "Dive into Docker": Kyle Wilcox, Dave Foster and Shane StClair: Axiom Data Science

Summary: Docker is an open platform for distributed applications that has taken the world by storm, making it easy to deploy services with complicated dependencies. In this presentation you will learn what Docker is, why it will make your life easier, how to build a container, and how to install containers.

Speaker: Kyle Wilcox, Dave Foster and Shane StClair are developers at Axiom Data Science. Axiom Data Science works with organizations to improve the long term management, reuse and impact of their scientific data resources. They have built Docker containers for many of the key services used by the U.S. Integrated Ocean Observing System (US-IOOS).

Time: June 9, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Links:

GoToMeeting Recording

https://youtu.be/mDR_x0E5az0

Slides

https://speakerdeck.com/esipfed/dive-into-docker-kyle-wilcox-shane-stclair-dave-foster-axiom-data-science

12 May 2016: "Leaflet Time Dimension": Biel Frontera, SOCIB

Summary: Leaflet.TimeDimension is a free, open-source Leaflet.js plugin that enables visualization of spatial data with a temporal dimension. It can manage different types of layers (WMS, GeoJSON, Overlay) and it can be easily extended. It meet some common needs, enabling web maps using observational and forecasting layers generated by a THREDDS server (via ncWMS), animating trajectories of drifters, gliders, follow a simulated oil spill, and other time dependent mapping applications.

Speaker: Biel Frontera was trained as a mathematician, and has spent most of his career developing software. He is a free software enthusiast and has worked for the last 3 years on data visualization and geospatial software issues for SOCIB, the Baleric Islands Coastal Observing and Forecasting System.

Time: May 12, 2016, (3:00pm ET | 2:00pm CT | 1:00pm MT | 12:00am PT)

Join meeting:

Links:

GoToMeeting Recording

https://www.youtube.com/watch?v=US5FUUPqlww

Slides

https://speakerdeck.com/esipfed/leatlet-time-dimension-biel-frontera-socib

21 Apr 2016: "The New Geoplatform.gov": Tod Dabolt, DOI

Summary: Geoplatform.gov was recently rebuilt from the ground up. Tod will talk about new features of the platform and plans for the future.

Speaker: Tod Dabolt is the acting Geographic Information Officer for the Department of Interior, and the technical lead on Geoplatform.gov.

Time: April 21, 2016, (2:00pm ET | 1:00pm CT | 12:00pm MT | 11:00am PT)

Join meeting:

Links:

GoToMeeting Recording

https://www.youtube.com/watch?v=f-ABUpy4Qvk

Slides

https://speakerdeck.com/esipfed/the-new-geoplatform-tod-dabolt-doi

13 Oct 2015: Raj Pandya on AGU's Thriving Earth Exchange and Sharing Solutions

The Thriving Earth Exchange is a network and platform that connects community leaders, sponsors, and scientists and helps them combine science and local knowledge to solve on-the-ground challenges related to natural hazards, natural resources, and climate change. I’ll talk about the general principles on which we are building TEX and describe the basic modules that are part of the TEX. Drawing on the lessons learned from our pilots, I'll talk about how we are developing modules and launching new projects with several partners. I’ll describe a range of projects – from a community monitoring effort in Denver to a Pamiri Mountain project to integrate climate projections into traditional calendars. I’ll introduce our nascent “share” module, and describe our partnership with Amazon Web Services to move prototype community-based solutions to the cloud to enhance their adaptability. And, just to live up to the name, I’ll frame it all around a small rant about the loading-dock model of science and a rave about more participatory approaches.

Slides

PDF

13 Aug 2015: Rich Signell on Catalog-driven Workflows for Science

"Catalog-driven, reproducible workflows for ocean science: Comparing sea level forecasts along the US Coastline"

Rich Signell

Filipe Fernandes

The USGS Integrated Ocean Observing System (US-IOOS) requires that data providers use standard web services (OPeNDAP+CF, OGC WMS, OGC SOS) for distributing model products and insitu observations. The services are captured in ISO metadata records and searchable via standard catalog services (OGC CSW).

This presentation will demonstrate how to use this system in a reproducible Jupyter Notebook, discovering, accessing and using model and observed water levels along the US Coastline, using a free python environment that can be installed on Mac, Windows and Linux in less than 10 minutes.

Slides

Speaker Deck | PDF

11 June 2015: NDS Labs, Matt Turk

Matt is a member of the NDS Labs technical advisory committee and will present NDS Labs as a platform for exploring data services -- enabling the separation of data and its representation, and how NDS Labs is functioning as an emerging platform for such separation.

Slides

PDF