Presenters-0425: Difference between revisions
From CSDMS
m Add youtube views template if missing |
m Text replacement - "\|CSDMS meeting youtube views=\{\{(Youtube_[^}]+)\}\}" to "|CSDMS meeting youtube views={{#explode:{{$1}}| |0}} |CSDMS meeting youtube AverageViews={{#explode:{{$1}}| |1}}" |
||
Line 14: | Line 14: | ||
|CSDMS meeting abstract presentation=Earth scientists face serious challenges when working with large datasets. Pangeo is a rapidly growing community initiative and open source software ecosystem for scalable geoscience using Python. Three of Pangeo’s core packages are 1) Jupyter, a web-based tool for interactive computing, 2) Xarray, a data-model and toolkit for working with N-dimensional labeled arrays, and 3) Dask, a flexible parallel computing library. When combined with distributed computing, these tools can help geoscientists perform interactive analysis on datasets up to petabytes in size. In this interactive tutorial we will demonstrate how to employ this platform using real science examples from hydrology, remote sensing, and oceanography. Participants will follow along using Jupyter notebooks to interact with Xarray and Dask running in Google Cloud Platform. | |CSDMS meeting abstract presentation=Earth scientists face serious challenges when working with large datasets. Pangeo is a rapidly growing community initiative and open source software ecosystem for scalable geoscience using Python. Three of Pangeo’s core packages are 1) Jupyter, a web-based tool for interactive computing, 2) Xarray, a data-model and toolkit for working with N-dimensional labeled arrays, and 3) Dask, a flexible parallel computing library. When combined with distributed computing, these tools can help geoscientists perform interactive analysis on datasets up to petabytes in size. In this interactive tutorial we will demonstrate how to employ this platform using real science examples from hydrology, remote sensing, and oceanography. Participants will follow along using Jupyter notebooks to interact with Xarray and Dask running in Google Cloud Platform. | ||
|CSDMS meeting youtube code=0 | |CSDMS meeting youtube code=0 | ||
|CSDMS meeting youtube views={{Youtube_0}} | |CSDMS meeting youtube views={{#explode:{{Youtube_0}}| |0}} | ||
|CSDMS meeting youtube AverageViews={{#explode:{{Youtube_0}}| |1}} | |||
|CSDMS meeting participants=0 | |CSDMS meeting participants=0 | ||
}} | }} |
Latest revision as of 16:34, 11 June 2025
CSDMS3.0 - Bridging Boundaries
Pangeo: Scalable Geoscience Tools in Python — Xarray, Dask, and Jupyter
Abstract
Earth scientists face serious challenges when working with large datasets. Pangeo is a rapidly growing community initiative and open source software ecosystem for scalable geoscience using Python. Three of Pangeo’s core packages are 1) Jupyter, a web-based tool for interactive computing, 2) Xarray, a data-model and toolkit for working with N-dimensional labeled arrays, and 3) Dask, a flexible parallel computing library. When combined with distributed computing, these tools can help geoscientists perform interactive analysis on datasets up to petabytes in size. In this interactive tutorial we will demonstrate how to employ this platform using real science examples from hydrology, remote sensing, and oceanography. Participants will follow along using Jupyter notebooks to interact with Xarray and Dask running in Google Cloud Platform.
Please acknowledge the original contributors when you are using this material. If there are any copyright issues, please let us know (CSDMSweb@colorado.edu) and we will respond as soon as possible.
Of interest for: