This tutorial introduces Xarray which is a Python library that provides (1) data structures for multi-dimensional labeled arrays, (2) a toolkit for scalable data analysis on large, complex datasets using Dask which extends the SciPy ecosystem (e.g. NumPy, Pandas, Scikit-Learn) to larger-than-memory or distributed environments.
Attendees should be comfortable with basic Python programming (e.g., data structures, functions, etc.). Some prior exposure to Python data science libraries (e.g., NumPy, Pandas) is helpful. No specific domain knowledge is required to effectively participate in this tutorial.