Computational Patterns

Computational Patterns#

Often when writing code we repeat certain patterns, whether we realize it or not. If you have learned to write list comprehensions, you are taking advantage of a “control pattern”. Often, these patterns are so common that many packages have built in functions to implement them.

Quoting the toolz documentation:

The Toolz library contains dozens of patterns like map and groupby. Learning a core set (maybe a dozen) covers the vast majority of common programming tasks often done by hand. A rich vocabulary of core control functions conveys the following benefits:

You identify new patterns

You make fewer errors in rote coding

You can depend on well tested and benchmarked implementations

The same is true for xarray.

Motivation / Learning goals#

Learn what high-level computational patterns are available in Xarray.
Learn that these patterns replace common uses of the for loop.
Identify when you are re-implementing an existing computational pattern.
Implement that pattern using built-in Xarray functionality.
Understand the difference between map and reduce.

Xarray provides patterns in both “index space” and “label space”#

Index space#

These are sequential windowed operations with a window of a fixed size.

rolling : Operate on rolling or sliding (fixed length, overlapping) windows of your data e.g. running mean.
coarsen : Operate on blocks (fixed length) of your data (downsample).

Label space#

These are windowed operations with irregular windows based on your data. Members of a single group may be non-sequential and scattered through the dataset.

groupby : Parse data into groups (using an exact value) and operate on each one (reduce data).
groupby_bins: GroupBy after discretizing a numeric (non-exact, e.g. float) variable.
resample : Groupby specialized for time axes. Either downsample or upsample your data.

add some “loop” versions to show what a user might come up with that could be turned into one of these pattern operations

Summary#

Xarray provides methods for high-level analysis patterns:

rolling : Operate on rolling or sliding (fixed length, overlapping) windows of your data e.g. running mean.
coarsen : Operate on blocks (fixed length) of your data (downsample).
groupby : Parse data into groups (using an exact value) and operate on each one (reduce data).
groupby_bins: GroupBy after discretizing a numeric (non-exact, e.g. float) variable.
resample : [Groupby specialized for time axes. Either downsample or upsample your data.]
weighted: Weight your data before reducing.

Xarray also provides a consistent interface to make using those patterns easy:

Iterate over the operators (rolling, coarsen, groupby, groupby_bins, resample).
Apply functions that accept numpy-like arrays with reduce.
Reshape to a new xarray object with .construct (rolling, coarsen only).
Apply functions that accept xarray objects with map (groupby, groupby_bins, resample only).

Computational Patterns

Contents

Computational Patterns#

Motivation / Learning goals#

Xarray’s high-level patterns#

Load example dataset#

Identifying high-level computation patterns#

Concept refresher: “index space” vs “label space”#

Xarray provides patterns in both “index space” and “label space”#

Index space#

Label space#

Index space: windows of fixed width#

Sliding windows of fixed length: `rolling`#

Apply an existing numpy-only function with `reduce`#

View the `rolling` operation as a Xarray object with `construct`#

Advanced: Another `construct` example#

Block windows of fixed length: `coarsen`#

Coarsen supports `reduce` for custom reductions#

Coarsen supports `construct` for block reshaping and storing outputs#

Summary#

Label space “windows” or bins : GroupBy#

Deconstructing GroupBy#

Constructing group labels#

“Datetime components” for creating groups#

Construct and use custom labels#

Custom seasons with `numpy.isin`.#

`floor`, `ceil` and `round` on time#

`strftime` is another powerful option#

Custom reductions with `GroupBy.reduce`#

Viewing the GroupBy operation on your DataArray or DataSet#

Instead looping over groupby objects is possible#

In most cases, avoid a for loop using `map`#

Summary#

Computational Patterns

Contents

Computational Patterns#

Motivation / Learning goals#

Xarray’s high-level patterns#

Load example dataset#

Identifying high-level computation patterns#

Concept refresher: “index space” vs “label space”#

Xarray provides patterns in both “index space” and “label space”#

Index space#

Label space#

Index space: windows of fixed width#

Sliding windows of fixed length: rolling#

Apply an existing numpy-only function with reduce#

View the rolling operation as a Xarray object with construct#

Advanced: Another construct example#

Block windows of fixed length: coarsen#

Coarsen supports reduce for custom reductions#

Coarsen supports construct for block reshaping and storing outputs#

Summary#

Label space “windows” or bins : GroupBy#

Deconstructing GroupBy#

Constructing group labels#

“Datetime components” for creating groups#

Construct and use custom labels#

Custom seasons with numpy.isin.#

floor, ceil and round on time#

strftime is another powerful option#

Custom reductions with GroupBy.reduce#

Viewing the GroupBy operation on your DataArray or DataSet#

Instead looping over groupby objects is possible#

In most cases, avoid a for loop using map#

Summary#

Sliding windows of fixed length: `rolling`#

Apply an existing numpy-only function with `reduce`#

View the `rolling` operation as a Xarray object with `construct`#

Advanced: Another `construct` example#

Block windows of fixed length: `coarsen`#

Coarsen supports `reduce` for custom reductions#

Coarsen supports `construct` for block reshaping and storing outputs#

Custom seasons with `numpy.isin`.#

`floor`, `ceil` and `round` on time#

`strftime` is another powerful option#

Custom reductions with `GroupBy.reduce`#

In most cases, avoid a for loop using `map`#