Indexing and Selecting Data

Indexing and Selecting Data#

Learning Objectives#

Understanding the difference between position and label-based indexing
Select data by position using .isel with values or slices
Select data by label using .sel with values or slices
Use nearest-neighbor lookups with .sel
Select timeseries data by date/time with values or slices

Introduction#

Xarray offers extremely flexible indexing routines that combine the best features of NumPy and Pandas for data selection.

The most basic way to access elements of a DataArray object is to use Python’s [] syntax, such as array[i, j], where i and j are both integers.

As xarray objects can store coordinates corresponding to each dimension of an array, label-based indexing is also possible (e.g. .sel(latitude=0), similar to pandas.DataFrame.loc). In label-based indexing, the element position i is automatically looked-up from the coordinate values.

By leveraging the labeled dimensions and coordinates provided by Xarray, users can effortlessly access, subset, and manipulate data along multiple axes, enabling complex operations such as slicing, masking, and aggregating data based on specific criteria.

This indexing and selection capability of Xarray not only enhances data exploration and analysis workflows but also promotes reproducibility and efficiency by providing a convenient interface for working with multi-dimensional data structures.

Quick Overview#

In total, xarray supports four different kinds of indexing, as described below and summarized in this table:

Dimension lookup	Index lookup	`DataArray` syntax	`Dataset` syntax
Positional	By integer	`da[:,0]`	not available
Positional	By label	`da.loc[:,'IA']`	not available
By name	By integer	`da.isel(space=0)` or `da[dict(space=0)]`	`ds.isel(space=0)` or `ds[dict(space=0)]`
By name	By label	`da.sel(space='IA')` or `da.loc[dict(space='IA')]`	`ds.sel(space='IA')` or `ds.loc[dict(space='IA')]`

In this tutorial, first we cover the positional indexing and label-based indexing, next we will cover more advanced techniques such as nearest neighbor lookups.

First, let’s import packages:

import xarray as xr

xr.set_options(display_expand_attrs=False, display_expand_data=False);

Here we’ll use air temperature tutorial dataset from the National Center for Environmental Prediction.

da = ds["air"]

Exercises#

Practice the syntax you’ve learned so far:

Exercise

Select the first 30 entries of latitude and 30th to 40th entries of longitude:

Solution

ds.isel(lat=slice(None, 30), lon=slice(30, 40))

Exercise

Select all data at 75 degree north and between Jan 1, 2013 and Oct 15, 2013

Solution

ds.sel(lat=75, time=slice("2013-01-01", "2013-10-15"))

Exercise

Remove all entries at 260 and 270 degrees

Solution

ds.drop_sel(lon=[260, 270])

Summary#

In total, Xarray supports four different kinds of indexing, as described below and summarized in this table:

Dimension lookup	Index lookup	`DataArray` syntax	`Dataset` syntax
Positional	By integer	`da[:,0]`	not available
Positional	By label	`da.loc[:,'IA']`	not available
By name	By integer	`da.isel(space=0)` or `da[dict(space=0)]`	`ds.isel(space=0)` or `ds[dict(space=0)]`
By name	By label	`da.sel(space='IA')` or `da.loc[dict(space='IA')]`	`ds.sel(space='IA')` or `ds.loc[dict(space='IA')]`

For enhanced indexing capabilities across all methods, you can utilize DataArray objects as an indexer. For more detailed information, please see the Advanced Indexing notebook.

More Resources#

Xarray Docs - Indexing and Selecting Data

Indexing and Selecting Data

Contents

Indexing and Selecting Data#

Learning Objectives#

Introduction#

Quick Overview#

Position-based Indexing#

NumPy Positional Indexing#

Positional Indexing with Xarray#

NumPy style indexing with Xarray#

Positional Indexing Using Dimension Names#

Label-based Indexing#

Dropping using `drop_sel`#

Nearest Neighbor Lookups#

Datetime Indexing#

Selecting data based on single datetime#

Selecting data for a range of dates#

Indexing with a DatetimeIndex or date string list#

Fancy indexing based on year, month, day, or other datetime components#

Exercises#

Summary#

More Resources#

Indexing and Selecting Data

Contents

Indexing and Selecting Data#

Learning Objectives#

Introduction#

Quick Overview#

Position-based Indexing#

NumPy Positional Indexing#

Positional Indexing with Xarray#

NumPy style indexing with Xarray#

Positional Indexing Using Dimension Names#

Label-based Indexing#

Dropping using drop_sel#

Nearest Neighbor Lookups#

Datetime Indexing#

Selecting data based on single datetime#

Selecting data for a range of dates#

Indexing with a DatetimeIndex or date string list#

Fancy indexing based on year, month, day, or other datetime components#

Exercises#

Summary#

More Resources#

Dropping using `drop_sel`#