API docs
Core data model
|
Key for keeping track of chunks of a distributed Dataset. |
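To make the idea of a chunk key concrete, here is a minimal sketch of an immutable, hashable key that records where a chunk sits in the full dataset. The class name `ChunkKey`, its `create` helper, and its field names are hypothetical stand-ins chosen for illustration; they are not the library's own class or signature.

```python
from dataclasses import dataclass
from typing import FrozenSet, Mapping, Optional, Tuple


@dataclass(frozen=True)
class ChunkKey:
    """Hypothetical stand-in for a chunk key (not the library's class)."""

    # Sorted (dimension, offset) pairs, stored as a tuple so the key is hashable.
    offsets: Tuple[Tuple[str, int], ...]
    # Names of the data variables in the chunk, or None meaning "all variables".
    variables: Optional[FrozenSet[str]] = None

    @classmethod
    def create(cls, offsets: Mapping[str, int], variables=None) -> "ChunkKey":
        # Sorting makes the key independent of the order the dimensions were given in.
        normalized = tuple(sorted(offsets.items()))
        names = None if variables is None else frozenset(variables)
        return cls(normalized, names)


first = ChunkKey.create({"time": 0, "lat": 10})
same = ChunkKey.create({"lat": 10, "time": 0})

# Because keys are hashable, they can index a mapping of chunks.
chunks_by_key = {first: "chunk payload"}
```

Immutability matters here because a key that changes after being emitted could no longer be used to group or look up the chunk it describes.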
Reading and writing data
|
Returns a lazily indexable xarray.Dataset and chunks from a Zarr store. |
|
Split one or more xarray.Datasets into keyed chunks. |
|
Write keyed chunks to a Zarr store in parallel. |
|
Write an entire xarray.Dataset to a Zarr store. |
|
Make a lazy Dask xarray.Dataset for use only as a template. |
|
Replaces dimension(s) in a template with updated coordinates and/or sizes. |
|
Set up a Zarr store. |
|
Check a chunk for consistency against the given template. |
|
Write a single Dataset chunk to Zarr. |
Aggregation
|
Calculate the mean over one or more distributed dataset dimensions. |
|
Calculate global mean over a pcollection of xarray.Dataset objects. |
|
Calculate per-key mean over a pcollection of (hashable, Dataset). |
|
CombineFn for computing an arithmetic mean of xarray.Dataset objects. |
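A mean over distributed chunks cannot be computed by averaging per-chunk averages, because chunks may hold different numbers of values. The usual approach is to carry a running sum and count and divide only at the end. The sketch below illustrates that pattern on plain floats. The method names mirror Apache Beam's `CombineFn` interface, but `MeanSketch` is a hypothetical plain-Python class, not the library's implementation, and it does not operate on `xarray.Dataset` objects.

```python
from typing import Iterable, Tuple

Accumulator = Tuple[float, int]  # (running sum, number of values seen)


class MeanSketch:
    """Hypothetical sum-and-count mean, shaped like a Beam CombineFn."""

    def create_accumulator(self) -> Accumulator:
        return (0.0, 0)

    def add_input(self, accumulator: Accumulator, value: float) -> Accumulator:
        total, count = accumulator
        return (total + value, count + 1)

    def merge_accumulators(self, accumulators: Iterable[Accumulator]) -> Accumulator:
        # Partial results from different workers combine by adding sums and counts.
        items = list(accumulators)
        return (sum(a[0] for a in items), sum(a[1] for a in items))

    def extract_output(self, accumulator: Accumulator) -> float:
        total, count = accumulator
        return total / count if count else float("nan")


def accumulate(fn: MeanSketch, values: Iterable[float]) -> Accumulator:
    """Fold one partition of values into a single accumulator."""
    acc = fn.create_accumulator()
    for value in values:
        acc = fn.add_input(acc, value)
    return acc


fn = MeanSketch()
partitions = [[1.0, 2.0, 3.0], [10.0]]  # two unevenly sized "chunks"
merged = fn.merge_accumulators(accumulate(fn, part) for part in partitions)
overall_mean = fn.extract_output(merged)
```

Averaging the two partition means directly would give 6.0, whereas the sum-and-count approach gives the true mean of all four values.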
Rechunking
|
Consolidate existing chunks across offsets into bigger chunks. |
|
Consolidate existing chunks across variables into bigger chunks. |
|
Split existing chunks into smaller chunks. |
|
Split existing chunks into a separate chunk per data variable. |
|
Rechunk to an arbitrary new chunking scheme with bounded memory usage. |
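Splitting and consolidating along a dimension are inverse bookkeeping operations on offsets and sizes. The sketch below shows that arithmetic for a single dimension. The helper names `split_offsets` and `consolidate_pieces` are hypothetical and operate on plain integers, not on the library's transforms or datasets.

```python
from typing import Iterable, List, Tuple

Piece = Tuple[int, int]  # (offset along the dimension, size of the piece)


def split_offsets(dim_size: int, chunk_size: int) -> List[Piece]:
    """Cut a dimension of length dim_size into pieces of at most chunk_size."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [
        (start, min(chunk_size, dim_size - start))
        for start in range(0, dim_size, chunk_size)
    ]


def consolidate_pieces(pieces: Iterable[Piece]) -> Piece:
    """Merge contiguous pieces back into one (offset, size) pair."""
    ordered = sorted(pieces)
    if not ordered:
        raise ValueError("no pieces to consolidate")
    first_offset = ordered[0][0]
    expected = first_offset
    for start, size in ordered:
        # Every piece must begin exactly where the previous one ended.
        if start != expected:
            raise ValueError(f"gap or overlap at offset {start}")
        expected += size
    return (first_offset, expected - first_offset)


pieces = split_offsets(10, 4)
whole = consolidate_pieces(pieces)
```

The last piece is shorter than `chunk_size` whenever the dimension length is not an exact multiple of it.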
Utility transforms
|
Check that keys and dataset(s) are valid for xarray-beam transforms. |
Utility functions
|
Convert offsets into slices with an optional base offset. |
|
Verify that a key and dataset(s) are valid for xarray-beam transforms. |
|
Consolidate chunks across offsets into (Key, Dataset) pairs. |
|
Consolidate chunks across distinct variables into (Key, Dataset) pairs. |
|
Consolidate chunks via merge/concat into a single (Key, Dataset) pair. |
|
Split a single (Key, xarray.Dataset) pair into many chunks. |
|
Split a single (Key, xarray.Dataset) pair into separate variables. |
|
Rechunk in-memory pairs of (Key, xarray.Dataset). |
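The offset-to-slice conversion listed above can be pictured as follows: each offset marks where a chunk starts in the full dataset, and subtracting a base offset turns that into a position relative to some enclosing chunk. This is a minimal sketch under that reading. The function name `to_slices` and its parameters are hypothetical and may not match the library's signature.

```python
from typing import Dict, Mapping, Optional


def to_slices(
    offsets: Mapping[str, int],
    sizes: Mapping[str, int],
    base: Optional[Mapping[str, int]] = None,
) -> Dict[str, slice]:
    """Hypothetical helper: turn per-dimension offsets into slices."""
    base = base or {}
    slices = {}
    for dim, offset in offsets.items():
        # Dimensions missing from base are treated as starting at zero.
        start = offset - base.get(dim, 0)
        slices[dim] = slice(start, start + sizes[dim])
    return slices


absolute = to_slices({"x": 100}, {"x": 10})
relative = to_slices({"x": 100}, {"x": 10}, base={"x": 100})

# A relative slice can index directly into the enclosing chunk's data.
enclosing = list(range(10, 30))  # stands in for a chunk that starts at offset 10
window = enclosing[to_slices({"x": 12}, {"x": 4}, base={"x": 10})["x"]]
```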
High-level interface
Warning
The high-level interface is experimental and offers no backwards compatibility guarantees.
|
Experimental high-level representation of an Xarray-Beam dataset. |
|
Create an xarray_beam.Dataset from a Zarr store. |
|
Write this dataset to a Zarr file. |
|
Create an xarray_beam.Dataset from an xarray.Dataset. |
|
Create an xarray_beam.Dataset from a Beam PTransform. |
|
Collect a dataset in memory by writing it to a temp file. |
|
Map a function over the chunks of this dataset. |
|
Rechunk this Dataset. |
|
Split variables in this Dataset into separate chunks. |
|
Consolidate variables in this Dataset into a single chunk. |
|
Compute the mean of this Dataset using Beam combiners. |
|
Return a Dataset with the first N elements of each dimension. |
|
Apply a function to this dataset with method-chaining syntax. |
|
Normalize chunks for an xarray.Dataset. |
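Method-chaining syntax, as mentioned for the function-application entry above, generally means that a function taking the dataset as its first argument can be called in a left-to-right chain. The sketch below shows the pattern on a toy container. `MiniDataset`, `map_chunks`, and `scale` are hypothetical names for illustration only and say nothing about how the experimental high-level class is implemented.

```python
from typing import Callable, Iterable, List


class MiniDataset:
    """Toy container of chunks, used only to demonstrate chaining."""

    def __init__(self, chunks: Iterable[List[float]]):
        self.chunks = list(chunks)

    def map_chunks(self, func: Callable[[List[float]], List[float]]) -> "MiniDataset":
        # Apply func to each chunk independently and return a new container.
        return MiniDataset(func(chunk) for chunk in self.chunks)

    def pipe(self, func, *args, **kwargs):
        # Pass this object as the first argument so calls read left to right.
        return func(self, *args, **kwargs)


def scale(dataset: MiniDataset, factor: float) -> MiniDataset:
    return dataset.map_chunks(lambda chunk: [value * factor for value in chunk])


result = MiniDataset([[1, 2], [3]]).pipe(scale, factor=10).pipe(scale, 2)
```

Without `pipe`, the same computation would be written inside out as `scale(scale(dataset, factor=10), 2)`, which becomes harder to read as the chain grows.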