Using `PlotCollection` objects

This tutorial covers how to work with PlotCollection: its main attributes and methods, and how to use it to modify the figure it contains. It does not cover how to create a PlotCollection, so this should not be the first time you are hearing about PlotCollection. If it is, we recommend first going over one of the following two pages:

Introduction to batteries-included plots which introduces the “batteries-included” plots. That is, functions that take data and generate a specific type of plot, using opinionated defaults. All these functions return a PlotCollection object.
Create your own figure with PlotCollection which focuses on PlotCollection creation and different strategies that might be followed to generate/fill the figure and all its plots.

Customizing your `PlotCollection`

Modify specific visual elements

If you pass keyword arguments to map, those arguments are used in every call that map makes to the plotting function. However, in some cases we might want more control. The next cell shows an example: we directly manipulate the properties of individual visual elements to highlight only the ones that correspond to the national team of Scotland.

Important

As we have already mentioned, the structure of the .viz attribute is backend agnostic, but its contents are backend dependent.

Consequently, the steps to select a specific visual element given variable names and coordinates are always the same, but the result is an object from the chosen plotting backend. Thus, modifying the visual element is backend dependent, and we consider that adding helper functions for such tasks is out of the scope of the library.

You can interact with the .viz attribute as you’d interact with any xarray.DataTree. It is also possible to use the get_viz helper method to simplify these calls a bit. See the differences below:

pc = plot_dist(idata, var_names=["home", "intercept", "atts", "defs"])
atts_scotland_kde = pc.viz["kde"]["atts"].sel(team="Scotland").item()
# atts_scotland_kde is now the Line2D object that
# corresponds to the kde line of the coordinate Scotland of variable atts
atts_scotland_kde.set(linewidth=3, color="lime")
pc.get_viz("kde", "defs", team="Scotland").set(linewidth=3, color="lime");
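
The selection pattern itself can be tried without a figure. The following is a minimal sketch, not taken from the library: it stores stand-in objects in an xarray.DataArray and retrieves one with .sel(...).item(). The Handle class and the list of teams are hypothetical; in a real PlotCollection the stored objects come from the plotting backend.

```python
import numpy as np
import xarray as xr


class Handle:
    """Hypothetical stand-in for a backend object such as a Line2D."""

    def __init__(self, label):
        self.label = label
        self.linewidth = 1


# Made-up coordinate values for the illustration.
teams = ["France", "Italy", "Scotland"]

# Object-dtype array so each cell holds a reference to one Handle.
values = np.empty(len(teams), dtype=object)
for i, team in enumerate(teams):
    values[i] = Handle(team)

kde = xr.DataArray(values, dims=["team"], coords={"team": teams})

# Label-based selection followed by .item() returns the stored object itself.
picked = kde.sel(team="Scotland").item()
picked.linewidth = 3
```

Because the selection returns the stored object rather than a copy, changing picked also changes the object held in the array, which is why the edits in the cell above show up in the figure.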

You are not limited to only manipulating visual element properties. In the next cell, we show how to manipulate plot properties; in this case to add a grid to only the intercept plot.

pc = plot_dist(idata, var_names=["home", "intercept", "atts", "defs"])
pc.get_viz("plot", "intercept").grid(True)

Let’s also see an example of a similar task, this time using Bokeh as the backend:

from bokeh.plotting import output_notebook
output_notebook()

pc = plot_dist(
    idata,
    var_names=["home", "atts", "defs"],
    backend="bokeh",
    # make plot smaller
    figure_kwargs={"figsize": (1300, 600), "figsize_units": "dots"},
)
pe_glyph = pc.get_viz("point_estimate", "atts", team="Italy").glyph
pe_glyph

Scatter(id = 'p3833', …)

We can inspect and modify any of the stored elements by their labels. We have saved the Bokeh object that corresponds to the point estimate dot in the atts[team=Italy] plot. We can now change some of its properties before rendering the figure:

pe_glyph.fill_color = "red"
pe_glyph.size = 20
pc.show()

In some cases, it is more convenient to select elements based on their positions in the plot grid, rather than by variable names or coordinates. The row_index and col_index groups are provided for this purpose.

Note

Selection with row and column is a bit more convoluted than it might need to be, but this also serves to illustrate an important issue. Some operations on the DataTree/Dataset/DataArray objects will trigger copies, which don’t play well with the majority of plotting backend objects.

Here, for example, using .where(condition, drop=True) would make things more direct, but it triggers a copy, and because of that the plotting backend raises an error. We are forced to convert the .where operation into an indexing one.

pc = plot_dist(
    idata,
    var_names=["home", "atts", "defs"],
    backend="bokeh",
    # make plot smaller
    figure_kwargs={"figsize": (1300, 600), "figsize_units": "dots"},
)

import numpy as np
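# Boolean mask over the team dimension marking the plot at row 2, column 1
condition = (pc.get_viz("row_index", "defs") == 2) & (pc.get_viz("col_index", "defs") == 1)
# Turn the mask into coordinate labels so we index instead of using .where
cond_sel = {"team": condition.coords["team"][condition]}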
kde_glyph = pc.get_viz("kde", "defs", cond_sel).glyph
kde_glyph.line_color = "lime"
kde_glyph.line_width = 4
pc.show()
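
The conversion from a boolean condition to label-based indexing can also be tried in isolation. This is a minimal sketch under stated assumptions: the 2x2 layout, the team list and the Handle class are made up for the illustration, and only the xarray calls mirror the cell above.

```python
import numpy as np
import xarray as xr


class Handle:
    """Hypothetical stand-in for a backend object such as a glyph renderer."""

    def __init__(self, label):
        self.label = label


# Made-up layout: four plots arranged in a 2x2 grid, one per team.
teams = ["France", "Italy", "Scotland", "Wales"]
coords = {"team": teams}
row_index = xr.DataArray([0, 0, 1, 1], dims=["team"], coords=coords)
col_index = xr.DataArray([0, 1, 0, 1], dims=["team"], coords=coords)

handles = np.empty(len(teams), dtype=object)
for i, team in enumerate(teams):
    handles[i] = Handle(team)
viz = xr.DataArray(handles, dims=["team"], coords=coords)

# Boolean mask selecting the plot at row 1, column 0.
condition = (row_index == 1) & (col_index == 0)

# Convert the mask into coordinate labels, then index by label.
labels = condition.coords["team"][condition]
picked = viz.sel(team=labels).item()
```

The object retrieved this way is the one that was stored, so it can be modified in place exactly as in the cell above.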

Add new visual elements to a `PlotCollection`

Instead of modifying existing visual elements, we might want to add more elements to the plots. If we want to add something to a specific plot, the procedure is basically the same as above; the only difference is that we call a plotting function instead of modifying properties of the existing elements.

For example, let’s add a vertical reference line to the defs plot of the France national team:

pc = plot_dist(idata, var_names=["home", "atts", "defs"])
ax = pc.get_viz("plot", "defs", team="France")
ax.axvline(0, color="red");

If we instead want to add the line to all plots, we can use map:

# to be able to use map, callables must accept 2 positional arguments
# a DataArray and the plotting target
def axvline(da, target, **kwargs):
    return target.axvline(0, **kwargs)

pc = plot_dist(idata, var_names=["home", "atts", "defs"])
pc.map(axvline, color="red")
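
The callable contract can be checked without a figure. The sketch below is an illustration only: make_vline is a hypothetical helper that builds a callable with the (da, target, **kwargs) signature for an arbitrary position, and RecordingTarget is a made-up stand-in for a plotting target that just logs the calls it receives. The loop imitates one call per plot with the same keyword arguments; it is not how map is implemented.

```python
def make_vline(x):
    """Return a callable with the (da, target, **kwargs) signature."""

    def vline(da, target, **kwargs):
        # `da` is not needed for a fixed reference line, but the data
        # subset is always passed as the first positional argument.
        return target.axvline(x, **kwargs)

    return vline


class RecordingTarget:
    """Hypothetical stand-in for a plotting target; it only logs calls."""

    def __init__(self):
        self.calls = []

    def axvline(self, x, **kwargs):
        self.calls.append((x, kwargs))


# Imitate one call per plot, forwarding the same keyword arguments each time.
targets = [RecordingTarget() for _ in range(3)]
vline = make_vline(0.5)
for target in targets:
    vline(None, target, color="red")
```

With a real PlotCollection and the matplotlib backend, the same callable would be passed following the pattern above, for example pc.map(make_vline(0.5), color="red").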

Legends

PlotCollection also provides a method to automatically generate legends for the plots.

Warning

The API of the add_legend method is still quite experimental.

For properties that are shared across all variables, generating the legend is relatively straightforward. Mappings are unique, and we have sensible defaults available: coordinate values as legend entries and the dimension name as the legend title.

pc = plot_trace_dist(idata, var_names=["home", "intercept", "atts", "defs"])
pc.add_legend("chain");

It is also possible to have properties that depend on both the data variable and dimensions. In general, aesthetic mappings can be complex, with dependencies on arbitrary combinations of variables and dimensions. There can even be combinations of aesthetics which map to combinations of variables and dimensions!

Moreover, we sometimes use aesthetic mappings as a way to distinguish different visual elements or groups of visual elements. In these cases we might not need a legend, or we might even prefer to omit it.

The example we have just seen, which we’ll also repeat below, has a bit of everything. On one hand, we might need two legends: one for the color encoding of variable+team and another for the linestyle encoding of the chain. On the other hand, in this particular example (and in general when using plot_trace_dist as a diagnostic) we don’t really care about the specific encodings. The different colors for different variable+team combinations allow us to check whether lines of the same color overlap, meaning all chains have converged to the same distribution. Knowing whether the yellow line is atts for the Italy team or defs for the Scotland team is irrelevant to our goal of diagnosing convergence. So is knowing whether the dashed line represents chain 0 or chain 3.

Therefore, it would be OK to skip both legends altogether. Using PlotCollection you can choose in a couple of lines which option best suits your particular use case: no legend, a legend for a subset of the mappings, or one legend for each aesthetic mapping.

To add a legend for a mapping over multiple dimensions, we pass a sequence of dimensions (with __variable__ also being valid) as the first argument. Here we also add matplotlib-specific kwargs to make the legend look better:

pc = plot_trace_dist(
    idata,
    var_names=["home", "intercept", "atts", "defs"],
)
pc.add_legend(("__variable__", "team"), loc="outside upper center", fontsize=10, ncols=5);

In this example each combination of variable and dimension is encoded in a single aesthetic, but that is not necessarily true. The Advanced examples section has some examples with multiple aesthetics mapped to the same dimension combination. In such cases, the legend requested for that dimension combination shows all the aesthetics mapped to it.
