Skip to content

Adding Steps

In order to split a notebook into multiple pipeline steps, we add cell tags in the notebook.

Option 1: Jupyter Lab

Add a Tag that Specifies a Step

Go to the cell you want to split steps with, and click the "cogs" icon in the top right. Add a tag of the following form: same_step_x where x is a digit of any length. (We currently only support linear DAG execution so even though we ask for digits, we don't actually do anything with them.)

Add step information to cell

Option 2: Classic Jupyter

Allowing for Adding Specific Tags

First, go to the notebook settings and view "Cell Metadata":

The menu item for displaying tags for Jupyter cells

Add a Tag that Specifies a Step

Second, go to the cell you want to split steps with, and add a tag of the following form: same_step_x where x is a digit of any length. (We currently only support linear DAG execution so even though we ask for digits, we don't actually do anything with them.)

Add step information to cell

Execute the Steps

Now, when you execute same run the cells will automatically be grouped together into steps, and executed serially. No additional work is necessary for the end user.

multi-step-execution

We inject an additional step for each graph, a "run info" step (which is necessary because we only know things like Run ID when the step is executed).