🚧
Apologies, the code for this tutorial is out of date due to the major change in how we configure Monty. We'll update it soon™️.

Introduction

In this tutorial we will introduce the basic mechanics of Monty experiment configs, how to run them, and what happens during the execution of a Monty experiment. Since we will focus mainly on the execution of an experiment, we'll configure and run the simplest possible experiment and walk through it step-by-step. Please have a look at the next tutorials for more concrete examples of running the code with our current graph learning approach.

📘
Don't have the YCB Dataset Downloaded?
Most of our tutorials require the YCB dataset, including this one. Please follow the instructions on downloading it here.

Setting up the Experiment Config

📘
Below instructions assume you'll be running an experiment within the checked out tbp.monty repository. This is the recommended way to start. Once you are familiar with Monty, if you'd rather setup your experiment in your own repository, then take a look at Running An Experiment From A Different Repository.

To follow along, copy this code into the benchmarks/configs/my_experiments.py file.

from dataclasses import asdict

from benchmarks.configs.names import MyExperiments
from tbp.monty.frameworks.config_utils.config_args import (
    LoggingConfig,
    PatchAndViewMontyConfig,
)
from tbp.monty.frameworks.config_utils.make_env_interface_configs import (
    get_env_interface_per_object_by_idx,
    SupervisedPretrainingExperimentArgs,
)
from tbp.monty.frameworks.environments import embodied_data as ED
from tbp.monty.frameworks.experiments.pretraining_experiments import (
    MontySupervisedObjectPretrainingExperiment,
)
from tbp.monty.simulators.habitat.configs import (
    PatchViewFinderMountHabitatEnvInterfaceConfig,
)

#####
# To test your env and help you familiarize yourself with the code, we'll run the simplest possible
# experiment. We'll use a model with a single learning module as specified in
# monty_config. We'll also skip evaluation, train for a single epoch for a single step,
# and only train on a single object, as specified in experiment_args and train_env_interface_args.
#####

first_experiment = dict(
    experiment_class=MontySupervisedObjectPretrainingExperiment,
    logging=LoggingConfig(),
    experiment_args=SupervisedPretrainingExperimentArgs(
        do_eval=False,
        max_train_steps=1,
        n_train_epochs=1,
    ),
    monty_config=PatchAndViewMontyConfig(),
    # Set up the environment and agent.
    env_interface_config=PatchViewFinderMountHabitatEnvInterfaceConfig(),
    train_env_interface_class=ED.EnvironmentInterfacePerObject,
    train_env_interface_args=get_env_interface_per_object_by_idx(start=0, stop=1),
)

experiments = MyExperiments(
    first_experiment=first_experiment,
)
CONFIGS = asdict(experiments)

Next you will need to declare your experiment name as part of the MyExperiments dataclass in the benchmarks/configs/names.py file:

@dataclass
class MyExperiments:
    first_experiment: dict

Running the Experiment

To run this experiment you just defined, you can now simply navigate to the benchmarks/ folder and call the run.py script with the experiment name as the -e argument.

cd benchmarks
python run.py -e first_experiment

What Just Happened?

Now that you have run your first experiment, let's unpack what happened. This first section involves a lot of text, but rest assured, once you grok this first experiment, the rest of the tutorials will be much more interactive and will focus on running experiments and using tooling. This first experiment is virtually the simplest one possible, but it is designed to familiarize you with all the pieces and parts of the experimental workflow to give you a good foundation for further experimentation.

Experiments are implemented as Python classes with methods like train and evaluate. In essence, run.py loads a config and calls train and evaluate methods if the config says to run them. Notice that first_experiment has do_eval set to False, so run.py will only run the train method.

Experiment Structure: Epochs, Episodes, and Steps

One epoch will run training (or evaluation) on all the specified objects. An epoch generally consists of multiple episodes, one for each object, or for each pose of an object in the environment. An episode is one training or evaluating session with one single object. This episode consists of a sequence of steps. What happens in a step depends on the particular experiment, but an example would be: shifting the agent's position, reading sensor inputs, transforming sensor inputs to features, and adding these features to an object model. For more details on this default experiment setup see this section from the Monty documentation.

If you examine the MontyExperiment class, the parent class of MontySupervisedObjectPretrainingExperiment, you will notice that there are related methods like {pre,post}_epoch, and {pre,post}_episode. With inheritance or mixin classes, you can use these methods to customize what happens before during and after each epoch, or episode. For example, MontySupervisedObjectPretrainingExperiment reimplements pre_episode and post_epoch to provide extra functionality specific to pretraining experiments. Also notice that each method contains calls to a logger. Logger classes can also be customized to log specific information at each control point. Finally, we save a model with the save_state_dict method at the end of each epoch. All told, the sequence of method calls goes something like

MontyExperiment.train (loops over epochs)
- Do pre-train logging.
- MontyExperiment.run_epoch (loops over episodes)
  - MontyExperiment.pre_epoch
    - Do pre-epoch logging.
  - MontyExperiment.run_episode (loops over steps)
    - MontyExperiment.pre_episode
      - Do pre-episode logging.
    - Monty.step
    - MontyExperiment.post_episode
      - Update object model in memory.
      - Do post-episode logging
  - MontyExperiment.post_epoch
    - MontyExperiment.save_state_dict.
    - Do post-epoch logging.
- Do post-train logging.

and this is exactly the procedure that was executed when you ran python run.py -e first_experiment. (Please note that we're writing MontyExperiment in the above sequence rather than MontySupervisedObjectPretrainingExperiment for the sake of generality). When we run Monty in evaluation mode, the same sequence of calls is initiated by MontyExperiment.evaluate minus the model updating step in MontyExperiment.post_episode. See here for more details on epochs, episodes, and steps.

Model

The model is specified in the monty_config field of the first_experiment config as a PatchAndViewMontyConfig which is in turn defined within src/tbp/monty/frameworks/config_utils/config_args.py. Yes, that's a config within a config. The reason for nesting configs is that the model is an ensemble of LearningModules (LMs), and SensorModules (SMs), each of which could potentially have their own configuration as well. For more details on configuring custom learning or sensor modules see this guide.

For now, we will start with one of the simpler and most common versions of this complex system. The PatchAndViewMontyConfig dataclass has fields learning_module_configs and sensor_module_configs where each key is the name of an LM (or SM resp.), and each value is the full config for that model component. Our first model has one LM and two SMs. Why two SMs and only 1 LM? One SM provides the LM with processed observations, while the second SM is used solely to initialize the agent at the beginning of the experiment.

Note that the sm_to_agent_dict field of the model config maps each SM to an "agent" (i.e. a moveable part), and only a single agent is specified, meaning that our model has one moveable part with one sensor attached to it. In particular, it has an RGBD camera attached to it.

Steps

By now, we know that an experiment relies on train and evaluate methods, that each of these runs one or more epochs, which consists of one or more episodes, and finally each episode repeatedly calls model.step. Now we will start unpacking each of these levels, starting with the innermost loop over steps.

In PatchAndViewMontyConfig, notice that the model class is specified as MontyForGraphMatching (src/tbp/monty/frameworks/models/graph_matching.py), which is a subclass of MontyBase defined in src/tbp/monty/frameworks/models/monty_base.py, which in turn is a subclass of Monty, an abstract base class defined in src/tbp/monty/frameworks/models/abstract_monty_classes.py. In the abstract base class Monty, you will see that there are two template methods for two types of steps: _exploratory_step and _matching_step. In turn, each of these steps is defined as a sequence of calls to other abstract methods, including _set_step_type_and_check_if_done, which is a point at which the step type can be switched. The conceptual difference between these types of steps is that during exploratory steps, no inference is attempted, which means no voting and no keeping track of which objects or poses are possible matches to the current observation. Each time model.step is called in the experimental procedure listed under the "Episodes and Epochs" heading, either _exploratory_step or _matching_step will be called. In a typical experiment, training consists of running _matching_step until a) an object is recognized, or b) all known objects are ruled out, or c) a step counter exceeds a threshold. Regardless of how matching-steps is terminated, the system then switches to running exploratory step so as to gather more observations and build a more complete model of an object.

You can, of course, customize step types and when to switch between step types by defining subclasses or mixins. To set the initial step type, use model.pre_episode. To adjust when and how to switch step types, use _set_step_type_and_check_if_done.

In this particular experiment, n_train_epochs was set to 1, and max_train_steps was set to 1. This means a single epoch was run, with one matching step per episode. In the next section, we go up a level from the model step to understand episodes and epochs.

Environment Interface

In the config for first_experiment, there is a comment that marks the start of environment and agent setup. Now we turn our attention to everything below that line, as this is where episode specifics are defined.

The environment interface class is the way we interact with a simulation environment. The objects within an environment are assumed to be the same for both training and evaluation (for now), hence only one (class, args) pairing is needed. Note however that object orientations, as well as specific observations obtained from an object, will generally differ across training and evaluation.

The environment interface is basically the API between the environment and the model. Its job is to sample from the environment and return observations to the model (+initialize and reset the environment). Note that the next observation is decided by the last action, and the actions are selected by a motor_system. This motor system is shared by reference with the model. By changing the actions, the model controls what it observes next, just as you would expect from a sensorimotor system.

Now, finally answering our question of what happens in an episode, notice that our config uses a special type of environment interface: EnvironmentInterfacePerObject (note that this is a subclass of EnvironmentInterface which is kept as general as possible to allow for flexible subclass customization). As indicated in the docstring, this environment interface has a list of objects, and at the beginning / end of an episode, it removes the current object from the environment, increments a (cyclical) counter that determines which object is next, and places the new object in the environment. The arguments to EnvironmentInterfacePerObject determine which objects are added to the environment and in what pose. In our config, we use a single list with one YCB object. As shown by this line train_env_interface_args=get_env_interface_per_object_by_idx(start=0, stop=1),

Final Notes on the Model

To wrap up this tutorial, we'll cover a few more details of the model. Recall that sm_to_agent_dict assigns each SM to a moveable part (i.e. an "agent"). The action space for each moveable part is in turn defined in the motor_system_config part of the model config. Once an action is executed, the agent moves, and each sensor attached to that agent (here just a single RGBD sensor) receives an observation. Just as sm_to_agent_dict specifies which sensors are attached to which agents, in src/tbp/monty/frameworks/config_utils/config_args the MontyConfig field sm_to_lm_matrix specifies for each LM which SMs it will receive observations from. Thus, observations flow from agents to sensors (SMs), and from SMs to LMs, where all actual modeling takes place in the LM. Near the end of model.step (remember, this can be either matching_step or exploratory_step), the model calls decide_location_for_movement which selects actions and closes the loop between the model and the environment. Finally, at the end of each epoch, we save a model in a directory specified by the ExperimentArgs field of the model config.

Summary

That was a lot of text, so let's review what all went into this experiment.

We ran a MontyExperiment using run.py
We went through the train procedure with one epoch
The epoch looped over a list of objects of length 1 - so a single episode was run
The max steps was set to 1, so all told, we took one single step on one single object
Our model had a single agent with a single RGBD camera attached to it
During model.step, matching_step was called and one SM received one observation from the environment
The decide_location_for_movement method was called
We saved our model at the end of the epoch

Congratulations on completing your first experiment! Ready to take the next step? Learn the ins-and-outs of pretraining a model.

Help Us Make This Page Better

All our docs are open-source. If something is wrong or unclear, submit a PR to fix it!

Make a Contribution

Learn how to contribute to our docs