Guides

Application Criteria & Current Capabilities

Requirements

Monty is designed for sensorimotor applications. It is not designed to learn from static datasets like many current AI systems are, and so it may not be a drop-in replacement for an existing AI usecase you have. Our system is made to learn and interact using similar principles to the human brain, the most intelligent and flexible system known to exist. The brain is a sensorimotor system that constantly receives movement input and produces movement outputs. In fact, the only outputs from the brain are for movement. Our system works the same way. It needs to receive information about how the sensors connected to it are moving in space in order to learn structured models of whatever it is sensing. The inputs are not just a bag of features, but instead features at poses (location + orientation).

Potential Applications

Any application where you have moving sensors is a potential application for Monty. This could be physical movement of sensors on a robot. It could also be simulated movement such as our simulations in Habitat or the sensor patch cropped out of a larger 2D image in the monty meets world experiments. It could also be movement through conceptual space or another non-physical space such as navigating the internet.

Applications where we anticipage Monty to particularly shine are:

  • Applications where little data to learn from is available
  • Applications where little compute to train is available
  • Applications where little compute to do inference is available (like on edge devices)
  • Applications where no supervised data is available
  • Applications where continual learning and fast adaptation is required
  • Applications where the agent needs to generalize/extrapolate to new tasks
  • Applications where interpretability is important
  • Applications where robustness is important (to noise but also samples outside of the training distribution)
  • Applications where multimodal integration is required or multimodal transfer (learning with one modality and inferring with another)
  • Applications where the agent needs to solve a wide range of tasks
  • Applications where humans do well but current AI does not

Capabilities

For a list of current and future capabilities, see Capabilities of the System. For experiments (and their results) measuring the current capabilities of the Monty implementation, see our Benchmark Experiments.