SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments

The GRASP Lab, University of Pennsylvania

SPINE takes a mission given in natural language, reasons about implied tasks, and plans online while reacting to perception updates.

SPINE takes a mission given in natural language and a prior map generated from overhead imagery.

The map is refined real-time, and SPINE reacts to these updates as it realizes its mission. Examples above.

Abstract

As robots become increasingly capable, users will want to describe high-level missions and have robots fill in the gaps. In many realistic settings, pre-built maps are difficult to obtain, so execution requires exploration and mapping that are necessary and specific to the mission. Consider an emergency response scenario where a user commands a robot, ``triage impacted regions.'' The robot must infer relevant semantics (victims, etc.) and exploration targets (damaged regions) based on priors or other context, then explore and refine its plan online. These missions are incompletely specified, meaning they imply subtasks and semantics. While many semantic planning methods operate online, they are typically designed for well specified tasks such as object search or exploration. Recently, Large Language Models (LLMs) have demonstrated powerful contextual reasoning over a range of robotic tasks described in natural language. However, existing LLM planners typically do not consider online planning or complex missions; rather, relevant subtasks are provided by a pre-built map or a user.

We address these limitations via SPINE, a method for online semantic planning over incomplete mission specifications. The planner uses an LLM to reason about subtasks implied by the mission then realizes these subtasks in a receding horizon framework. Tasks are automatically validated for safety and refined online with new observations. We evaluate our method in simulation and real-world settings. Evaluation missions require multiple steps of semantic reasoning and exploration in cluttered outdoor environments of over 20,000 meters squared area. We evaluate our method against competitive baselines in single-agent and air-ground teaming applications.

Video

BibTeX

@article{ravichandran_jackal,
  title={SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments}, 
  author={Zachary Ravichandran and Varun Murali and Mariliza Tzes and George J. Pappas and Vijay Kumar},
  year={2024},
  journal={arXiv preprint arxiv:2410.03035},
  url={https://arxiv.org/abs/2410.03035}, 
}