SPINE

As robots become increasingly capable, users will want to describe high-level missions and have robots infer the relevant details. because pre-built maps are difficult to obtain in many realistic settings, accomplishing such missions will require the robot to map and plan online. while many semantic planning methods operate online, they are typically designed for well specified missions such as object search or exploration. recently, large language models (LLMs) have demonstrated powerful contextual reasoning abilities over a range of robotic tasks described in natural language. however, existing LLM-enabled planners typically do not consider online planning or complex missions; rather, relevant subtasks and semantics are provided by a pre-built map or a user. we address these limitations via spine, an online planner for missions with incomplete mission specifications provided in natural language. the planner uses an LLM to reason about subtasks implied by the mission specification and then realizes these subtasks in a receding horizon framework. tasks are automatically validated for safety and refined online with new map observations. we evaluate spine in simulation and real-world settings with missions that require multiple steps of semantic reasoning and exploration in cluttered outdoor environments of over 20,000m^2. compared to baselines that use existing LLM-enabled planning approaches, our method is over twice as efficient in terms of time and distance, requires less user interactions, and does not require a full map.

A user provides SPINE with a mission specification. SPINE's plan generator infers a task sequence which is validated online for correctness and feasibility; if necessary, feedback and corrections are provided in real-time. Actions are sent to the appropriate module, and the planner refines its plan as new information is acquired.

SPINE is comparible to an expert planner which is given step-by-step mission instructions. SPINE also has over 2x time and distance savings compared to an LLM-enabled planner that is provided relevant regions of the environment, maps those regions, and then plans.

SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments

SPINE takes a mission given in natural language, reasons about implied tasks, and refines its plan online as it actively updates its map.

Abstract

Method

Results

Video

BibTeX