Multi-Robot Task Planning for Multi-Object Retrieval Tasks with Distributed On-Site Knowledge via Large Language Models

Kento Murata1 · Shoichi Hasegawa1 · Tomochika Ishikawa1 · Yoshinobu Hagiwara3,4 · Akira Taniguchi2 · Lotfi El Hafi4 · Tadahiro Taniguchi4,5

Abstract

We study cooperative object search by multiple robots that receive natural-language instructions with multi-object or context-dependent goals (e.g., “find an apple and a banana”). Our framework integrates a large language model (LLM) with a spatial concept model that provides room names and room-wise object presence probabilities learned in each robot’s assigned area. With a tailored prompting strategy, the LLM infers the required items from ambiguous commands, decomposes them into subtasks, and allocates each subtask to the robot most likely to succeed given its local knowledge. In experiments, the method achieved 47/50 successful allocations, outperforming random allocation (28/50) and commonsense-only allocation (26/50), and was validated qualitatively on real mobile manipulators.

Overview

Overview: natural-language instructions → task decomposition → knowledge-aware allocation → execution, using each robot’s on-site spatial knowledge.

Each robot maintains on-site knowledge that links places (e.g., kitchen, bedroom) with object–room presence probabilities learned in its assigned area. Given a user instruction, an LLM performs task decomposition and assigns each subtask to the robot whose local knowledge gives it the highest expected chance of success. Subtasks are executed as a skill sequence (navigation → object_detection → pick → place) with feedback and replanning.
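A minimal sketch of the allocation step, assuming each robot’s on-site knowledge is summarized as room-wise object presence probabilities; the function name, data layout, and probability values are illustrative, not the released implementation:

# Minimal sketch (illustrative, not the released code): knowledge-aware
# subtask allocation. Each robot's on-site knowledge is approximated as a
# room -> {object: presence probability} table.

def allocate_subtasks(required_objects, robot_knowledge):
    """Assign each requested object to the robot most likely to find it."""
    allocation = {}
    for obj in required_objects:
        best_robot, best_prob = None, 0.0
        for robot, rooms in robot_knowledge.items():
            # Best case for this robot: its most promising room.
            prob = max(p.get(obj, 0.0) for p in rooms.values())
            if prob > best_prob:
                best_robot, best_prob = robot, prob
        allocation[obj] = best_robot
    return allocation

# Example with two robots covering different areas (made-up values).
knowledge = {
    "robot_1": {"kitchen": {"apple": 0.8, "banana": 0.7}, "living_room": {"banana": 0.2}},
    "robot_2": {"bedroom": {"book": 0.6}, "office": {"apple": 0.1}},
}
print(allocate_subtasks(["apple", "banana"], knowledge))
# -> {'apple': 'robot_1', 'banana': 'robot_1'}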

Method

Each robot learns on-site knowledge via a spatial concept model that links places (e.g., kitchen, bedroom) to object occurrence probabilities. The pipeline has four stages: (1) task decomposition from language, (2) knowledge-aware subtask allocation, (3) sequential action planning (navigation → object_detection → pick → place), and (4) execution with feedback loops implemented in FlexBE.
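The execution stages (3)–(4) can be pictured as below: a minimal Python sketch of the fixed skill sequence with a retry-and-report feedback loop. The skill function, retry policy, and subtask fields are placeholders, not the FlexBE behaviors used on the robots.

# Minimal sketch of the skill sequence with a feedback loop: retry a failed
# skill, and if retries are exhausted report back so the LLM can replan.

SKILLS = ["navigation", "object_detection", "pick", "place"]

def run_skill(skill, subtask):
    # Placeholder: on the real robots each skill is a FlexBE state that
    # reports success or failure.
    print(f"[{subtask['robot']}] {skill}: {subtask['object']} -> {subtask['room']}")
    return True

def execute_subtask(subtask, max_retries=2):
    for skill in SKILLS:
        for _attempt in range(max_retries + 1):
            if run_skill(skill, subtask):
                break
        else:
            # All retries failed: hand the failure back for replanning.
            return {"status": "failed", "skill": skill}
    return {"status": "done"}

print(execute_subtask({"robot": "robot_1", "object": "apple", "room": "kitchen"}))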

Four-stage pipeline: knowledge acquisition → decomposition & allocation → action planning → execution with feedback.

Experiments

Evaluation environment – first floor (5 rooms, 12 objects).
Evaluation environment – second floor (5 rooms, 12 objects).

We evaluate allocation accuracy across instruction types (random, hard-to-predict, commonsense, mixed). The proposed method reaches 47/50 correct allocations, versus 28/50 for random allocation and 26/50 for commonsense-only allocation.

Allocation success counts across instruction types (random, hard-to-predict, commonsense, mixed).

Supplement (Beyond the Paper)

Resources

Code availability: not publicly released at this time.

Demo Video

Demonstration of our multi-robot task planning framework.

BibTeX

@article{Murata2025MultiRobotTaskPlanning,
  title   = {Multi-Robot Task Planning for Multi-Object Retrieval Tasks with Distributed On-Site Knowledge via Large Language Models},
  author  = {Murata, Kento and Hasegawa, Shoichi and Ishikawa, Tomochika and Hagiwara, Yoshinobu and Taniguchi, Akira and El Hafi, Lotfi and Taniguchi, Tadahiro},
  journal = {arXiv preprint arXiv:2509.12838},
  year    = {2025},
  note    = {Project page: https://kentomurata0610.github.io/multi-robot-task-planning/}
}

Please cite the arXiv version until the conference review is complete.

Acknowledgments

Partially supported by JST Moonshot (JPMJMS2011), JSPS KAKENHI (JP25K15292, JP23K16975), and JST Challenging Research Program for Next-Generation Researchers (JPMJSP2101).