761 result(s) for "task-planning"
Integrating action knowledge and LLMs for task planning and situation handling in open worlds
Task planning systems have been developed to help robots use human knowledge (about actions) to complete long-horizon tasks. Most of them have been developed for “closed worlds” under the assumption that the robot is provided with complete world knowledge. However, the real world is generally open, and robots frequently encounter unforeseen situations that can potentially break the planner’s completeness. Could we leverage recent advances in pre-trained Large Language Models (LLMs) to enable classical planning systems to deal with novel situations? This paper introduces a novel framework, called COWP, for open-world task planning and situation handling. COWP dynamically augments the robot’s action knowledge, including the preconditions and effects of actions, with task-oriented commonsense knowledge. COWP embraces the openness of LLMs and is grounded in specific domains via action knowledge. For systematic evaluation, we collected a dataset of 1085 execution-time situations. Each situation corresponds to a state instance in which a robot is potentially unable to complete a task using a solution that normally works. Experimental results show that our approach outperforms competitive baselines from the literature in the success rate of service tasks. Additionally, we have demonstrated COWP on a mobile manipulator. Supplementary materials are available at: https://cowplanning.github.io/
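As a rough illustration of the idea described above (not COWP's actual implementation, whose LLM prompting and knowledge integration are more involved), action knowledge can be augmented with situation-specific commonsense at execution time; here a hard-coded lookup table stands in for the LLM, and all names are hypothetical:

```python
from dataclasses import dataclass, field

@dataclass
class Action:
    """A planning action with precondition and effect literals."""
    name: str
    preconditions: set = field(default_factory=set)
    effects: set = field(default_factory=set)

def commonsense_augment(action, situation):
    """Return a copy of `action` whose preconditions are extended with
    task-oriented commonsense for the current situation.
    The table below is a stand-in for an LLM query."""
    knowledge = {
        # e.g. a cup with a broken handle must be grasped by its body
        ("pour", "cup_handle_broken"): {"graspable_by_body"},
    }
    extra = knowledge.get((action.name, situation), set())
    return Action(action.name, action.preconditions | extra, action.effects)

pour = Action("pour", {"holding(cup)"}, {"poured(cup)"})
augmented = commonsense_augment(pour, "cup_handle_broken")
```

The original action is left untouched, so the planner can fall back to it once the novel situation is resolved.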
ProgPrompt: program generation for situated robot task planning using large language models
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even to generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible for a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation that functions across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state-of-the-art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io
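The program-like prompt idea can be sketched as follows; the function name `build_progprompt` and the exact layout are illustrative assumptions, not the paper's actual prompt format:

```python
def build_progprompt(actions, objects, example, task):
    """Assemble a program-like planning prompt in the spirit of ProgPrompt:
    available actions as function stubs, available objects as a list,
    one executable example program, then the new task left for the LLM
    to complete as code."""
    header = "\n".join(f"def {a}(obj): ..." for a in actions)
    objs = "objects = " + repr(objects)
    return f"{header}\n{objs}\n\n# Example\n{example}\n\n# Task: {task}\ndef plan():"

prompt = build_progprompt(
    actions=["grab", "putin", "open", "close"],
    objects=["salmon", "fridge", "microwave"],
    example=(
        "def put_salmon_in_fridge():\n"
        "    open('fridge')\n"
        "    putin('salmon', 'fridge')\n"
        "    close('fridge')"
    ),
    task="microwave salmon",
)
```

Because the LLM completes a function body rather than free text, its output is constrained to the action names and objects declared in the header.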
Multi-Robot Coordination Analysis, Taxonomy, Challenges and Future Scope
Recently, Multi-Robot Systems (MRS) have attained considerable recognition because of their efficiency and applicability in different types of real-life applications. This paper provides a comprehensive research study on MRS coordination, starting with the basic terminology, categorization, and application domains, and finally gives a summary of and insights into the proposed coordination approaches for each application domain. We have conducted an extensive study of recent contributions in this research area to identify strengths, limitations, and open research issues, and have highlighted the scope for future research. Further, we have examined a series of state-of-the-art parameters that affect MRS coordination and, thus, the efficiency of MRS, such as the communication mechanism, planning strategy, control architecture, scalability, and decision-making. We propose a new taxonomy to classify the various coordination approaches of MRS along six broad dimensions. We have also analyzed how coordination can be achieved and improved in two fundamental problems, i.e., multi-robot motion planning and task planning, and in various application domains of MRS such as exploration, object transport, and target tracking.
Ontology-based task planning for autonomous unmanned system: framework and principle
Operational knowledge is an important factor in improving the autonomy of unmanned systems. How to store and represent operational knowledge semantically and normatively, so that unmanned systems can autonomously accomplish task planning according to high-level human orders, carry out re-planning, and make decisions, has become a new topic in the field of autonomous unmanned systems. This paper reviews several typical robot ontologies, as well as methods for modeling and representing uncertain information, proposes the concept of Ontology-based Task Planning (OTP) for unmanned systems, and introduces the principles of ontology construction. Finally, the application of ontology in unmanned combat systems is discussed.
Text2Motion: from natural language instructions to feasible plans
We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task planning with Large Language Models. Whereas previous language-based planners only consider the feasibility of individual skills, Text2Motion actively resolves geometric dependencies spanning skill sequences by performing geometric feasibility planning during its search. We evaluate our method on a suite of problems that require long-horizon reasoning, interpretation of abstract goals, and handling of partial affordance perception. Our experiments show that Text2Motion can solve these challenging problems with a success rate of 82%, while prior state-of-the-art language-based planning methods only achieve 13%. Text2Motion thus provides promising generalization characteristics to semantically diverse sequential manipulation tasks with geometric dependencies between skills. Qualitative results are made available at https://sites.google.com/stanford.edu/text2motion.
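A minimal sketch of the feasibility-guided ranking described above, assuming per-skill scores stand in for the learned Q-functions of the skill library (the real system evaluates Q-functions on geometric state, which is not modeled here):

```python
def sequence_feasibility(skill_seq, q_values):
    """Score a candidate skill sequence as the product of per-skill
    feasibility estimates; unknown skills score zero."""
    score = 1.0
    for skill in skill_seq:
        score *= q_values.get(skill, 0.0)
    return score

# Hypothetical feasibility estimates for three skills.
q_values = {"pick(box)": 0.9, "place(box, shelf)": 0.8, "push(box)": 0.3}

# Candidate sequences (e.g. proposed by an LLM) ranked by joint feasibility.
candidates = [["pick(box)", "place(box, shelf)"], ["push(box)"]]
best = max(candidates, key=lambda s: sequence_feasibility(s, q_values))
```

Scoring whole sequences rather than individual skills is what lets the planner reject a step that is locally feasible but dooms a later step.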
Concept for the automated adaption of abstract planning domains for specific application cases in skills-based industrial robotics
High product diversity, dynamic market conditions, and a lack of skilled workers are current challenges in manufacturing. Industrial robots that autonomously plan and complete upcoming production tasks can help companies address these challenges. In this publication, we focus on autonomous task planning within industrial robotics and investigate how to facilitate the use of automated planning techniques from the field of artificial intelligence for this purpose. First, we present a novel methodology to automatically adapt abstractly modeled planning domains to the characteristics of the individual application cases a user intends to implement. A planning domain is a formalized representation of the robot’s working environment that builds the basis for automated planning. Second, we integrate this approach into the procedure for developing skills-based industrial robotic applications to enable them to perform autonomous task planning. Finally, we demonstrate the use of the methodology within the application field of kitting in two reference scenarios, with a mobile robot and a stationary robot cell. Using our methodology, persons without expertise in automated planning can enable a robot to perform autonomous task planning without much extra effort.
Logic programming for deliberative robotic task planning
Over the last decade, the use of robots in production and daily life has increased. With increasingly complex tasks and interaction in environments that include humans, robots require a higher level of autonomy for efficient deliberation. Task planning is a key element of deliberation: it combines elementary operations into a structured plan that satisfies a prescribed goal, given specifications of the robot and the environment. In this manuscript, we present a survey of recent advances in the application of logic programming to the problem of task planning. Logic programming offers several advantages compared to other approaches, including greater expressivity and interpretability, which may aid in the development of safe and reliable robots. We analyze different planners and their suitability for specific robotic applications, based on expressivity in domain representation, computational efficiency, and software implementation. In this way, we support robotic designers in choosing the best tool for their application.
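The core of task planning as defined in this abstract — combining elementary operations into a plan that satisfies a prescribed goal — can be illustrated with a minimal STRIPS-style forward search. This is an imperative sketch with a toy domain; the logic-programming planners surveyed declare the same structure declaratively (e.g. in ASP or Prolog) rather than searching imperatively:

```python
from collections import deque

def plan(init, goal, actions):
    """Breadth-first search over states.
    actions: name -> (preconditions, add_effects, del_effects), as frozensets."""
    start = frozenset(init)
    frontier = deque([(start, [])])
    seen = {start}
    while frontier:
        state, steps = frontier.popleft()
        if goal <= state:          # goal literals all satisfied
            return steps
        for name, (pre, add, delete) in actions.items():
            if pre <= state:       # action applicable
                nxt = (state - delete) | add
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append((nxt, steps + [name]))
    return None                    # no plan exists

# Toy domain: fetch a cup and wash it at the sink.
actions = {
    "pick(cup)": (frozenset({"at(table)"}), frozenset({"holding(cup)"}), frozenset()),
    "go(sink)":  (frozenset({"at(table)"}), frozenset({"at(sink)"}), frozenset({"at(table)"})),
    "wash(cup)": (frozenset({"holding(cup)", "at(sink)"}), frozenset({"clean(cup)"}), frozenset()),
}
result = plan({"at(table)"}, frozenset({"clean(cup)"}), actions)
```

A declarative planner would instead state the preconditions and effects as logical rules and let the solver find the same operation sequence.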
Affordance-informed Robotic Manipulation via Intelligent Action Library
In the realm of conventional affordance detection, the primary objective is to provide insights into the potential uses of objects. However, a significant limitation remains: these conventional methods merely treat affordance detection as a semantic segmentation task, disregarding the crucial aspect of interpreting affordances for actions that can be performed by a manipulator. To address this critical gap, we present a novel pipeline incorporating the Intelligent Action Library (IAL) concept. This framework enables affordance interpretation for various manipulation tasks, allowing robots to be taught and guided on how to execute specific actions based on the detected affordances and human-robot interaction. Through real-world experiments, we have demonstrated the ingenuity and dependability of our pipeline, effectively bridging the gap between affordance detection and manipulation task planning and execution. The integration of IAL facilitates a seamless connection between understanding affordances and empowering robots to perform tasks with precision and efficiency. A demo is publicly available at: https://youtu.be/_oBAer2Vl8k
Embodied intelligence in manufacturing: leveraging large language models for autonomous industrial robotics
This paper delves into the potential of Large Language Model (LLM) agents for industrial robotics, with an emphasis on autonomous design, decision-making, and task execution within manufacturing contexts. We propose a comprehensive framework comprising three core components: (1) matching manufacturing tasks with process parameters, emphasizing the challenges in LLM agents’ understanding of human-imposed constraints; (2) autonomous design of tool paths, highlighting the LLM agents’ proficiency in planar tasks and their difficulties with 3D spatial tasks; and (3) integration of embodied intelligence within industrial robotics simulations, showcasing the adaptability of LLM agents such as GPT-4. Our experimental results underscore the distinctive performance of the GPT-4 agent, especially in Component 3, where it excelled in task planning and achieved a success rate of 81.88% across 10 samples in task completion. In conclusion, our study accentuates the transformative potential of LLM agents in industrial robotics and suggests specific avenues, such as visual semantic control and real-time feedback loops, for their enhancement.
Unlocking Robotic Autonomy: A Survey on the Applications of Foundation Models
The advancement of foundation models, such as large language models (LLMs), vision-language models (VLMs), diffusion models, and robotics foundation models (RFMs), has become a new paradigm in robotics by offering innovative approaches to the long-standing challenge of building robot autonomy. These models enable the development of robotic agents that can independently understand and reason about semantic contexts, plan actions, physically interact with surroundings, and adapt to new environments and untrained tasks. This paper presents a comprehensive and systematic survey of recent advancements in applying foundation models to robot perception, planning, and control. It introduces the key concepts and terminology associated with foundation models, providing a clear understanding for researchers in robotics and control engineering. The relevant studies are categorized based on how foundation models are utilized in various elements of robotic autonomy, focusing on 1) perception and situational awareness: object detection and classification, semantic understanding, mapping, and navigation; 2) decision making and task planning: mission understanding, task decomposition and coordination, planning with symbolic and learning-based approaches, plan validation and correction, and LLM-robot interaction; 3) motion planning and control: motion planning, control command and reward generation, and trajectory generation and optimization with diffusion models. Furthermore, the survey covers essential environmental setups, including real-world and simulation datasets and platforms used in training and validating these models. It concludes with a discussion on current challenges such as robustness, explainability, data scarcity, and real-time performance, and highlights promising future directions, including retrieval augmented generation, on-device foundation models, and explainability. 
This survey aims to systematically summarize the latest research trends in applying foundation models to robotics, bridging the gap between the state-of-the-art in artificial intelligence and robotics. By sharing knowledge and resources, this survey is expected to foster the introduction of a new research paradigm for building generalized and autonomous robots.