Non-Prehensile Mobile Agent Interactive Navigation in Cluttered Environments

Zhong, Ninghan

Non-Prehensile Mobile Agent Interactive Navigation in Cluttered Environments

dc.contributor.author	Zhong, Ninghan
dc.date.accessioned	2025-08-22T18:19:01Z
dc.date.available	2025-08-22T18:19:01Z
dc.date.issued	2025-08-22
dc.date.submitted	2025-08-16
dc.description.abstract	As autonomous mobile robots are increasingly deployed in complex, real-world environments, traditional approaches that rely solely on collision-free navigation often fall short. In many practical scenarios, physical interactions with the environment, such as pushing objects or manipulating articulated structures, are not only unavoidable but also crucial for completing tasks. This thesis studies non-prehensile interactive navigation (NPIN), where robots interact with their surroundings using non-grasping, contact-based actions. These interactions present significant challenges, including complex interaction dynamics, long-horizon planning under uncertainty, and the lack of standardized tools for evaluation. This thesis addresses these challenges through three core contributions. First, we present a novel learning-based predictive planning algorithm for autonomous surface vehicle (ASV) navigation in ice-covered waters, a real-world instantiation of NPIN. In this scenario, the ASV must interact with dynamic ice floes to reach its goal. We propose a hybrid planning framework that integrates deep learning-based occupancy prediction of obstacle motion with a graph search-based planner. The resulting system is capable of computing safe and efficient trajectories that account for future ice movements, both in simulation and in a physical testbed. Second, we broaden the scope and introduce Bench-NPIN, the first standardized benchmarking suite for NPIN. Bench-NPIN includes a diverse set of environments and tasks, such as maze navigation, box delivery, and area clearing, which span both navigation-centric and manipulation-centric categories. It also provides unified evaluation metrics and reference baseline algorithms to facilitate fair and reproducible comparisons across different approaches. Third, we present a forward-looking study into long-horizon planning through generative skill chaining using diffusion models. We investigate how sequences of low-level interaction skills can be generated in one shot using a learned generative model. Furthermore, we introduce an out-of-distribution (OOD) detection mechanism to evaluate the feasibility of these skill sequences. This allows the planner to anticipate failure, reject infeasible plans, and potentially replan in a timely manner. Together, these studies provide a comprehensive investigation into the foundations, applications, and emerging approaches for NPIN in cluttered and uncertain environments.
dc.identifier.uri	https://hdl.handle.net/10012/22239
dc.language.iso	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	robotics
dc.subject	machine learning
dc.subject	path planning
dc.subject	generative modeling
dc.title	Non-Prehensile Mobile Agent Interactive Navigation in Cluttered Environments
dc.type	Master Thesis
uws-etd.degree	Master of Applied Science
uws-etd.degree.department	Electrical and Computer Engineering
uws-etd.degree.discipline	Electrical and Computer Engineering
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	1 year
uws.contributor.advisor	Smith, Stephen
uws.contributor.affiliation1	Faculty of Engineering
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Zhong_Ninghan.pdf
Size:: 3.89 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Electrical and Computer Engineering