Learning Human-Aware Strategies for Legible and Predictable Robot Navigation

dc.contributor.authorMoskalenko, Anna
dc.date.accessioned2025-09-08T15:45:42Z
dc.date.available2025-09-08T15:45:42Z
dc.date.issued2025-09-08
dc.date.submitted2025-08-04
dc.description.abstractA challenge in human-robot interaction, particularly in dynamic and crowded environments, is to design navigation that is both legible and predictable. For people to feel comfortable in the same environment as robots, their movements must be legible - ensuring quick understanding of the robot’s intentions - and predictable, meaning they align with human expectations. In this thesis, we introduce a learning-based navigation system that leverages a vector reward function to capture the dual objectives of legibility and predictability. Rather than relying on manually designed transitioning rules or fixed weighting parameters such as alpha and beta, the reward function is learned from expert demonstrations generated by a planner (LPSNav) that blends between these objectives using a continuous parameter. Our approach does not attempt to replicate the planner’s handcrafted logic, but instead generalizes the emergent patterns in its trajectories through a supervised learning framework inspired by the structure of maximum entropy IRL. The proposed architecture includes: an LSTM-based Robot Motion History Encoder, a CNN-based Environment Encoder, an LSTM-based Human History Encoder, and a two-channel Reward Analysis Engine. The system was evaluated both in simulation using the nuScenes dataset and in real-world trials using the Clearpath Jackal robot. In the user study, participants interacted with the robot in predefined navigation scenarios, and their feedback was used to assess the perceived clarity and predictability of the robot’s actions. Results show that our method generates more human-like trajectories and outperforms baseline models in scenarios requiring socially acceptable motion, such as intersections, sidestepping, and close-proximity passing. We provide both quantitative metrics (e.g., ADE and FDE) and qualitative visualizations demonstrating smooth and socially-aware navigation behavior. This work represents a step toward safer and more intuitive human-robot coexistence, offering a practical solution for real-world robotic deployment.
dc.identifier.urihttps://hdl.handle.net/10012/22354
dc.language.isoen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subjectlegibility
dc.subjectpredictability
dc.subjecthuman-aware robot navigation
dc.subjectsocially-aware navigation
dc.subjecthuman-robot interaction
dc.subjectmaximum entropy inverse reinforcement learning
dc.subjectdeep reinforcement learning
dc.subjectmotion history encoder
dc.subjecthuman expectation encoder
dc.subjectenvironment encoder
dc.subjectconditional variational autoencoder
dc.subjectreward function
dc.subjectfeature vector
dc.titleLearning Human-Aware Strategies for Legible and Predictable Robot Navigation
dc.typeMaster Thesis
uws-etd.degreeMaster of Applied Science
uws-etd.degree.departmentElectrical and Computer Engineering
uws-etd.degree.disciplineElectrical and Computer Engineering
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0
uws.contributor.advisorSmith, Stephen L.
uws.contributor.affiliation1Faculty of Engineering
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Moskalenko_Anna.pdf
Size:
6.48 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: