End-to-End Multiview Gesture Recognition for Autonomous Car Parking System

Ben Amara, Hassene

End-to-End Multiview Gesture Recognition for Autonomous Car Parking System

dc.contributor.advisor	Karray, Fakhri
dc.contributor.author	Ben Amara, Hassene
dc.date.accessioned	2019-05-21T18:39:08Z
dc.date.available	2019-05-21T18:39:08Z
dc.date.issued	2019-05-21
dc.date.submitted	2019-05-10
dc.description.abstract	The use of hand gestures can be the most intuitive human-machine interaction medium. The early approaches for hand gesture recognition used device-based methods. These methods use mechanical or optical sensors attached to a glove or markers, which hinders the natural human-machine communication. On the other hand, vision-based methods are not restrictive and allow for a more spontaneous communication without the need of an intermediary between human and machine. Therefore, vision gesture recognition has been a popular area of research for the past thirty years. Hand gesture recognition finds its application in many areas, particularly the automotive industry where advanced automotive human-machine interface (HMI) designers are using gesture recognition to improve driver and vehicle safety. However, technology advances go beyond active/passive safety and into convenience and comfort. In this context, one of America’s big three automakers has partnered with the Centre of Pattern Analysis and Machine Intelligence (CPAMI) at the University of Waterloo to investigate expanding their product segment through machine learning to provide an increased driver convenience and comfort with the particular application of hand gesture recognition for autonomous car parking. In this thesis, we leverage the state-of-the-art deep learning and optimization techniques to develop a vision-based multiview dynamic hand gesture recognizer for self-parking system. We propose a 3DCNN gesture model architecture that we train on a publicly available hand gesture database. We apply transfer learning methods to fine-tune the pre-trained gesture model on a custom-made data, which significantly improved the proposed system performance in real world environment. We adapt the architecture of the end-to-end solution to expand the state of the art video classifier from a single image as input (fed by monocular camera) to a multiview 360 feed, offered by a six cameras module. Finally, we optimize the proposed solution to work on a limited resources embedded platform (Nvidia Jetson TX2) that is used by automakers for vehicle-based features, without sacrificing the accuracy robustness and real time functionality of the system.	en
dc.identifier.uri	http://hdl.handle.net/10012/14657
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	deep learning	en
dc.subject	video classification	en
dc.subject	dynamic hand gesture recognition	en
dc.subject	embedded platform	en
dc.subject	automotive	en
dc.subject	vehicle self-parking	en
dc.title	End-to-End Multiview Gesture Recognition for Autonomous Car Parking System	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Applied Science	en
uws-etd.degree.department	Electrical and Computer Engineering	en
uws-etd.degree.discipline	Electrical and Computer Engineering	en
uws-etd.degree.grantor	University of Waterloo	en
uws.contributor.advisor	Karray, Fakhri
uws.contributor.affiliation1	Faculty of Engineering	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: BenAmara_Hassene.pdf
Size:: 14.04 MB
Format:: Adobe Portable Document Format
Description:: Master Thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.08 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Electrical and Computer Engineering