UWSpace is currently experiencing technical difficulties resulting from its recent migration to a new version of its software. These technical issues are not affecting the submission and browse features of the site. UWaterloo community members may continue submitting items to UWSpace. We apologize for the inconvenience, and are actively working to resolve these technical issues.
 

Design of practical computer vision system with real-time object detection capability

Loading...
Thumbnail Image

Date

2024-03-07

Authors

chen, guanyu

Journal Title

Journal ISSN

Volume Title

Publisher

University of Waterloo

Abstract

Computer vision nowadays relies heavily on machine learning techniques to interpret useful information from images or videos. Object detection is one such computer vision technique for identifying and locating objects in images. This type of application is of great interest for its potential use in various fields including product inspection, analysis, security, etc. As another important technique in computer vision, object recognition for identifying objects in images has been accomplished earlier. Classic models including LeNet and VGG16 have already adopt CNN-like architectures. In comparison, an object detection model would not only identify objects, but also label each detected object with a bounding box. Provided ground truth labels about both object class and bounding box coordinates, object detection models can be trained regularly for making both predictions. Certain families of object detection models are listed as follows: In R-CNN, the Region Proposal Network (RPN) produces region proposals, corresponding to rectangular regions in the image in which targeting object is possibly present. YOLO divides the input image into grids and predicts the bounding box and class confidence simultaneously for each grid. SSD is a similar model to YOLO but has better accuracy by using features at different scales. As a result of improved hardware performance and innovative network architecture in recent years, real-time object detection has become possible with both satisfying speed and accuracy. The goal of this thesis is to implement a real-time object detection system based on some of the already published models, with the Proposal Connection Network (PCN) discussed in more detail. PCN in simple terms is a two-stage, anchor-free object detection model with unique advantages. Following the demonstration of system design and setup are training and experimental processes, focusing primarily on performance analysis and comparison among models.

Description

Keywords

LC Keywords

Citation