Continual learning-based Video Object Segmentation

dc.contributor.advisor: Fieguth, Paul
dc.contributor.author: Nazemi, Amir
dc.date.accessioned: 2023-06-23T12:46:20Z
dc.date.available: 2023-06-23T12:46:20Z
dc.date.issued: 2023-06-23
dc.date.submitted: 2023-06-13
dc.description.abstract: Machine learning models, specifically deep convolutional neural networks, have exceeded human-level performance in many research areas, such as object classification and voice recognition. However, they are not comparable to humans in real-world learning scenarios in which the training data is a non-i.i.d., infinite stream. One such scenario is continual learning, a recent and increasingly popular area of machine learning research. It is the process of learning from sequential data that comprises different domains and tasks. The main feature of a continual learning problem is that the learning model does not have access to the data on which it was previously trained. The main challenge of training a machine learning model on sequential data is catastrophic forgetting, which happens when a model forgets previously learned tasks after being trained on new ones. There are three families of solutions to continual learning problems: prior-focused (regularization-based) solutions, likelihood-focused (rehearsal-based) solutions, and hybrid (ensemble) approaches. In this thesis, semi-supervised video object segmentation (VOS) is addressed as a continual learning problem, specifically for long video sequences, and three solutions are proposed. The first, Gated-Regularizer Continual Learning (GRCL), is a prior-focused solution. The second, Reconstruction-based Memory Selection Continual Learning (RMSCL), is a likelihood-focused solution. The third is a hybrid solution (Hybrid) that benefits from both GRCL and RMSCL. All of the proposed solutions improve the performance of two baseline online VOS methods (LWL and JOINT), and they can augment any online VOS method to improve its performance on long videos. (en)
dc.identifier.uri: http://hdl.handle.net/10012/19583
dc.language.iso: en (en)
dc.pending: false
dc.publisher: University of Waterloo (en)
dc.title: Continual learning-based Video Object Segmentation (en)
dc.type: Doctoral Thesis (en)
uws-etd.degree: Doctor of Philosophy (en)
uws-etd.degree.department: Systems Design Engineering (en)
uws-etd.degree.discipline: System Design Engineering (en)
uws-etd.degree.grantor: University of Waterloo (en)
uws-etd.embargo.terms: 0 (en)
uws.contributor.advisor: Fieguth, Paul
uws.contributor.affiliation1: Faculty of Engineering (en)
uws.peerReviewStatus: Unreviewed (en)
uws.published.city: Waterloo (en)
uws.published.country: Canada (en)
uws.published.province: Ontario (en)
uws.scholarLevel: Graduate (en)
uws.typeOfResource: Text (en)

Files

Original bundle

Name: Nazemi_Amir.pdf
Size: 19.97 MB
Format: Adobe Portable Document Format
Description: Revised

License bundle

Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission