Using Domain Adaptation to Improve Water Quality Modeling with Scarce Data

dc.contributor.advisorLayton, Anita
dc.contributor.authorCheung, Chi-Chung
dc.date.accessioned2025-01-06T16:39:08Z
dc.date.available2025-01-06T16:39:08Z
dc.date.issued2025-01-06
dc.date.submitted2024-12-16
dc.description.abstractWater Quality (WQ) modelling is important not just to the conservation of ecosystems, but also to the welfare of modern human society. However, collecting enough high-quality data to use for training WQ prediction models is difficult. Unlike hydrology, current WQ collecting methods are constrained by cost, spatial coverage, and temporal sparsity. This thesis explores using Domain Adaptation (DA) to overcome the data scarcity problem. By treating the different WQ measuring locations as different domains, high- resolution data from other locations can be used to better model a target location that has sparse data. The chosen DA method is inspired by domain-invariant (DI) representation learning. The model itself consists of (1) an f submodel representing the DI portion, and (2) one g submodel per domain representing the domain-variant portion. Within the context of this thesis, the main findings are as follows: 1. DA can be successfully applied in the context of WQ modeling 2. The optimal model sizes are different between the full DA method and just the pretraining. 3. Using a station’s basin was not a good measure of similarity. 4. At a high number of domains, further increasing the number of domains did not increase model performance. 5. Simply adding the outputs of f and g (i.e. f (x) + g(x)) did not perform as well as passing the output of f through g (i.e. g(f (x))). These findings support the effectiveness of using DA in WQ modelling as well as present various considerations that affect the final performance. Furthermore, these findings are relevant to not only this particular DA method but also to DA in general.
dc.identifier.urihttps://hdl.handle.net/10012/21307
dc.language.isoen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.relation.urihttps://github.com/Layton-Lab/DA-for-WQ-Modelling
dc.relation.urihttps://ncwqr-data.org/HTLP/Portal
dc.subjectwater quality
dc.subjectmodeling
dc.subjectdomain adaptation
dc.subjectmachine learning
dc.subjecttime series
dc.titleUsing Domain Adaptation to Improve Water Quality Modeling with Scarce Data
dc.typeMaster Thesis
uws-etd.degreeMaster of Mathematics
uws-etd.degree.departmentDavid R. Cheriton School of Computer Science
uws-etd.degree.disciplineComputer Science
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0
uws.comment.hiddenThird Submission Made sure file was of format LastName_FirstName.pdf Removed the s from Masters Ensured all blank pages have no page numbering Double checked https://uwaterloo.ca/current-graduate-students/academics/thesis-and-defence/thesis-formatting Pages on the email such as https://uwaterloo.ca/library/uwspace/thesis-submission-guide/resubmit, did not show-up (Page not found).
uws.contributor.advisorLayton, Anita
uws.contributor.affiliation1Faculty of Mathematics
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cheung_Chi-Chung.pdf
Size:
6.84 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: