UWSpace

UWSpace is the University of Waterloo’s institutional repository for the free, secure, and long-term home of research produced by faculty, students, and staff.

Depositing Theses/Dissertations or Research to UWSpace

Are you a Graduate Student depositing your thesis to UWSpace? See our Thesis Deposit Help and UWSpace Thesis FAQ pages to learn more.

Are you a Faculty or Staff member depositing research to UWSpace? See our Waterloo Research Deposit Help and Self-Archiving pages to learn more.

Photo by Waterloo staff

Recent Submissions

  • Item type: Item ,
    Mitigating Risks to Dependability from Vibe-Coding C for Embedded Systems
    (University of Waterloo, 2026-05-13) Dunne, Murray
    Vibe coding is the process of using a Large Language Model (LLM) to iteratively generate software code. It is popular, with 36% of workers at technology companies reporting adoption of generative artificial intelligence for software engineering in 2024 [1]. At this rate of use, LLM-generated code is quickly becoming part of the embedded-systems that comprise our everyday cyber-physical infrastructure. Most of this infrastructure is built on C language code [2]. LLM-generated C code poses threats to dependability, exhibiting faults such as buffer overflows, out-of-bounds writes, integer overflows, and more. In this work, we contribute methods for improving the dependability of these systems in three key parts: providing a real-world benchmark dataset for evaluating LLM-generated C code, protecting LLM code generation from poisoning attacks, and detecting changes in production embedded systems through power side-channel analysis. This work begins with an examination and categorization of weaknesses in LLMgenerated C code for embedded systems networking. Our findings suggest that LLMs perform poorly at programming tasks involving direct interactions with memory. Scores on existing LLM-generated C benchmarks do not adequately express this difficulty, as these benchmarks do not include sufficiently real-world C programming challenges. To support future testing of LLMs, we introduce EmbedEvalC, a dataset of C coding challenges to provide a benchmark against which LLMs can be evaluated on real-world tasks. Retrieval Augmented Code Generation (RACG) is an essential tool for vibe coding, but presents new threats to dependability from poisoning attacks. If an attacker can cause a RACG system to retrieve their crafted documents, they can induce the LLM to generate code with weaknesses. To detect this attack, we introduce canary functions, a process by which specific functions in the codebase are regenerated and re-tested to determine whether the addition of new documents induces new weaknesses. Finally, we consider the black-box setting where a systems integrator seeks to detect unexpected changes in embedded firmware. Such changes will only become more common with the proliferation of vibe coding. We suggest using power side-channel analysis to provide a feedback mechanism to a fuzzer in order to determine if a fuzzing input has caused a new response from the system. We show that responses involving five or more memory-interacting instructions are consistently detectable. In this work, we suggest a collection of techniques to mitigate risks to the dependability of embedded systems posed by LLM-generated C code. Abstract Citations: [1] Alex Singla, Alexander Sukharevsky, Lareina Yee, Michael Chui, and Bryce Hall. "The state of AI: How organizations are rewiring to capture value", McKinsey & Company, March 2025. [2] P. Soulier, D. Li, and J. R. Williams, “A Survey of Language-Based Approaches to Cyber-Physical and Embedded System Development,” Tsinghua Science and Technology, vol. 20, no. 2, pp. 130–141, 2015.
  • Item type: Item ,
    Efficiently Training Deep Learning Models on Elastic and Heterogeneous Cloud Resources
    (University of Waterloo, 2026-05-12) Guo, Runsheng
    Deep Neural Networks (DNNs) have demonstrated remarkable success across diverse domains, but their training requires substantial computational resources and is typically parallelized across large GPU clusters. However, such clusters are prohibitively expensive for most organizations to own and manage. Hence, instead of owning and managing their own clusters, organizations often rent clusters on cloud platforms to meet their training needs. While cloud environments offer elastic scalability and heterogeneous hardware options, they also introduce significant challenges for efficient distributed DNN training. Specifically, existing training frameworks lack support for dynamic reconfiguration during training, limiting the exploitation of cloud elasticity. Additionally, most systems assume homogeneous clusters, which rarely reflect the heterogeneous GPU clusters that organizations commonly use due to hardware availability constraints. Furthermore, heterogeneous network conditions in cloud environments create communication bottlenecks that limit the scalability of existing approaches. This thesis presents three systems that collectively address these limitations to enable efficient distributed DNN training on elastic and heterogeneous cloud resources. First, Hydrozoa leverages cloud elasticity through serverless containers, enabling dynamic scaling and configuration changes during training without the traditional pitfalls of serverless computing. By combining data and model parallelism with fine-grained resource provisioning, Hydrozoa achieves cost-effective training while eliminating cluster management overhead. Second, Cephalo addresses heterogeneous GPU clusters by independently balancing compute and memory resources across GPUs with different capabilities. Unlike existing approaches that tie workload assignment to computational speed, Cephalo separately optimizes compute distribution through proportional batch sizing and memory utilization through intelligent partitioning of training state, activation checkpointing, and gradient accumulation strategies. Third, Zorse tackles heterogeneous network conditions, which are particularly common in heterogeneous clusters, by efficiently combining memory-efficient data parallelism with pipeline parallelism. Through interleaved pipelining, parameter and activation offloading, and heterogeneous pipeline parallelism configurations, Zorse achieves both communication and memory efficiency for training large DNN models across diverse network topologies. The experimental evaluation demonstrates that these systems significantly improve training efficiency and resource utilization compared to existing approaches. Hydrozoa reduces training costs while providing seamless scalability, Cephalo simultaneously achieves high compute and memory utilization in heterogeneous clusters, and Zorse maintains high throughput under varying network conditions. Together, these contributions make distributed DNN training more accessible, cost-effective, and efficient in modern cloud environments, advancing the state of the art in large-scale machine learning infrastructure.
  • Item type: Item ,
    Evaluating Mercury Deposition over Space and Time in Critical Nesting Habitat for Endangered Whooping Crane (Wood Buffalo National Park)
    (University of Waterloo, 2026-05-12) Lacey, Amy Lynn
    The only remaining wild, self-sustaining population of endangered Whooping Crane (the Aransas-Wood Buffalo population of Grus americana) nests and breeds in a remote boreal landscape of northwestern Canada known as the Whooping Crane Summer Range (WCSR), a Ramsar Wetland of International Importance. Although geographically isolated from industrial development, ponds in the WCSR may receive deposition of substances of concern, including mercury (Hg), from far-field sources via direct atmospheric transport and deposition and locally via remobilization of legacy Hg stored in soils and catchments. Mercury is of particular concern because it is a potent neurotoxin capable of long-range transport and accumulation in aquatic ecosystems. However, there are no data available to evaluate present-day Hg concentrations in aquatic sediment, the enrichment of Hg relative to naturally occurring baseline concentrations, or long-term accumulation of Hg within WCSR pond sediment. This study investigates total mercury (THg) concentrations over space and time in pond sediments of the WCSR to determine whether the concentrations pose risk of adverse biological effects on whooping crane and other aquatic biota and to quantify the enrichment and storage of THg in pond sediment. Total Hg concentrations were measured in surface sediments collected in 2024 from 63 ponds and in radiometrically dated sediment cores spanning the past ~370 years from three ponds within the Sass–Klewi Nesting Area (SKNA), a subregion of the WCSR. Sediment cores were used to establish a multi-site pre-industrial baseline to quantify enrichment using organic matter–normalized Enrichment Factors (EFs), and to calculate cumulative inventories of excess THg for comparison with other Canadian lakes at near- and far-field distances from major sources of emissions. Concentrations of THg ranged from 2.7 – 45.4 ng g⁻¹ (mean 20.0 ± 9.9 ng g⁻¹, n = 63) in the surface sediments and from 5.4 – 72.0 ng g⁻¹ in the sediment cores. All concentrations are well below the Canadian Interim Sediment Quality Guideline of 170 ng g⁻¹ and indicate that adverse biological effects on aquatic life are unlikely to occur at present and in the past. Strong, positive linear relations between THg concentrations and organic matter (OM) content in sediment deposited before ~1900 at three ponds allowed for construction of a multi-site pre-industrial OM-normalized baseline capable of estimating the amount of enrichment in samples deposited since 1900 in sediment cores each of the three pond and the surface sediment samples from 63 ponds. Peak THg enrichment occurred between ~1960 and ~2010 at the three ponds, when EFs range from 1.5-2.6 and signify ‘minimal enrichment’ based on widely used categorization. After ~2010, EFs decline rapidly to around 1.0, indicating a return to concentrations that existed before 1900. The mean EF for the surface sediment samples collected from 63 ponds in 2024 is 1.03, indicating no enrichment relative to pre-1900 sediment, and EFs exceed the 1.5 threshold for ‘minimal enrichment’ at fewer than 10% of the ponds. The cumulative inventory of excess THg at pond SK 43 falls within a narrow range reported for other lakes in Canada at far-field distances from industrialization and is 700 times lower than a lake at near-field distance from a major point source of Hg emissions (mine and smelter at Flin Flon, Manitoba). These findings suggest that mercury poses little risk of harm to whooping crane and other aquatic organisms. The pre-1900 baselines generated during the research can be used in future sediment quality monitoring to detect whether ongoing climatic and environmental changes begin to elevate THg concentrations above natural levels.
  • Item type: Item ,
    Design, Testing, and Analysis of Advanced Massive MIMO Transmitters
    (University of Waterloo, 2026-05-12) Lim, Jin Gyu
    The evolution of Fifth Generation (5G) wireless systems is driven by the need for higher data rates, lower latency, and improved coverage. Massive Multiple Input, Multiple Output (MIMO) transmitter front-ends have emerged as a key enabling technology, employing multiple parallel transmitter chains—each comprising Digital Signal Processing (DSP), Power Amplifiers (PAs), and antenna elements—to exploit spatial multiplexing and enable multi-user beamforming, at the cost of increased system complexity. The PA is a critical component in each transmitter chain, as its efficiency dominates energy consumption and its output power determines coverage. However, PA performance is highly sensitive to load impedance. While isolators in 4G systems maintain a constant 50 Ω load, their use in massive MIMO arrays is impractical due to cost, size, bandwidth, and integration constraints. Consequently, antenna mismatches and mutual coupling introduce dynamic load variations that degrade PA and overall system performance, motivating a holistic system-level design approach. The first objective of this thesis is to develop tools for system-level analysis of massive MIMO transmitters under realistic excitation. A multidisciplinary co-simulation frame- work integrating DSP, Radio Frequency (RF), and electromagnetic domains is proposed to capture signal processing, PA nonlinearities, and antenna coupling within a unified environment. Experimental validation using a four-channel fully digital MIMO transmit- ter demonstrates accurate prediction of system-level trends. A sixteen-channel testbed is further developed to validate design strategies and capture hardware-specific effects. The second objective is to enable PA design under realistic system conditions. To mitigate the computational complexity of large-scale simulations, an emulation platform is developed that reproduces massive MIMO loading conditions using a single PA. This approach enables efficient characterization and optimization under dynamic impedance environments. Combined with the co-simulation framework, it supports PA design directly at the system level rather than under idealized 50 Ω assumptions. The third objective is to investigate the impact of precoding on PA behavior. Con- ventional Digital Pre-Distortion (DPD) linearizes PAs using uncorrelated signals prior to precoding, implicitly assuming invariant load conditions. However, precoding alters signal correlation and power distribution, thereby modifying the load impedance seen by each PA in the presence of mutual coupling, which degrades linearization performance. To ad- dress this, an alternative architecture is explored in which precoding precedes linearization, enabling improved robustness and reduced DPD complexity under dynamic conditions.
  • Item type: Item ,
    Monolithic Integration of GeTe Switches and BST Varactors for Reconfigurable Millimeter-Wave Devices
    (University of Waterloo, 2026-05-12) Golcheshmeh, Mehran
    Reconfigurable microwave and millimeter-wave systems require tunable components that provide low loss, high resolution, and compact implementation. Conventional approaches based on semiconductor devices, microelectromechanical systems (MEMS), and purely analog or digital tuning techniques face limitations in loss, tuning range, resolution, and integration complexity. Therefore, alternative approaches are needed that can overcome these challenges while remaining compatible with integrated fabrication processes. This thesis presents the development of tunable RF components based on the monolithic integration of ferroelectric barium strontium titanate (BST) varactors and phase-change material (PCM) germanium telluride (GeTe) switches. BST varactors provide continuous analog tuning with low power consumption, while GeTe switches enable discrete, nonvolatile reconfiguration. The combination of these technologies enables a hybrid analog–digital tuning approach that improves tuning range and flexibility. The work begins with the development and optimization of fabrication process for BST thin-film varactors, followed by their application in tunable circuits. A monolithic fabrication process is then developed to integrate BST varactors and GeTe switches. The challenges associated with material compatibility and process conditions are addressed, and both BST and GeTe devices are fabricated and characterized. Using this platform, hybrid analog–digital varactors with enhanced tuning range are demonstrated. Finally, the hybrid tuning approach is applied to the design and implementation of phase shifters, including true-time-delay (TTD) and reflective-type architectures. These designs combine the advantages of analog and digital tuning to achieve improved phase control, compact implementation, and reduced loss compared to conventional approaches. The results presented in this thesis demonstrate the effectiveness of combining BST varactors and GeTe switches for the realization of reconfigurable millimeter-wave components, providing a practical approach for next-generation tunable RF systems.