Browsing Mathematics (Faculty of) by Supervisor "Al-Kiswany, Samer"
An Analysis of Network-Partitioning Failures in Cloud Systems
(University of Waterloo, 2018-12-10) We present a comprehensive study of 136 system failures attributed to network-partitioning faults from 25 widely used distributed systems. We found that the majority of the failures led to catastrophic effects, such as ...
An Analysis of Partial Network Partitioning Failures in Modern Distributed Systems
(University of Waterloo, 2020-01-03) We present a comprehensive study of system failures from 12 popular systems caused by a peculiar type of network partitioning faults: partial partitions. Partial partitions isolate a set of nodes from some, but not all, ...
In-Network Scheduling for Real-Time Analytics
(University of Waterloo, 2021-04-30) This thesis presents Bolt, a novel scheduler design for large-scale real-time data analytics. Bolt achieves the scheduling accuracy of modern centralized schedulers while supporting clusters with hundreds of thousands of ...
Leveraging Software-Defined Networking to Mask Partial Network Partitions
(University of Waterloo, 2021-08-11) We present an extensive study focused on partial network partitioning. Partial network partitions disrupt the communication between some but not all nodes in a cluster. First, we conduct a comprehensive study of system ...
A Network Integrated Design for Micro-scale Cloud Services
(University of Waterloo, 2022-05-25) We present Falcon, a network-integrated scheduler for micro-scale services. Falcon follows a centralized scheduler design to achieve high scheduling efficiency and leverages modern programmable switches to lower the scheduling ...
Network-Accelerated Linearizable Reads
(University of Waterloo, 2019-08-07) This thesis presents FLAIR, a novel approach for accelerating read operations in leader-based consensus protocols. FLAIR leverages the capabilities of the new generation of programmable switches to serve reads from follower ...
Optimizing MPI Collective Operations for Cloud Deployments
(University of Waterloo, 2020-01-24) Cloud infrastructures are increasingly being adopted as a platform for high-performance computing (HPC) science and engineering applications. For HPC applications, the Message-Passing Interface (MPI) is widely used. Among ...