Now showing items 1-7 of 7

    • An Analysis of Network-Partitioning Failures in Cloud Systems 

      Alquraan, Ahmed (University of Waterloo, 2018-12-10)
      We present a comprehensive study of 136 system failures attributed to network-partitioning faults from 25 widely used distributed systems. We found that the majority of the failures led to catastrophic effects, such as ...
    • An Analysis of Partial Network Partitioning Failures in Modern Distributed Systems 

      Alfatafta, Mohammed (University of Waterloo, 2020-01-03)
      We present a comprehensive study of system failures from 12 popular systems caused by a peculiar type of network partitioning faults: partial partitions. Partial partitions isolate a set of nodes from some, but not all, ...
    • In-Network Scheduling for Real-Time Analytics 

      Udayashankar, Sreeharsha (University of Waterloo, 2021-04-30)
      This thesis presents Bolt, a novel scheduler design for large-scale real-time data analytics. Bolt achieves the scheduling accuracy of modern centralized schedulers while supporting clusters with hundreds of thousands of ...
    • Leveraging Software-Defined Networking to Mask Partial Network Partitions 

      Alkhatib, Basil (University of Waterloo, 2021-08-11)
      We present an extensive study focused on partial network partitioning. Partial network partitions disrupt the communication between some but not all nodes in a cluster. First, we conduct a comprehensive study of system ...
    • A Network Integrated Design for Micro-scale Cloud Services 

      Abdel-hadi, Ashraf (University of Waterloo, 2022-05-25)
      We present Falcon, network-integrated scheduler for micro-scale services. Falcon follows a centralized scheduler design to achieve high scheduling efficiency and leverages modern programmable switches to lower the scheduling ...
    • Network-Accelerated Linearizable Reads 

      Takruri, Hatem (University of Waterloo, 2019-08-07)
      This thesis present FLAIR, a novel approach for accelerating read operations in leader-based consensus protocols. FLAIR leverages the capabilities of the new generation of programmable switches to serve reads from follower ...
    • Optimizing MPI Collective Operations for Cloud Deployments 

      AlSader, Zuhair (University of Waterloo, 2020-01-24)
      Cloud infrastructures are increasingly being adopted as a platform for high performance computing (HPC) science and engineering applications. For HPC applications, the Message-Passing Interface (MPI) is widely-used. Among ...


      University of Waterloo Library
      200 University Avenue West
      Waterloo, Ontario, Canada N2L 3G1
      519 888 4883

      All items in UWSpace are protected by copyright, with all rights reserved.

      DSpace software

      Service outages