Now showing items 1-9 of 9

    • Automatic Parallelization for Graphics Processing Units in JikesRVM 

      Leung, Alan Chun Wai (University of Waterloo, 2008-05-23)
      Accelerated graphics cards, or Graphics Processing Units (GPUs), have become ubiquitous in recent years. On the right kinds of problems, GPUs greatly surpass CPUs in terms of raw performance. However, GPUs are currently ...
    • Compiling Data Dependent Control Flow on SIMD GPUs 

      Popa, Tiberiu (University of Waterloo, 2004)
      Current Graphic Processing Units (GPUs) (circa. 2003/2004) have programmable vertex and fragment units. Often these units are implemented as SIMD processors employing parallel pipelines. Data dependent conditional ...
    • An Embedded Shading Language 

      Qin, Zheng (University of Waterloo, 2004)
      Modern graphics accelerators have embedded programmable components in the form of vertex and fragment shading units. Current APIs permit specification of the programs for these components using an assembly-language level ...
    • Flashpoint: A Low-latency Serverless Platform for Deep Learning Inference Serving 

      San Juan, Justin David Quitalig (University of Waterloo, 2023-08-24)
      Recent breakthroughs in Deep Learning (DL) have led to high demand for executing inferences in interactive services such as ChatGPT and GitHub Copilot. However, these interactive services require low-latency inferences, ...
    • Generating Radiosity Maps on the GPU 

      Moreno-Fortuny, Gabriel (University of Waterloo, 2005)
      Global illumination algorithms are used to render photorealistic images of 3D scenes taking into account both direct lighting from the light source and light reflected from other surfaces in the scene. Algorithms based ...
    • Models for Parallel Computation in Multi-Core, Heterogeneous, and Ultra Wide-Word Architectures 

      Salinger, Alejandro (University of Waterloo, 2013-04-26)
      Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a chip being widely available and an increasing number of cores predicted for the future. In addition, the decreasing costs ...
    • Rendering Antialiased Shadows using Warped Variance Shadow Maps 

      Lauritzen, Andrew Timothy (University of Waterloo, 2008-05-06)
      Shadows contribute significantly to the perceived realism of an image, and provide an important depth cue. Rendering high quality, antialiased shadows efficiently is a difficult problem. To antialias shadows, it is necessary ...
    • Solving Hyperbolic PDEs using Accelerator Architectures 

      Rostrup, Scott (University of Waterloo, 2009-07-27)
      Accelerator architectures are used to accelerate the simulation of nonlinear hyperbolic PDEs. Three different architectures, a multicore CPU using threading, IBM’s Cell Processor, and Nvidia’s Tesla GPUs are investigated. ...
    • A Study of Efficiency, Accuracy, and Robustness in Intensity-Based Rigid Image Registration 

      Xu, Lin (University of Waterloo, 2008-09-30)
      Image registration is widely used in different areas nowadays. Usually, the efficiency, accuracy, and robustness in the registration process are concerned in applications. This thesis studies these issues by presenting an ...

      UWSpace

      University of Waterloo Library
      200 University Avenue West
      Waterloo, Ontario, Canada N2L 3G1
      519 888 4883

      All items in UWSpace are protected by copyright, with all rights reserved.

      DSpace software

      Service outages