Serverless Data Analytics with Flint

Kim, Youngbin

Serverless Data Analytics with Flint

dc.contributor.advisor	Lin, Jimmy
dc.contributor.author	Kim, Youngbin
dc.date.accessioned	2018-08-30T14:10:39Z
dc.date.available	2018-08-30T14:10:39Z
dc.date.issued	2018-08-30
dc.date.submitted	2018-08-13
dc.description.abstract	Serverless architectures organized around loosely-coupled function invocations represent an emerging design for many applications. Recent work mostly focuses on user-facing products and event-driven processing pipelines. In this thesis, we explore a completely different part of the application space and examine the feasibility of analytical processing on big data using a serverless architecture. We present Flint, a prototype Spark execution engine that takes advantage of AWS Lambda to provide a pure pay-as-you-go cost model. With Flint, a developer uses PySpark exactly as before, but without needing a Spark cluster and only paying for the execution of individual Spark programs. We describe the design, implementation, and performance of Flint, along with the challenges associated with serverless analytics.	en
dc.identifier.uri	http://hdl.handle.net/10012/13681
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.title	Serverless Data Analytics with Flint	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Mathematics	en
uws-etd.degree.department	David R. Cheriton School of Computer Science	en
uws-etd.degree.discipline	Computer Science	en
uws-etd.degree.grantor	University of Waterloo	en
uws.contributor.advisor	Lin, Jimmy
uws.contributor.affiliation1	Faculty of Mathematics	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Kim_Youngbin.pdf
Size:: 2.43 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.08 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Computer Science