Show simple item record

dc.contributor.authorFeng, Guoyao
dc.date.accessioned2016-08-10 18:13:37 (GMT)
dc.date.available2016-08-10 18:13:37 (GMT)
dc.date.issued2016-08-10
dc.date.submitted2016-08-02
dc.identifier.urihttp://hdl.handle.net/10012/10620
dc.description.abstractIn this thesis we present SIRUM: a system for Scalable Informative RUle Mining from multi-dimensional data. Informative rules have recently been studied in several contexts, including data summarization, data cube exploration and data quality. The objective is to produce a concise set of rules (patterns) over the values of the dimension attributes that provide the most information about the distribution of a numeric measure attribute. SIRUM optimizes this task for big, wide and distributed datasets. We implemented SIRUM in Spark and observed significant performance improvements on real data due to our optimizations.en
dc.language.isoenen
dc.publisherUniversity of Waterlooen
dc.subjectInformative Rule Miningen
dc.subjectScalable Data Processing Systemsen
dc.titleScalable Informative Rule Miningen
dc.typeMaster Thesisen
dc.pendingfalse
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.degreeMaster of Mathematicsen
uws.contributor.advisorGolab, Lukasz
uws.contributor.advisorKeshav, Srinivasan
uws.contributor.affiliation1Faculty of Mathematicsen
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.typeOfResourceTexten
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record


UWSpace

University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.

DSpace software

Service outages