A mathematical foundation for the use of cliques in the exploration of data with navigation graphs

dc.contributor.authorShuldiner, Pavel
dc.date.accessioned2023-01-19T19:18:44Z
dc.date.available2023-01-19T19:18:44Z
dc.date.issued2023-01-19
dc.date.submitted2023-01-18
dc.description.abstractNavigation graphs were introduced by Hurley and Oldford (2011) as a graph-theoretic framework for exploring data sets, particularly those with many variables. They allow the user to visualize one small subset of the variables and then proceed to another subset, which shares a few of the original variables, via a smooth transition. These graphs serve as both a high level overview of the dataset as well as a tool for a first-hand exploration of regions deemed interesting. This work examines the nature of cliques in navigation graphs, both in terms of type and magnitude, and speculates as to what their significance to the underlying dataset might be. The questions answered by this body of work were motivated by the belief that the presence of cliques in navigation graphs is a potential indicator for the existence of an interesting, possibly unanticipated, relationship among some of the variables. In this thesis we provide a detailed examination of cliques in navigation graphs, both in terms of type, size and number. The study of types of cliques informs us of the potential significance of highly connected structures to the underlying data and guides our approach for examining the possible clique sizes and counts. On the other hand, the prevalence of large clique sizes and counts is suggestive of an interesting, possibly unexpected, relationship between the variates in the data. To address the challenges surrounding the nature of cliques in navigation graphs, we develop a framework for the derivation of closed-form expressions for the moments of count random variables in terms of their underlying indecomposable summands is established. We use this framework in conjunction with a connection between intersecting set families to obtain edge counts within a clique cover and thus, obtain closed-form expressions for the moments of clique counts in random graphs.en
dc.identifier.urihttp://hdl.handle.net/10012/19082
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subjectstatisticsen
dc.subjectprobabilityen
dc.subjectcombinatoricsen
dc.subjectrandom graphsen
dc.subjectnetwork theoryen
dc.subjectexploratory data analysisen
dc.titleA mathematical foundation for the use of cliques in the exploration of data with navigation graphsen
dc.typeDoctoral Thesisen
uws-etd.degreeDoctor of Philosophyen
uws-etd.degree.departmentStatistics and Actuarial Scienceen
uws-etd.degree.disciplineStatisticsen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0en
uws.contributor.advisorOldford, Wayne
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Shuldiner_Pavel.pdf
Size:
3.97 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: