Distributions in Semantic Space

Selby, Kira

dc.contributor.advisor	Poupart, Pascal
dc.contributor.author	Selby, Kira
dc.date.accessioned	2024-04-26T18:41:46Z
dc.date.available	2024-04-26T18:41:46Z
dc.date.issued	2024-04-26
dc.date.submitted	2024-04-22
dc.description.abstract	This thesis investigates the powerful and flexible applications of analyzing empirical distributions of vectors within latent spaces. These methods have historically been applied with great success to word embeddings, leading to improved robustness against polysemy, unsupervised inference of hierarchical relationships between words, and results that shattered existing benchmarks on unsupervised translation. This work extends these lines of inquiry, focusing on two key areas of further research: a) probabilistic approaches to robustness in natural language; b) approximating general distance functions between distributions in order to infer general hierarchical relationships between words from their distributions over contexts. Motivated by these initial research directions, the resulting investigations demonstrate novel and significant contributions to a diverse range of problems across many fields and domains, far beyond the narrow scope of word embeddings. The key contributions of this work are threefold: 1. Proposing a probabilistic, model-agnostic framework for robustness in natural language models. The proposed model improves performance on a wide range of downstream tasks compared to existing baselines. 2. Constructing a general architecture for modelling distance functions between multiple permutation-invariant sets. The proposed architecture is proved to be a universal approximator for all partially permutation-invariant functions and outperforms all existing baselines on a number of set-based tasks, as well as on approximating distance functions such as KL divergence and mutual information. 3. Leveraging this architecture to define a novel, set-based approach to few-shot image generation. The proposed approach outperforms all existing image-to-image baselines without making restrictive assumptions about the structure of the training and evaluation sets that might limit its ability to generalize, making it a promising candidate for scaling to true zero-shot generation.	en
dc.identifier.uri	http://hdl.handle.net/10012/20506
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	machine learning	en
dc.subject	deep learning	en
dc.subject	generative models	en
dc.subject	natural language processing	en
dc.title	Distributions in Semantic Space	en
dc.type	Doctoral Thesis	en
uws-etd.degree	Doctor of Philosophy	en
uws-etd.degree.department	David R. Cheriton School of Computer Science	en
uws-etd.degree.discipline	Computer Science	en
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0	en
uws.contributor.advisor	Poupart, Pascal
uws.contributor.affiliation1	Faculty of Mathematics	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Name: Selby_Kira.pdf
Size: 4.7 MB
Format: Adobe Portable Document Format
Description: Revised thesis

License bundle

Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission
Description:

Collections

Theses
Computer Science