Learning From Almost No Data

dc.contributor.author: Sucholutsky, Ilia
dc.date.accessioned: 2021-06-15T18:16:46Z
dc.date.available: 2021-06-15T18:16:46Z
dc.date.issued: 2021-06-15
dc.date.submitted: 2021-06-09
dc.description.abstract: The tremendous recent growth in the fields of artificial intelligence and machine learning has largely been tied to the availability of big data and massive amounts of compute. The increasingly popular approach of training large neural networks on large datasets has provided great returns, but it leaves behind the multitude of researchers, companies, and practitioners who do not have access to sufficient funding, compute power, or volume of data. This thesis aims to rectify this growing imbalance by probing the limits of what machine learning and deep learning methods can achieve with small data. What knowledge does a dataset contain? At the highest level, a dataset is just a collection of samples: images, text, etc. Yet somehow, when we train models on these datasets, they are able to find patterns, make inferences, detect similarities, and otherwise generalize to samples that they have never seen before. This suggests that datasets may contain some kind of intrinsic knowledge about the systems or distributions from which they are sampled. Moreover, this knowledge appears to be distributed and duplicated across the samples; we intuitively expect that removing an image from a large training set will have virtually no impact on the final model's performance. We develop a framework that explains efficient generalization in terms of three principles: information sharing, information repackaging, and information injection. We use this framework to propose 'less than one'-shot (LO-shot) learning, an extreme form of few-shot learning in which a learner must recognize N classes from M < N training examples. To achieve this extreme level of efficiency, we develop new framework-consistent methods and theory for lost data restoration, for dataset size reduction, and for few-shot learning with deep neural networks and other popular machine learning models. (A minimal soft-label sketch of the LO-shot idea is given after this record.)
dc.identifier.uri: http://hdl.handle.net/10012/17103
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.relation.uri: https://github.com/ilia10000/dataset-distillation
dc.relation.uri: https://github.com/ilia10000/LO-Shot
dc.subject: deep learning
dc.subject: machine learning
dc.subject: few-shot learning
dc.subject: one-shot learning
dc.subject: dataset reduction
dc.subject: dataset distillation
dc.subject: small data
dc.subject: ML
dc.subject: AI
dc.subject: artificial intelligence
dc.subject: neural networks
dc.subject: NLP
dc.subject: computer vision
dc.subject: optimization
dc.subject: LO-shot learning
dc.title: Learning From Almost No Data
dc.type: Doctoral Thesis
uws-etd.degree: Doctor of Philosophy
uws-etd.degree.department: Statistics and Actuarial Science
uws-etd.degree.discipline: Statistics
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 0
uws.contributor.advisor: Schonlau, Matthias
uws.contributor.affiliation1: Faculty of Mathematics
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text
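
The abstract above defines 'less than one'-shot (LO-shot) learning: recognizing N classes from only M < N training examples. The sketch below is a toy illustration of how this can work in principle, using soft (probabilistic) labels in the spirit of the LO-Shot repository linked in this record. The prototype positions, soft-label values, and the distance-weighted classifier are assumptions invented for the example; they are not the thesis's actual method or code.

# Toy sketch (not the thesis's code): M = 2 soft-labelled points separating N = 3 classes.
# All numbers below are illustrative assumptions.
import numpy as np

prototypes = np.array([[0.0], [1.0]])   # M = 2 training points on a line
soft_labels = np.array([
    [0.6, 0.4, 0.0],                    # point at 0: mostly class 0, partly class 1
    [0.0, 0.4, 0.6],                    # point at 1: mostly class 2, partly class 1
])

def predict(x, sharpness=5.0):
    # Blend the prototypes' soft labels, weighting closer prototypes more heavily.
    dist = np.linalg.norm(prototypes - x, axis=1)
    weights = np.exp(-sharpness * dist)
    probs = (weights[:, None] * soft_labels).sum(axis=0) / weights.sum()
    return int(probs.argmax()), probs

for x in (0.0, 0.5, 1.0):
    label, probs = predict(np.array([x]))
    print(f"x={x:.1f} -> class {label}, probs={np.round(probs, 2)}")

Run as written, this prints class 0 near the left point, class 1 in the middle region (a class with no dedicated training point of its own), and class 2 near the right point: three classes recovered from two labelled examples.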

Files

Original bundle
Name: Sucholutsky_Ilia.pdf
Size: 42.44 MB
Format: Adobe Portable Document Format
Description: PhD thesis

License bundle
Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission
Description: