Private Distribution Learning with Public Data
dc.contributor.author | Bie, Alex | |
dc.date.accessioned | 2024-01-22T14:21:11Z | |
dc.date.available | 2024-01-22T14:21:11Z | |
dc.date.issued | 2024-01-22 | |
dc.date.submitted | 2024-01-16 | |
dc.description.abstract | We study the problem of private distribution learning with access to public data. In this setup, a learner is given both public and private samples drawn from an unknown distribution 𝑝 belonging to a class 𝑄, and has the task of outputting an estimate of 𝑝 while adhering to privacy constraints (here, pure differential privacy) only with respect to the private samples. Our setting is motivated by the privacy-utility tradeoff: algorithms satisfying the mathematical definition of differential privacy offer provable privacy guarantees for the data they operate on, however, owing to such a constraint, exhibit degraded accuracy. In particular, there are classes 𝑄 where learning is possible when privacy is not a concern, but for which any algorithm satisfying the constraint of pure differential privacy will fail on. We show that in several scenarios, we can use a small amount of public data to evade such impossibility results. Additionally, we complement these positive results with an analysis of how much public data is necessary to see such improvements. Our main result is that to learn the class of all Gaussians in ℝᵈ under pure differential privacy, 𝑑+1 public samples suffice while 𝑑 public samples are necessary. | en |
dc.identifier.uri | http://hdl.handle.net/10012/20254 | |
dc.language.iso | en | en |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.subject | differential privacy | en |
dc.subject | machine learning | en |
dc.subject | density estimation | en |
dc.subject | theory of machine learning | en |
dc.title | Private Distribution Learning with Public Data | en |
dc.type | Master Thesis | en |
uws-etd.degree | Master of Mathematics | en |
uws-etd.degree.department | David R. Cheriton School of Computer Science | en |
uws-etd.degree.discipline | Computer Science | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.embargo.terms | 0 | en |
uws.contributor.advisor | Kamath, Gautam | |
uws.contributor.advisor | Ben-David, Shai | |
uws.contributor.affiliation1 | Faculty of Mathematics | en |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |