Adapting Component Analysis

dc.contributor.authorDorri, Fatemeh
dc.date.accessioned2012-05-18T17:30:35Z
dc.date.available2014-04-02T05:00:09Z
dc.date.issued2012-05-18T17:30:35Z
dc.date.submitted2012
dc.description.abstractA main problem in machine learning is to predict the response variables of a test set given the training data and its corresponding response variables. A predictive model can perform satisfactorily only if the training data is an appropriate representative of the test data. This intuition is reflected in the assumption that the training data and the test data are drawn from the same underlying distribution. However, the assumption may not be correct in many applications for various reasons. For example, gathering training data from the test population might not be easily possible, due to its expense or rareness. Or, factors like time, place, weather, etc can cause the difference in the distributions. I propose a method based on kernel distribution embedding and Hilbert Schmidt Independence Criteria (HSIC) to address this problem. The proposed method explores a new representation of the data in a new feature space with two properties: (i) the distributions of the training and the test data sets are as close as possible in the new feature space, (ii) the important structural information of the data is preserved. The algorithm can reduce the dimensionality of the data while it preserves the aforementioned properties and therefore it can be seen as a dimensionality reduction method as well. Our method has a closed-form solution and the experimental results on various data sets show that it works well in practice.en
dc.description.embargoterms1 yearen
dc.identifier.urihttp://hdl.handle.net/10012/6738
dc.language.isoenen
dc.pendingtrueen
dc.publisherUniversity of Waterlooen
dc.subjectDomain adaptationen
dc.subjectKernel embeddingen
dc.subjectHilbert-Schmidt Independence Criteriaen
dc.subjectDimension reductionen
dc.subject.programComputer Scienceen
dc.titleAdapting Component Analysisen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dorri_Fatemeh.pdf
Size:
710.29 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
251 B
Format:
Item-specific license agreed upon to submission
Description: