Now showing items 1-6 of 6
A Statistical Analysis of the Aggregation of Crowdsourced Labels
(University of Waterloo, 2015-10-29)
Crowdsourcing, due to its inexpensive and timely nature, has become a popular method of collecting data that is difficult for computers to generate. We focus on using this method of human computation to gather labels for ...
Evaluating Information Retrieval Systems With Multiple Non-Expert Assessors
(University of Waterloo, 2013-08-21)
Many current test collections require the use of expert judgments during construction. The true label of each document is given by an expert assessor. However, the cost and effort associated with expert training and judging ...
Unsupervised Spectral Ranking For Anomaly Detection
(University of Waterloo, 2014-09-10)
Anomaly detection is the problem of finding deviations from expected normal patterns. A wide variety of applications, such as fraud detection for credit cards and insurance, medical image monitoring, network intrusion ...
Finding Cost-Efficient Decision Trees
(University of Waterloo, 2014-08-06)
Decision trees have been a popular machine learning technique for some time. Labelled data, examples each with a vector of values in a feature space, are used to create a structure that can assign a class to unseen examples ...
Lyrics Matter: Using Lyrics to Solve Music Information Retrieval Tasks
(University of Waterloo, 2015-08-06)
Music Information Retrieval (MIR) research tends to focus on audio features like melody and timbre of songs while largely ignoring lyrics. Lyrics and poetry adhere to a specific rhyme and meter structure which set them ...
StyleCounsel: Seeing the (Random) Forest for the Trees in Adversarial Code Stylometry
(University of Waterloo, 2018-01-12)
Authorship attribution has piqued the interest of scholars for centuries, but had historically remained a matter of subjective opinion, based upon examination of handwriting and the physical document. Midway through the ...