Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics

Hirjee, Hussein

Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics

dc.comment.hidden	I don't need written permission to include the work which has already been published.	en
dc.contributor.author	Hirjee, Hussein
dc.date.accessioned	2010-08-30T20:37:51Z
dc.date.available	2010-08-30T20:37:51Z
dc.date.issued	2010-08-30T20:37:51Z
dc.date.submitted	2010
dc.description.abstract	While text Information Retrieval applications often focus on extracting semantic features to identify the topic of a document, and Music Information Research tends to deal with melodic, timbral or meta-tagged data of songs, useful information can be gained from surface-level features of musical texts as well. This is especially true for texts such as song lyrics and poetry, in which the sound and structure of the words is important. These types of lyrical verse usually contain regular and repetitive patterns, like the rhymes in rap lyrics or the meter in metrical poetry. The existence of such patterns is not always categorical, as there may be a degree to which they appear or apply in any sample of text. For example, rhymes in hip hop are often imperfect and vary in the degree to which their constituent parts differ. Although a definitive decision as to the existence of any such feature cannot always be made, large corpora of known examples can be used to train probabilistic models enumerating the likelihood of their appearance. In this thesis, we apply likelihood-based methods to identify and characterize patterns in lyrical verse. We use a probabilistic model of mishearing in music to resolve misheard lyric search queries. We then apply a probabilistic model of rhyme to detect imperfect and internal rhymes in rap lyrics and quantitatively characterize rappers' styles in their use. Finally, we compute likelihoods of prosodic stress in words to perform automated scansion of poetry and compare poets' usage of and adherence to meter. In these applications, we find that likelihood-based methods outperform simpler, rule-based models at finding and quantifying lyrical features in text.	en
dc.identifier.uri	http://hdl.handle.net/10012/5419
dc.language.iso	en	en
dc.pending	false	en
dc.publisher	University of Waterloo	en
dc.subject	information retrieval	en
dc.subject	music	en
dc.subject	lyrics	en
dc.subject	hip hop	en
dc.subject	rap	en
dc.subject	rhyme	en
dc.subject	misheard	en
dc.subject	mondegreen	en
dc.subject	phonetic similarity	en
dc.subject	scansion	en
dc.subject	poetry	en
dc.subject	meter	en
dc.subject.program	Computer Science	en
dc.title	Rhyme, Rhythm, and Rhubarb: Using Probabilistic Methods to Analyze Hip Hop, Poetry, and Misheard Lyrics	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Mathematics	en
uws-etd.degree.department	School of Computer Science	en
uws.peerReviewStatus	Unreviewed	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Hirjee_Hussein.pdf
Size:: 961.47 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 251 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Computer Science