Audio processing on constrained devices

dc.contributor.authorGupta, Amod
dc.date.accessioned2009-10-29T17:51:09Z
dc.date.available2009-10-29T17:51:09Z
dc.date.issued2009-10-29T17:51:09Z
dc.date.submitted2009-09-28
dc.description.abstractThis thesis discusses the future of smart business applications on mobile phones and the integration of voice interface across several business applications. It proposes a framework that provides speech processing support for business applications on mobile phones. The framework uses Gaussian Mixture Models (GMM) for low-enrollment speaker recognition and limited vocabulary speech recognition. Algorithms are presented for pre-processing of audio signals into different categories and for start and end point detection. A method is proposed for speech processing that uses Mel Frequency Cepstral Coeffcients (MFCC) as primary feature for extraction. In addition, optimization schemes are developed to improve performance, and overcome constraints of a mobile phone. Experimental results are presented for some prototype applications that evaluate the performance of computationally expensive algorithms on constrained hardware. The thesis concludes by discussing the scope for improvement for the work done in this thesis and future directions in which this work could possibly be extended.en
dc.identifier.urihttp://hdl.handle.net/10012/4830
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.subjectSpeech processingen
dc.subjectAudio processing on mobile phonesen
dc.subjectMobile phone applicationsen
dc.subject.programComputer Scienceen
dc.titleAudio processing on constrained devicesen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentSchool of Computer Scienceen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis.pdf
Size:
959.58 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
251 B
Format:
Item-specific license agreed upon to submission
Description: