Design of Speech Enhancement System Based on Microphone Array
dc.contributor.advisor | Ho, Pin-Han | |
dc.contributor.author | Liang, Kairan | |
dc.date.accessioned | 2025-01-09T18:19:13Z | |
dc.date.available | 2025-01-09T18:19:13Z | |
dc.date.issued | 2025-01-09 | |
dc.date.submitted | 2024-12-20 | |
dc.description.abstract | Beamforming is an important speech enhancement technology that can be employed to separate the speech signal from interfering noise. This technique has a decades-long history of application on microphone arrays, ranging from the basic Delay-Sum beamformer to the recently developed CGMM-MVDR beamformer based on iterative optimization algorithm. This work evaluates the capability and stability of different beamforming methods and variously shaped microphone arrays in isolating human speech and enhancing speech intelligibility through simulation experiments. Furthermore, the issues and advantages of each method are discussed by examining the beampattern and the spectrum of the processed signal. In particular, this work discovered a deficiency in the Time-Frequency masking estimation of the CGMM-MVDR method, analyzed the causes of the issue, and proposed a post-processing method to process the mask, significantly improving the final result. In addition, based on the results of the simulation experiments, this work designed and fabricated a prototype that includes a microphone array and data processing devices, presenting a viable solution for implementing the improved CGMM-MVDR beamforming algorithm on a portable device. An Android app was also designed to control the steering direction of the beamformer via gesture interaction, and simple tests were carried out in real-world scenarios. | |
dc.identifier.uri | https://hdl.handle.net/10012/21325 | |
dc.language.iso | en | |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.subject | multi-channel speech enhancement | |
dc.subject | acoustic beamforming | |
dc.title | Design of Speech Enhancement System Based on Microphone Array | |
dc.type | Master Thesis | |
uws-etd.degree | Master of Applied Science | |
uws-etd.degree.department | Electrical and Computer Engineering | |
uws-etd.degree.discipline | Electrical and Computer Engineering | |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.embargo.terms | 0 | |
uws.contributor.advisor | Ho, Pin-Han | |
uws.contributor.affiliation1 | Faculty of Engineering | |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |