The Performance of wearable speech enhancement system under noisy environment: An experimental study

Cherukuru, Pavani and Mustafa, Mumtaz Begum and Subramaniam, Hema A. P. (2022) The Performance of wearable speech enhancement system under noisy environment: An experimental study. IEEE Access, 10. pp. 5647-5659. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2021.3137878.

Full text not available from this repository.

Abstract

Wearable speech enhancement can improve the recognition accuracy of the speech signals in stationary noise environments at 0dB to 60dB signal to noise ratio. Beamforming, adaptive noise reduction, and voice activity detection algorithms are used in wearable speech enhancement systems to enhance speech signals. In recent works, a word rate recognition accuracy of 63% for a 0db signal-to-noise ratio is not satisfactory for a robust speech recognition system. This paper discusses the experimental study using fixed beamforming, adaptive noise reduction, and voice activity detection algorithms with the inclusion of -10dB to 20dB signal to noise ratio for different types of noises to test the wearable speech enhancement system's performance in noisy environments. It also compares deep learning-based noise reduction methods as a benchmark for speech enhancement and word recognition for different noise levels. We have obtained an average word rate recognition accuracy of 5.74% at -10dB and 93.79% at 20dB for non-stationary noisy environments. The outcome of the experiments shows that the selected methods perform significantly better in the environment with high noise dB for both stationary and non-stationary noise. We found that there is no significant statistical difference between the stationary and non-stationary noise word recognition and SNRs level. However, the deep learning-based method performs significantly better than the fixed beamforming, adaptive noise reduction, and voice activity detection algorithms in all noisy levels.

Item Type: Article
Funders: Ministry of Higher Education Malaysia (Fundamental Research Grant Scheme (FRGS)) [FP062-2020]
Uncontrolled Keywords: Speech enhancement; Noise measurement; Array signal processing; Signal to noise ratio; Speech recognition; Filtering algorithms; Noise reduction; Wearable speech enhancement; Beamforming; Adaptive noise reduction; Voice activity detection; Deep learning
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > TA Engineering (General). Civil engineering (General)
T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Computer Science & Information Technology > Department of Software Engineering
Depositing User: Ms. Juhaida Abd Rahim
Date Deposited: 02 Aug 2022 00:57
Last Modified: 02 Aug 2022 00:57
URI: http://eprints.um.edu.my/id/eprint/33532

Actions (login required)

View Item View Item