Data Driven Neural Speech Enhancement for Smart Healthcare in Consumer Electronics Applications

This paper presents the practical response and performance-aware development of online speech enhancement from a consumer electronic perspective. To improve the efficiency of human-machine interaction, speech can play a vital role as a transmission medium on the Internet of Medical Things (IoM). How...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on consumer electronics Vol. 70; no. 2; pp. 4828 - 4838
Main Authors: Paikrao, Pavan D., Mukherjee, Amrit, Ghosh, Uttam, Goswami, Pratik, Novak, Milan, Kumar Jain, Deepak, Al-Numay, Mohammed S., Narwade, Pradeep
Format: Journal Article
Language:English
Published: New York IEEE 01-05-2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents the practical response and performance-aware development of online speech enhancement from a consumer electronic perspective. To improve the efficiency of human-machine interaction, speech can play a vital role as a transmission medium on the Internet of Medical Things (IoM). However, some intelligent speech recognition systems cannot preserve the confidentiality of speech data. Additionally, the preservation of privacy is onerous, especially for model training and speech recognition in real-time. The recent development of big data-oriented wireless technologies associated with edge computing, interconnected devices of the Internet of Medical Things (IoMT), and big data analytics has great demand for connected human-machine interaction for many applications like automated cars, health monitoring, and consumer personal health care monitoring systems. Although big data-oriented wireless technologies serve these applications, the challenge remains of ignoring emotional care. This paper starts by explaining how to make a neural network-based architecture that can improve the speech of multichannel first-order Ambisonics mixtures and lower the need for human intervention through ambient intelligence (AmI). This will make the system work better overall in medical situations. Second, we demonstrate the effectiveness of different noise estimation techniques on proposed modulation domain processing (MDP) applications in smart hospitals, including electronic medical documentation, disease diagnosis, and evaluation. The proposed approach outperforms the enhancement of the conventional modulation domain in the cortex with several objective evaluation parameters such as Log Likelihood Ratio (LLR), Weighted Spectral Slope (WSS), Perceptual Evaluation of Speech Quality (PESQ), Csig and segmental (SNR seg.) Different noise estimators are used to figure out what effect the system has on different spectral modification parameters, like the over-subtraction factor and the modification domain. The experimental results show that the MDP system achieves better performance in terms of SNRseg. scores (49%) for the state-of-the-art consumer electronics perspective in a health care system. The proposed framework would greatly contribute to personalized communication health monitoring by consumers in a noisy environment.
ISSN:0098-3063
1558-4127
DOI:10.1109/TCE.2024.3387740