Search Results - "Rao M V, Achuth"
-
1
Automatic Identification of Speakers From Head Gestures in a Narration
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…In this work, we focus on quantifying speaker identity information encoded in the head gestures of speakers, while they narrate a story. We hypothesize that…”
Get full text
Conference Proceeding -
2
Automatic Classification of Healthy Subjects and Patients With Essential Vocal Tremor Using Probabilistic Source-Filter Model Based Noise Robust Pitch Estimation
Published in Journal of voice (01-05-2023)“…Essential voice tremor (EVT) is a voice disorder resulting from dyscoordination within the laryngeal musculature. A low-frequency fluctuations of fundamental…”
Get full text
Journal Article -
3
TLU-Net: A Deep Learning Approach for Automatic Steel Surface Defect Detection
Published in 2021 International Conference on Applied Artificial Intelligence (ICAPAI) (19-05-2021)“…Visual steel surface defect detection is an essential step in steel sheet manufacturing. Several machine learning-based automated visual inspection (AVI)…”
Get full text
Conference Proceeding -
4
Two step convolutional neural network for automatic glottis localization and segmentation in stroboscopic videos
Published in Biomedical optics express (01-08-2020)“…Precise analysis of the vocal fold vibratory pattern in a stroboscopic video plays a key role in the evaluation of voice disorders. Automatic glottis…”
Get full text
Journal Article -
5
Effect of source filter interaction on isolated vowel-consonant-vowel perception
Published in The Journal of the Acoustical Society of America (01-08-2018)“…Source-filter interaction explains the drop in pitch in voiced consonant due to constriction in the vocal tract during vowel-consonant-vowel (VCV) production…”
Get full text
Journal Article -
6
SFNet: A Computationally Efficient Source Filter Model Based Neural Speech Synthesis
Published in IEEE signal processing letters (2020)“…Recently, neural speech synthesizers have achieved a high-quality synthesis for text-to-speech applications, but a real-time synthesis is possible only in the…”
Get full text
Journal Article -
7
Trend Statistics Network and Channel invariant EEG Network for sleep arousal study
Published in 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (01-07-2019)“…Sleep is a very important part of life. Lack of sleep or sleep disorder can cause a negative impact on day to day life and can have long term serious…”
Get full text
Conference Proceeding Journal Article -
8
Automatic Classification of Volumes of Water Using Swallow Sounds from Cervical Auscultation
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…The signatures of swallowing vary depending on the volume of bolus swallowed. Among existing instrumental methods, cervical auscultation (CA) captures the…”
Get full text
Conference Proceeding -
9
Glottal Inverse Filtering Using Probabilistic Weighted Linear Prediction
Published in IEEE/ACM transactions on audio, speech, and language processing (01-01-2019)“…Glottal inverse filtering is a noninvasive method for getting the glottal flow estimate from the speech. In this paper, we propose a method for glottal inverse…”
Get full text
Journal Article -
10
Automatic Native Language Identification Using Novel Acoustic and Prosodic Feature Selection Strategies
Published in 2018 15th IEEE India Council International Conference (INDICON) (01-12-2018)“…We consider the problem of automatic identification of native language (L1) of non-native English (L2) speakers from eleven L1 backgrounds. Analyzing the…”
Get full text
Conference Proceeding -
11
Formant-gaps Features for Speaker Verification Using Whispered Speech
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)“…In this work, we propose a new feature based on formants for whispered speaker verification (SV) task, where neutral data is used for enrollment and whispered…”
Get full text
Conference Proceeding -
12
SegNet-Based Deep Representation Learning for Dysphagia Classification
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Swallowing disorders, broadly known as Dysphagia, are difficulties in the process of swallowing food. Many currently available methods for classifying healthy…”
Get full text
Conference Proceeding -
13
Impact of Speaking Rate on the Source Filter Interaction in Speech: A Study
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06-06-2021)“…Source filter interaction (SFI) explains the drop in pitch caused due to the constriction in the vocal tract during voiced consonant production in a…”
Get full text
Conference Proceeding -
14
Pseudo Likelihood Correction Technique for Low Resource Accented ASR
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…With the availability of large data, ASRs perform well on native English but poorly for non-native English data. Training nonnative ASRs or adapting a native…”
Get full text
Conference Proceeding -
15
P- and T-wave delineation in ECG signals using parametric mixture Gaussian and dynamic programming
Published in Biomedical signal processing and control (01-05-2019)“…•Detection and tracking of the P- and T-waves using two mixture Gaussian function and the Dynamic programming are proposed.•A key feature of the proposed…”
Get full text
Journal Article -
16
PSFM-A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection
Published in IEEE/ACM transactions on audio, speech, and language processing (01-09-2018)“…Accurate estimation of glottal closure instant (GCI) enables several pitch synchronous speech analysis, such as prosody modifications, glottal inverse…”
Get full text
Journal Article -
17
A study on native American English speech recognition by Indian listeners with varying word familiarity level
Published 08-12-2021“…In this study, listeners of varied Indian nativities are asked to listen and recognize TIMIT utterances spoken by American speakers. We have three kinds of…”
Get full text
Journal Article -
18
A Study on Native American English Speech Recognition by Indian Listeners with Varying Word Familiarity Level
Published in 2021 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA) (18-11-2021)“…In this study, listeners of varied Indian nativities are asked to listen and recognize TIMIT utterances spoken by American speakers. We have three kinds of…”
Get full text
Conference Proceeding -
19
TLU-Net: A Deep Learning Approach for Automatic Steel Surface Defect Detection
Published 18-01-2021“…International Conference on Applied Artificial Intelligence (ICAPAI 2021), Halden, Norway, May 19-21, 2021 Visual steel surface defect detection is an…”
Get full text
Journal Article -
20
Pitch prediction from Mel-generalized cepstrum - a computationally efficient pitch modeling approach for speech synthesis
Published in 2017 25th European Signal Processing Conference (EUSIPCO) (01-08-2017)“…Text-to-speech (TTS) systems are often used as part of the user interface in wearable devices. Due to limited memory and computational/battery power in…”
Get full text
Conference Proceeding