Search Results - "Reddy M, Gurunath"
1
Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method
Published in Circuits, systems, and signal processing (01-07-2018)“…In this paper, a time-domain adaptive filtering-based melody extraction method is proposed. The proposed method works in multiple stages to extract the vocal…”
Journal Article -
2
Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics
Published in 2022 IEEE International Symposium on Multimedia (ISM) (01-12-2022)“…We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way…”
Conference Proceeding -
3
hf0: A Hybrid Pitch Extraction Method for Multimodal Voice
Published in Circuits, systems, and signal processing (2021)“…Pitch or fundamental frequency (f0) estimation is a fundamental problem extensively studied for its potential speech and clinical applications. The existing…”
Journal Article -
4
Predominant melody extraction from vocal polyphonic music signal by combined spectro-temporal method
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2016)“…A combined spectro-temporal based method is proposed to derive the predominant melody from vocal polyphonic music signals. In the proposed method, vocal…”
Conference Proceeding, Journal Article -
5
Neutral to Joyous Happy Emotion Conversion
Published in 2017 14th IEEE India Council International Conference (INDICON) (01-12-2017)“…A method to convert neutrally synthesised speech to joyous happy emotion by prosody modification and laughter synthesis is presented. Modified zero frequency…”
Conference Proceeding -
6
hf0: A Hybrid Pitch Extraction Method for Multimodal Voice
Published in Circuits, systems, and signal processing (01-01-2021)
Journal Article -
7
Neutral to happy emotion conversion by blending prosody and laughter
Published in 2015 Eighth International Conference on Contemporary Computing (IC3) (01-08-2015)“…In this paper, we propose a method to convert synthesised neutral speech to happy emotion speech by prosody modification and synthesizing laughter sequence…”
Conference Proceeding -
8
Multi-stage children story speech synthesis for Hindi
Published in 2015 Eighth International Conference on Contemporary Computing (IC3) (01-08-2015)“…In this paper, we propose a multi-stage children story speech synthesis system for Hindi language. The proposed system performs the following tasks: (i)…”
Conference Proceeding -
9
Improvement of phone recognition accuracy using source and system features
Published in 2015 International Conference on Signal Processing and Communication Engineering Systems (01-01-2015)“…The goal of this work is to improve phone recognition accuracy using combination of source and system features. As speech is produced by exciting time varying…”
Conference Proceeding -
10
Two-stage phone recognition system using articulatory and spectral features
Published in 2015 International Conference on Signal Processing and Communication Engineering Systems (01-01-2015)“…In this paper, we propose a two-stage phone recognition system using articulatory and spectral features. In the first stage, articulatory features are…”
Conference Proceeding -
11
Automatic pitch accent contour transcription for Indian languages
Published in 2015 International Conference on Computer, Communication and Control (IC4) (01-09-2015)“…In this paper, an automatic method to transcribe the pitch accent contour from the speech signal is presented. Pitch contour transcription refers to the…”
Conference Proceeding -
12
Telugu emotional story speech synthesis using SABLE markup language
Published in 2015 International Conference on Signal Processing and Communication Engineering Systems (01-01-2015)“…In this paper, a framework for synthesizing Telugu emotional speech for story telling applications is presented. An XML based markup language, SABLE is used to…”
Conference Proceeding -
13
Designing prosody rule-set for converting neutral TTS speech to storytelling style speech for Indian languages: Bengali, Hindi and Telugu
Published in 2014 Seventh International Conference on Contemporary Computing (IC3) (01-08-2014)“…This paper provides a design of prosody rule-set for transforming the neutral speech synthesized by Text-to-Speech (TTS) system to storytelling style speech…”
Conference Proceeding -
14
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images
Published 31-05-2024“…Vision-language models have emerged as a powerful tool for previously challenging multi-modal classification problem in the medical domain. This development…”
Journal Article -
15
Melody Extraction from Polyphonic Music by Deep Learning Approaches: A Review
Published 02-02-2022“…Melody extraction is a vital music information retrieval task among music researchers for its potential applications in education pedagogy and the music…”
Journal Article -
16
Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics
Published 22-01-2023“…We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way…”
Journal Article -
17
One-shot Localization and Segmentation of Medical Images with Foundation Models
Published 28-10-2023“…Recent advances in Vision Transformers (ViT) and Stable Diffusion (SD) models with their ability to capture rich semantic features of the image have been used…”
Journal Article -
18
hf0: A hybrid pitch extraction method for multimodal voice
Published 22-04-2019“…Pitch or fundamental frequency (f0) extraction is a fundamental problem studied extensively for its potential applications in speech and clinical applications…”
Journal Article -
19
Knowledge Distillation for Singing Voice Detection
Published 09-11-2020“…Singing Voice Detection (SVD) has been an active area of research in music information retrieval (MIR). Currently, two deep neural network-based methods, one…”
Journal Article -
20
Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning
Published 25-11-2018“…In this paper, we propose a classification based glottal closure instants (GCI) detection from pathological acoustic speech signal, which finds many…”
Journal Article