Search Results - "Bellegarda, J.R."
-
1
Exploiting latent semantic information in statistical language modeling
Published in Proceedings of the IEEE (01-08-2000)“…Statistical language models used in large-vocabulary speech recognition must properly encapsulate the various constraints, both local and global, present in…”
Get full text
Journal Article -
2
A global, boundary-centric framework for unit selection text-to-speech synthesis
Published in IEEE transactions on audio, speech, and language processing (01-05-2006)“…The level of quality that can be achieved by modern concatenative text-to-speech synthesis heavily depends on the optimization criteria used in the unit…”
Get full text
Journal Article -
3
Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis
Published in IEEE transactions on audio, speech, and language processing (01-01-2008)“…The level of quality that can be attained in concatenative text-to-speech (TTS) synthesis is primarily governed by the inventory of units used in unit…”
Get full text
Journal Article -
4
Large vocabulary speech recognition with multispan statistical language models
Published in IEEE transactions on speech and audio processing (01-01-2000)“…Multispan language modeling refers to the integration of various constraints, both local and global, present in the language. It was recently proposed to…”
Get full text
Journal Article -
5
Statistical prosodic modeling: from corpus design to parameter estimation
Published in IEEE transactions on speech and audio processing (01-01-2001)“…The increasing availability of carefully designed and collected speech corpora opens up new possibilities for the statistical estimation of formal multivariate…”
Get full text
Journal Article -
6
Natural language spoken interface control using data-driven semantic inference
Published in IEEE transactions on speech and audio processing (01-05-2003)“…Spoken interaction tasks are typically approached using a formal grammar as language model. While ensuring good system performance, this imposes a rigid…”
Get full text
Journal Article -
7
Further analysis of LSM-based unit pruning forunit selection TTS
Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2008)“…The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection…”
Get full text
Conference Proceeding -
8
The hit array: an analysis formalism for multiple access frequency hop coding
Published in IEEE transactions on aerospace and electronic systems (01-01-1991)“…A formalism is presented for the analysis of general frequency hop waveforms, such as those suitable for use in coherent active radar and sonar echolocation…”
Get full text
Journal Article -
9
Lsm-Based Boundary Training for Concatenative Speech Synthesis
Published in 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (2006)“…The level of quality that can be achieved in concatenative text-to-speech synthesis depends, among other things, on a judicious chiseling of the inventory used…”
Get full text
Conference Proceeding -
10
A novel approach to part-of-speech tagging based on latent analogy
Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2008)“…Part-of-speech tagging is a necessary pre-processing step for many natural language tasks. Recent statistical approaches, such as conditional random fields,…”
Get full text
Conference Proceeding -
11
Globally Optimal Training of Unit Boundaries in Unit Selection Text-to-Speech Synthesis
Published in IEEE transactions on audio, speech, and language processing (01-03-2007)“…The level of quality that can be achieved by modern concatenative text-to-speech synthesis heavily depends on a judicious composition of the unit inventory…”
Get full text
Journal Article -
12
Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
Published in 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03) (2003)“…Automatic, data-driven grapheme-to-phoneme conversion is a challenging but often necessary task. The top-down strategy implicitly followed by traditional…”
Get full text
Conference Proceeding -
13
Latent semantic mapping [information retrieval]
Published in IEEE signal processing magazine (01-09-2005)“…This article has described LSM, a data-driven framework for modeling globally meaningful relationships implicit in large volumes of data. LSM generalizes a…”
Get full text
Magazine Article -
14
Speech recognition experiments using multi-span statistical language models
Published in 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258) (1999)“…A multi-span framework was proposed to integrate the various constraints, both local and global, that are present in the language. In this approach, local…”
Get full text
Conference Proceeding -
15
Language-independent, short-enrollment voice verification over a far-field microphone
Published in 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221) (2001)“…An approach is presented for the dual verification of speaker identity and verbal content in a text-dependent voice authentication system. The application…”
Get full text
Conference Proceeding -
16
Exploiting both local and global constraints for multi-span statistical language modeling
Published in Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181) (1998)“…A new framework is proposed to integrate the various constraints, both local and global, that are present in the language. Local constraints are captured via…”
Get full text
Conference Proceeding -
17
Using a sigmoid transformation for improved modeling of phoneme duration
Published in 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258) (1999)“…The "sums-of-products" approach has emerged as one of the most promising avenues to model contextual influences on phoneme duration. The associated regression…”
Get full text
Conference Proceeding -
18
Congruential frequency hop signals for multi-user environments: a comparative analysis
Published in International Conference on Acoustics, Speech, and Signal Processing (1990)“…Frequency-hop pulse train signals used in applications such as coherent multiuser of multibeam echolocation must be selected on the basis of good…”
Get full text
Conference Proceeding -
19
Time-frequency properties of extended quadratic congruential frequency hop signals
Published in International Conference on Acoustics, Speech, and Signal Processing (1989)“…Frequency-hop pulse train signals, used in applications such as coherent multiuser or multibeam echo location, must be selected on the basis of good…”
Get full text
Conference Proceeding -
20
A fast statistical mixture algorithm for on-line handwriting recognition
Published in IEEE transactions on pattern analysis and machine intelligence (01-12-1994)“…The automatic recognition of online handwriting is considered from an information theoretic viewpoint. Emphasis is placed on the recognition of unconstrained…”
Get full text
Journal Article