Search Results - "Picheny, M.A."
1
The IBM expressive text-to-speech synthesis system for American English
Published in IEEE transactions on audio, speech, and language processing (01-07-2006)“…Expressive text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions which…”
Journal Article
2
Decision trees for phonological rules in continuous speech
Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)“…The authors present an automatic method for modeling phonological variation using decision trees. For each phone they construct a decision tree that specifies…”
Conference Proceeding
3
Towards Pooled-Speaker Concatenative Text-to-Speech
Published in 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (2006)“…In this paper we explore the merging of data from various speakers in building a concatenative text-to-speech system. First, we investigate the pooling of data…”
Conference Proceeding
4
On a model-robust training method for speech recognition
Published in IEEE transactions on acoustics, speech, and signal processing (01-09-1988)“…Training methods for designing better decoders are compared. The training problem is considered as a statistical parameter estimation problem. In particular,…”
Journal Article
5
Robust methods for using context-dependent features and models in a continuous speech recognizer
Published in Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing (1994)“…In this paper we describe the method we use to derive acoustic features that reflect some of the dynamics of frame-based parameter vectors. Models for such…”
Conference Proceeding
6
Rapid likelihood calculation of subspace clustered Gaussian components
Published in 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100) (2000)“…In speech recognition systems, computing the likelihoods of the acoustic models is an intensive task. One approach to reduce this cost is to use subspace…”
Conference Proceeding
7
Context dependent phonetic duration models for decoding conversational speech
Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)“…Conversational speech provides a particularly difficult task for speech recognition. It provides much more variability than either dictation, read speech, or…”
Conference Proceeding
8
Adaptive labeling: normalization of speech by adaptive transformations based on vector quantization
Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)“…A general technique termed adaptive labeling is presented for the normalization of the speech signal. In principle, adaptive labeling is applicable to any…”
Conference Proceeding
9
Speech recognition using noise-adaptive prototypes
Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)“…A probabilistic mixture model is described for a frame (the short-term spectrum) of speech to be used in speech recognition. Each component of the mixture is…”
Conference Proceeding
10
Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems
Published in 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (1996)“…A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test…”
Conference Proceeding
11
Experiments using data augmentation for speaker adaptation
Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)“…Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work…”
Conference Proceeding
12
A channel-bank-based phone detection strategy
Published in Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing (1994)“…This paper presents a channel-bank based phone detection algorithm, that can be used to greatly cut down the search space in the process of mapping a set of…”
Conference Proceeding
13
Decoder selection based on cross-entropies
Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)“…The authors generalize the maximum likelihood and related optimization criteria for training and decoding with a speech recognizer. The generalizations are…”
Conference Proceeding
14
Acoustic Markov models used in the Tangora speech recognition system
Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)“…The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences…”
Conference Proceeding
15
Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)“…In this paper we discuss various experimental results using our continuous speech recognition system on the Wall Street Journal task. Experiments with…”
Conference Proceeding
16
An iterative 'flip-flop' approximation of the most informative split in the construction of decision trees
Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)“…The authors seek a fast algorithm for finding the best question to ask (i.e., best split of predictor values) about a predictor variable when predicting…”
Conference Proceeding
17
Context dependent vector quantization for continuous speech recognition
Published in 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (1993)“…The authors present a method for designing a vector quantizer for speech recognition that uses decision networks constructed by examining the phonetic context…”
Conference Proceeding
18
Speaker clustering and transformation for speaker adaptation in speech recognition systems
Published in IEEE transactions on speech and audio processing (01-01-1998)“…A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test…”
Journal Article
19
Automatic phonetic baseform determination
Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)“…The authors describe a series of experiments in which the phonetic baseform is deduced automatically for new words by utilizing actual utterances of the new…”
Conference Proceeding
20
Speech recognition using noise-adaptive prototypes
Published in IEEE transactions on acoustics, speech, and signal processing (01-10-1989)“…A probabilistic mixture model is described for a frame (the short term spectrum) of speech to be used in speech recognition. Each component of the mixture is…”
Journal Article