Search Results - "Picheny, M.A."

1
The IBM expressive text-to-speech synthesis system for American English by Pitrelli, J.F., Bakis, R., Eide, E.M., Fernandez, R., Hamza, W., Picheny, M.A.

Published in IEEE transactions on audio, speech, and language processing (01-07-2006)
“…Expressive text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions which…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Decision trees for phonological rules in continuous speech by Bahl, L.R., deSouza, P.V., Gopalakrishnan, P.S., Nahamoo, D., Picheny, M.A.

Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)
“…The authors present an automatic method for modeling phonological variation using decision trees. For each phone they construct a decision tree that specifies…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Towards Pooled-Speaker Concatenative Text-to-Speech by Eide, E.M., Picheny, M.A.

Published in 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (2006)
“…In this paper we explore the merging of data from various speakers in building a concatenative text-to-speech system. First, we investigate the pooling of data…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
On a model-robust training method for speech recognition by Nadas, A., Nahamoo, D., Picheny, M.A.

Published in IEEE transactions on acoustics, speech, and signal processing (01-09-1988)
“…Training methods for designing better decoders are compared. The training problem is considered as a statistical parameter estimation problem. In particular,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Robust methods for using context-dependent features and models in a continuous speech recognizer by Bahl, L.R., de Souza, P.V., Gopalakrishnan, P.S., Nahamoo, D., Picheny, M.A.

Published in Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing (1994)
“…In this paper we describe the method we use to derive acoustic features that reflect some of the dynamics of frame-based parameter vectors. Models for such…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
Rapid likelihood calculation of subspace clustered Gaussian components by Aiyer, A., Gales, M.J.F., Picheny, M.A.

Published in 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100) (2000)
“…In speech recognition systems, computing the likelihoods of the acoustic models is an intensive task. One approach to reduce this cost is to use subspace…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Context dependent phonetic duration models for decoding conversational speech by Monkowski, M.D., Picheny, M.A., Srinivasa Rao, P.

Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)
“…Conversational speech provides a particularly difficult task for speech recognition. It provides much more variability than either dictation, read speech, or…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
Adaptive labeling: normalization of speech by adaptive transformations based on vector quantization by Nadas, A., Nahamoo, D., Picheny, M.A.

Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)
“…A general technique termed adaptive labeling is presented for the normalization of the speech signal. In principle, adaptive labeling is applicable to any…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
9
Speech recognition using noise-adaptive prototypes by Nadas, A., Nahamoo, D., Picheny, M.A.

Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)
“…A probabilistic mixture model is described for a frame (the short-term spectrum) of each to be used in speech recognition. Each component of the mixture is…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems by Padmanabhan, M., Bahl, L.R., Nahamoo, D., Picheny, M.A.

Published in 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (1996)
“…A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
Experiments using data augmentation for speaker adaptation by Bellegarda, J.R., de Souza, P.V., Nahamoo, D., Padmanabhan, M., Picheny, M.A., Bahl, L.R.

Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)
“…Speaker adaptation typically involves customizing some existing (reference) models in order to account for the characteristics of a new speaker. This work…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
12
A channel-bank-based phone detection strategy by Gopalakrishnan, P.S., Nahamoo, D., Padmanabhan, M., Picheny, M.A.

Published in Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing (1994)
“…This paper presents a channel-bank based phone detection algorithm, that can be used in greatly cut down the search space in the process of mapping a set of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Decoder selection based on cross-entropies by Gopalakrishnan, P.S., Kanevsky, D., Nadas, A., Nahamoo, D., Picheny, M.A.

Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)
“…The authors generalize the maximum likelihood and related optimization criteria for training and decoding with a speech recognizer. The generalizations are…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
14
Acoustic Markov models used in the Tangora speech recognition system by Bahl, L.R., Brown, P.F., de Souza, P.V., Picheny, M.A.

Published in ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (1988)
“…The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task by Bahl, L.R., Balakrishnan-Aiyer, S., Bellgarda, J.R., Franz, M., Gopalakrishnan, P.S., Nahamoo, D., Novak, M., Padmanabhan, M., Picheny, M.A., Roukos, S.

Published in 1995 International Conference on Acoustics, Speech, and Signal Processing (1995)
“…In this paper we discuss various experimental results using our continuous speech recognition system on the Wall Street Journal task. Experiments with…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
An iterative 'flip-flop' approximation of the most informative split in the construction of decision trees by Nadas, A., Nahamoo, D., Picheny, M.A., Powell, J.

Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)
“…The authors seek a fast algorithm for finding the best question to ask (i.e., best split of predictor values) about a predictor variable when predicting…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
Context dependent vector quantization for continuous speech recognition by Bahl, L.R., de Souza, P.V., Gopalakrishnan, P.S., Picheny, M.A.

Published in 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (1993)
“…The authors present a method for designing a vector quantizer for speech recognition that uses decision networks constructed by examining the phonetic context…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
18
Speaker clustering and transformation for speaker adaptation in speech recognition systems by Padmanabhan, M., Bahl, L.R., Nahamoo, D., Picheny, M.A.

Published in IEEE transactions on speech and audio processing (01-01-1998)
“…A speaker adaptation strategy is described that is based on finding a subset of speakers, from the training set, who are acoustically close to the test…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
Automatic phonetic baseform determination by Bahl, L.R., Das, S., deSouza, P.V., Epstein, M., Mercer, R.L., Merialdo, B., Nahamoo, D., Picheny, M.A., Powell, J.

Published in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing (1991)
“…The authors describe a series of experiments in which the phonetic baseform is deduced automatically for new words by utilizing actual utterances of the new…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
Speech recognition using noise-adaptive prototypes by Nadas, A., Nahamoo, D., Picheny, M.A.

Published in IEEE transactions on acoustics, speech, and signal processing (01-10-1989)
“…A probabilistic mixture mode is described for a frame (the short term spectrum) of speech to be used in speech recognition. Each component of the mixture is…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Picheny, M.A."

The IBM expressive text-to-speech synthesis system for American English by Pitrelli, J.F., Bakis, R., Eide, E.M., Fernandez, R., Hamza, W., Picheny, M.A.

Decision trees for phonological rules in continuous speech by Bahl, L.R., deSouza, P.V., Gopalakrishnan, P.S., Nahamoo, D., Picheny, M.A.

Towards Pooled-Speaker Concatenative Text-to-Speech by Eide, E.M., Picheny, M.A.

On a model-robust training method for speech recognition by Nadas, A., Nahamoo, D., Picheny, M.A.

Robust methods for using context-dependent features and models in a continuous speech recognizer by Bahl, L.R., de Souza, P.V., Gopalakrishnan, P.S., Nahamoo, D., Picheny, M.A.

Rapid likelihood calculation of subspace clustered Gaussian components by Aiyer, A., Gales, M.J.F., Picheny, M.A.

Context dependent phonetic duration models for decoding conversational speech by Monkowski, M.D., Picheny, M.A., Srinivasa Rao, P.

Adaptive labeling: normalization of speech by adaptive transformations based on vector quantization by Nadas, A., Nahamoo, D., Picheny, M.A.

Speech recognition using noise-adaptive prototypes by Nadas, A., Nahamoo, D., Picheny, M.A.

Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems by Padmanabhan, M., Bahl, L.R., Nahamoo, D., Picheny, M.A.

Experiments using data augmentation for speaker adaptation by Bellegarda, J.R., de Souza, P.V., Nahamoo, D., Padmanabhan, M., Picheny, M.A., Bahl, L.R.

A channel-bank-based phone detection strategy by Gopalakrishnan, P.S., Nahamoo, D., Padmanabhan, M., Picheny, M.A.

Decoder selection based on cross-entropies by Gopalakrishnan, P.S., Kanevsky, D., Nadas, A., Nahamoo, D., Picheny, M.A.

Acoustic Markov models used in the Tangora speech recognition system by Bahl, L.R., Brown, P.F., de Souza, P.V., Picheny, M.A.

Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task by Bahl, L.R., Balakrishnan-Aiyer, S., Bellgarda, J.R., Franz, M., Gopalakrishnan, P.S., Nahamoo, D., Novak, M., Padmanabhan, M., Picheny, M.A., Roukos, S.

An iterative 'flip-flop' approximation of the most informative split in the construction of decision trees by Nadas, A., Nahamoo, D., Picheny, M.A., Powell, J.

Context dependent vector quantization for continuous speech recognition by Bahl, L.R., de Souza, P.V., Gopalakrishnan, P.S., Picheny, M.A.

Speaker clustering and transformation for speaker adaptation in speech recognition systems by Padmanabhan, M., Bahl, L.R., Nahamoo, D., Picheny, M.A.

Automatic phonetic baseform determination by Bahl, L.R., Das, S., deSouza, P.V., Epstein, M., Mercer, R.L., Merialdo, B., Nahamoo, D., Picheny, M.A., Powell, J.

Speech recognition using noise-adaptive prototypes by Nadas, A., Nahamoo, D., Picheny, M.A.

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication