Search Results - "Stolcke, A."
-
1
The Microsoft 2017 Conversational Speech Recognition System
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-04-2018)“…We describe the latest version of Microsoft's conversational speech recognition system for the Switchboard and CallHome domains. The system adds a CNN-BLSTM…”
Get full text
Conference Proceeding -
2
The microsoft 2016 conversational speech recognition system
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-03-2017)“…We describe Microsoft's conversational speech recognition system, in which we combine recent developments in neural-network-based acoustic and language…”
Get full text
Conference Proceeding -
3
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
Published in IEEE transactions on audio, speech, and language processing (01-09-2006)“…Effective human and automatic processing of speech requires recovery of more than just the words. It also involves recovering phenomena such as sentence…”
Get full text
Journal Article -
4
Modeling prosodic feature sequences for speaker recognition
Published in Speech communication (01-07-2005)“…We describe a novel approach to modeling idiosyncratic prosodic behavior for automatic speaker recognition. The approach computes various duration, pitch, and…”
Get full text
Journal Article Conference Proceeding -
5
Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms
Published in IEEE transactions on audio, speech, and language processing (01-09-2007)“…We present a new modeling approach for speaker recognition that uses the maximum-likelihood linear regression (MLLR) adaptation transforms employed by a speech…”
Get full text
Journal Article -
6
Recent innovations in speech-to-text transcription at SRI-ICSI-UW
Published in IEEE transactions on audio, speech, and language processing (01-09-2006)“…We summarize recent progress in automatic speech-to-text transcription at SRI, ICSI, and the University of Washington. The work encompasses all components of…”
Get full text
Journal Article -
7
Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?
Published in Language and speech (01-07-1998)“…Investigated whether current approaches to automatically classifying dialog acts (DAs) in natural conversation could be improved by adding prosodic…”
Get more information
Journal Article -
8
Statistical language modeling for speech disfluencies
Published in 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (1996)“…Speech disfluencies (such as filled pauses, repetitions, restarts) are among the characteristics distinguishing spontaneous speech from planned or read speech…”
Get full text
Conference Proceeding -
9
The use of a linguistically motivated language model in conversational speech recognition
Published in 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (2004)“…Structured language models have recently been shown to give significant improvements in large-vocabulary recognition relative to traditional word N-gram…”
Get full text
Conference Proceeding -
10
Finding consensus in speech recognition: word error minimization and other applications of confusion networks
Published in Computer speech & language (01-10-2000)“…We describe a new framework for distilling information from word lattices to improve the accuracy of the speech recognition output and obtain a more…”
Get full text
Journal Article -
11
Editorial for computer speech and language
Published in Computer speech & language (2006)Get full text
Journal Article -
12
The CALO Meeting Assistant System
Published in IEEE transactions on audio, speech, and language processing (01-08-2010)“…The CALO Meeting Assistant (MA) provides for distributed meeting capture, annotation, automatic transcription and semantic analysis of multiparty meetings, and…”
Get full text
Journal Article -
13
Prosody-based automatic segmentation of speech into sentences and topics
Published in Speech communication (2000)“…A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and…”
Get full text
Journal Article -
14
Multispeaker speech activity detection for the ICSI meeting recorder
Published in IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01 (2001)“…As part of a project into speech recognition in meeting environments, we have collected a corpus of multichannel meeting recordings. We expected the…”
Get full text
Conference Proceeding -
15
Meetings about meetings: research at ICSI on speech in multiparty conversations
Published in 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03) (2003)“…In early 2001, we reported (at the Human Language Technology meeting) the early stages of an ICSI (International Computer Science Institute) project on…”
Get full text
Conference Proceeding -
16
Nonparametric feature normalization for SVM-based speaker verification
Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2008)“…We investigate several feature normalization and scaling approaches for use in speaker verification based on support vector machines. We are particularly…”
Get full text
Conference Proceeding -
17
Neural-network based measures of confidence for word recognition
Published in 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (1997)“…This paper proposes a probabilistic framework to define and evaluate confidence measures for word recognition. We describe a novel method to combine different…”
Get full text
Conference Proceeding -
18
Automatic linguistic segmentation of conversational speech
Published in Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96 (1996)“…As speech recognition moves toward more unconstrained domains such as conversational speech, we encounter a need to be able to segment (or resegment) waveforms…”
Get full text
Conference Proceeding -
19
Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
Published in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (01-03-2008)“…We address the problem of retrieving out-of-vocabulary (OOV) words/queries from audio archives for spoken term detection (STD) task. Many STD systems use the…”
Get full text
Conference Proceeding -
20
Unsupervised Languagemodel Adaptation for Meeting Recognition
Published in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 (01-04-2007)“…We present an application of unsupervised language model (ML) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on…”
Get full text
Conference Proceeding