Search Results - "Child, Rewon"
-
1
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
Published 20-11-2020“…We present a hierarchical VAE that, for the first time, generates samples quickly while outperforming the PixelCNN in log-likelihood on all natural image…”
Get full text
Journal Article -
2
Exploring neural transducers for end-to-end speech recognition
Published in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01-12-2017)“…In this work, we perform an empirical comparison among the CTC, RNN-Transducer, and attention-based Seq2Seq models for end-to-end speech recognition. We show…”
Get full text
Conference Proceeding -
3
Generating Long Sequences with Sparse Transformers
Published 23-04-2019“…Transformers are powerful sequence models, but require time and memory that grows quadratically with the sequence length. In this paper we introduce sparse…”
Get full text
Journal Article -
4
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Published 28-01-2022“…Pretrained general-purpose language models can achieve state-of-the-art accuracies in various natural language processing domains by adapting to downstream…”
Get full text
Journal Article -
5
Scaling Laws for Neural Language Models
Published 22-01-2020“…We study empirical scaling laws for language model performance on the cross-entropy loss. The loss scales as a power-law with model size, dataset size, and the…”
Get full text
Journal Article -
6
PaLM: Scaling Language Modeling with Pathways
Published 05-04-2022“…Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically…”
Get full text
Journal Article -
7
Active Learning for Speech Recognition: the Power of Gradients
Published 09-12-2016“…In training speech recognition systems, labeling audio clips can be expensive, and not all data is equally valuable. Active learning aims to label only the…”
Get full text
Journal Article -
8
Language Models are Few-Shot Learners
Published 28-05-2020“…Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific…”
Get full text
Journal Article -
9
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
Published 15-03-2017“…Keyword spotting (KWS) constitutes a major component of human-technology interfaces. Maximizing the detection accuracy at a low false alarm (FA) rate, while…”
Get full text
Journal Article -
10
Exploring Neural Transducers for End-to-End Speech Recognition
Published 24-07-2017“…In this work, we perform an empirical comparison among the CTC, RNN-Transducer, and attention-based Seq2Seq models for end-to-end speech recognition. We show…”
Get full text
Journal Article -
11
Reducing Bias in Production Speech Models
Published 11-05-2017“…Replacing hand-engineered pipelines with end-to-end deep learning systems has enabled strong results in applications like speech and object recognition…”
Get full text
Journal Article