Search Results - "Kastner, Kyle"

1
Deep learning-based point-scanning super-resolution imaging by Fang, Linjing, Monroe, Fred, Novak, Sammy Weiser, Kirk, Lyndsey, Schiavon, Cara R., Yu, Seungyoon B., Zhang, Tong, Wu, Melissa, Kastner, Kyle, Latif, Alaa Abdel, Lin, Zijun, Shaw, Andrew, Kubota, Yoshiyuki, Mendenhall, John, Zhang, Zhao, Pekkurnaz, Gulcin, Harris, Kristen, Howard, Jeremy, Manor, Uri

Published in Nature methods (01-04-2021)
“…Point-scanning imaging systems are among the most widely used tools for high-resolution cellular and tissue imaging, benefiting from arbitrarily defined pixel…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, Joao Felipe, Bengio, Yoshua, Courville, Aaron

Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2019)
“…Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
3
Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Recently, a number of approaches to train speech models by incorporating text into end-to-end models have been developed, with Maestro advancing…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Deep Learning‐Based Point‐Scanning Super‐Resolution Imaging by Manor, Uri, Fang, Linjing, Howard, Jeremy, Monroe, Fred, Weiser, Sammy, Kastner, Kyle, Kirk, Lyndsey, Harris, Kristen, Pekkurnaz, Gulcin, Yoon, Blenda, Schiavon, Cara, Zhang, Tong

Published in The FASEB journal (01-04-2020)
“…Abstract only…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Francoise, Shemtov, Hadar

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…Collecting high-quality studio recordings of audio is challenging, which limits the language coverage of text-to-speech (TTS) systems. This paper proposes a…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
6
ReSeg: A Recurrent Neural Network-Based Model for Semantic Segmentation by Visin, Francesco, Romero, Adriana, Kyunghyun Cho, Matteucci, Matteo, Ciccone, Marco, Kastner, Kyle, Bengio, Yoshua, Courville, Aaron

Published in 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2016)
“…We propose a structured prediction architecture, which exploits the local generic features extracted by Convolutional Neural Networks and the capacity of…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS by Kastner, Kyle, Courville, Aaron

Published 30-06-2022
“…This paper introduces R-MelNet, a two-part autoregressive architecture with a frontend based on the first tier of MelNet and a backend WaveRNN-style audio…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Zero-shot Cross-lingual Voice Transfer for TTS by Biadsy, Fadi, Chen, Youzheng, Elias, Isaac, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Ramabhadran, Bhuvana

Published 20-09-2024
“…In this paper, we introduce a zero-shot Voice Transfer (VT) module that can be seamlessly integrated into a multi-lingual Text-to-speech (TTS) system to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

Published 27-04-2023
“…Recently, a number of approaches to train speech models by incorpo-rating text into end-to-end models have been developed, with Mae-stro advancing…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

Published 19-08-2024
“…The keyword spotting (KWS) problem requires large amounts of real speech training data to achieve high accuracy across diverse populations. Utilizing large…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

Published 26-07-2024
“…This paper explores the use of TTS synthesized training data for KWS (keyword spotting) task while minimizing development cost and time. Keyword spotting…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Biadsy, Fadi, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Françoise, Shemtov, Hadar

Published 29-02-2024
“…Collecting high-quality studio recordings of audio is challenging, which limits the language coverage of text-to-speech (TTS) systems. This paper proposes a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings by Li, Christopher, Wang, Gary, Kastner, Kyle, Su, Heng, Chen, Allen, Rosenberg, Andrew, Chen, Zhehuai, Wu, Zelin, Velikovich, Leonid, Rondon, Pat, Caseiro, Diamantino, Aleksic, Petar

Published 08-01-2024
“…Automatic speech recognition (ASR) systems can suffer from poor recall for various reasons, such as noisy audio, lack of sufficient training data, etc…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling by Wu, Yusong, Manilow, Ethan, Deng, Yi, Swavely, Rigel, Kastner, Kyle, Cooijmans, Tim, Courville, Aaron, Huang, Cheng-Zhi Anna, Engel, Jesse

Published 16-12-2021
“…Musical expression requires control of both what notes are played, and how they are performed. Conventional audio synthesizers provide detailed expressive…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Planning in Dynamic Environments with Conditional Autoregressive Models by Hansen, Johanna, Kastner, Kyle, Courville, Aaron, Dudek, Gregory

Published 25-11-2018
“…We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b)…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
Harmonic Recomposition using Conditional Autoregressive Modeling by Kastner, Kyle, Kumar, Rithesh, Cooijmans, Tim, Courville, Aaron

Published 18-11-2018
“…We demonstrate a conditional autoregressive pipeline for efficient music recomposition, based on methods presented in van den Oord et al.(2017). Recomposition…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
17
Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, João Felipe, Bengio, Yoshua, Courville, Aaron

Published 17-11-2018
“…Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Blindfold Baselines for Embodied QA by Anand, Ankesh, Belilovsky, Eugene, Kastner, Kyle, Larochelle, Hugo, Courville, Aaron

Published 12-11-2018
“…We explore blindfold (question-only) baselines for Embodied Question Answering. The EmbodiedQA task requires an agent to answer a question by intelligently…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
Learning Distributed Representations from Reviews for Collaborative Filtering by Almahairi, Amjad, Kastner, Kyle, Cho, Kyunghyun, Courville, Aaron

Published 18-06-2018
“…Recent work has shown that collaborative filter-based recommender systems can be improved by incorporating side information, such as natural language reviews,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
Learning to Discover Sparse Graphical Models by Belilovsky, Eugene, Kastner, Kyle, Varoquaux, Gaël, Blaschko, Matthew

Published 20-05-2016
“…We consider structure discovery of undirected graphical models from observational data. Inferring likely structures from few examples is a complex task often…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Kastner, Kyle"

Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, Joao Felipe, Bengio, Yoshua, Courville, Aaron

Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

Deep Learning‐Based Point‐Scanning Super‐Resolution Imaging by Manor, Uri, Fang, Linjing, Howard, Jeremy, Monroe, Fred, Weiser, Sammy, Kastner, Kyle, Kirk, Lyndsey, Harris, Kristen, Pekkurnaz, Gulcin, Yoon, Blenda, Schiavon, Cara, Zhang, Tong

Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Francoise, Shemtov, Hadar

ReSeg: A Recurrent Neural Network-Based Model for Semantic Segmentation by Visin, Francesco, Romero, Adriana, Kyunghyun Cho, Matteucci, Matteo, Ciccone, Marco, Kastner, Kyle, Bengio, Yoshua, Courville, Aaron

R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS by Kastner, Kyle, Courville, Aaron

Zero-shot Cross-lingual Voice Transfer for TTS by Biadsy, Fadi, Chen, Youzheng, Elias, Isaac, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Ramabhadran, Bhuvana

Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Biadsy, Fadi, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Françoise, Shemtov, Hadar

High-precision Voice Search Query Correction via Retrievable Speech-text Embedings by Li, Christopher, Wang, Gary, Kastner, Kyle, Su, Heng, Chen, Allen, Rosenberg, Andrew, Chen, Zhehuai, Wu, Zelin, Velikovich, Leonid, Rondon, Pat, Caseiro, Diamantino, Aleksic, Petar

MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling by Wu, Yusong, Manilow, Ethan, Deng, Yi, Swavely, Rigel, Kastner, Kyle, Cooijmans, Tim, Courville, Aaron, Huang, Cheng-Zhi Anna, Engel, Jesse

Planning in Dynamic Environments with Conditional Autoregressive Models by Hansen, Johanna, Kastner, Kyle, Courville, Aaron, Dudek, Gregory

Harmonic Recomposition using Conditional Autoregressive Modeling by Kastner, Kyle, Kumar, Rithesh, Cooijmans, Tim, Courville, Aaron

Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, João Felipe, Bengio, Yoshua, Courville, Aaron

Blindfold Baselines for Embodied QA by Anand, Ankesh, Belilovsky, Eugene, Kastner, Kyle, Larochelle, Hugo, Courville, Aaron

Learning Distributed Representations from Reviews for Collaborative Filtering by Almahairi, Amjad, Kastner, Kyle, Cho, Kyunghyun, Courville, Aaron

Learning to Discover Sparse Graphical Models by Belilovsky, Eugene, Kastner, Kyle, Varoquaux, Gaël, Blaschko, Matthew

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication