Search Results - "Kastner, Kyle"

Refine Results
  1. 1

    Deep learning-based point-scanning super-resolution imaging by Fang, Linjing, Monroe, Fred, Novak, Sammy Weiser, Kirk, Lyndsey, Schiavon, Cara R., Yu, Seungyoon B., Zhang, Tong, Wu, Melissa, Kastner, Kyle, Latif, Alaa Abdel, Lin, Zijun, Shaw, Andrew, Kubota, Yoshiyuki, Mendenhall, John, Zhang, Zhao, Pekkurnaz, Gulcin, Harris, Kristen, Howard, Jeremy, Manor, Uri

    Published in Nature methods (01-04-2021)
    “…Point-scanning imaging systems are among the most widely used tools for high-resolution cellular and tissue imaging, benefiting from arbitrarily defined pixel…”
    Get full text
    Journal Article
  2. 2

    Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, Joao Felipe, Bengio, Yoshua, Courville, Aaron

    “…Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice…”
    Get full text
    Conference Proceeding
  3. 3

    Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

    “…Recently, a number of approaches to train speech models by incorporating text into end-to-end models have been developed, with Maestro advancing…”
    Get full text
    Conference Proceeding
  4. 4
  5. 5

    Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Francoise, Shemtov, Hadar

    “…Collecting high-quality studio recordings of audio is challenging, which limits the language coverage of text-to-speech (TTS) systems. This paper proposes a…”
    Get full text
    Conference Proceeding
  6. 6

    ReSeg: A Recurrent Neural Network-Based Model for Semantic Segmentation by Visin, Francesco, Romero, Adriana, Kyunghyun Cho, Matteucci, Matteo, Ciccone, Marco, Kastner, Kyle, Bengio, Yoshua, Courville, Aaron

    “…We propose a structured prediction architecture, which exploits the local generic features extracted by Convolutional Neural Networks and the capacity of…”
    Get full text
    Conference Proceeding
  7. 7

    R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS by Kastner, Kyle, Courville, Aaron

    Published 30-06-2022
    “…This paper introduces R-MelNet, a two-part autoregressive architecture with a frontend based on the first tier of MelNet and a backend WaveRNN-style audio…”
    Get full text
    Journal Article
  8. 8

    Zero-shot Cross-lingual Voice Transfer for TTS by Biadsy, Fadi, Chen, Youzheng, Elias, Isaac, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Ramabhadran, Bhuvana

    Published 20-09-2024
    “…In this paper, we introduce a zero-shot Voice Transfer (VT) module that can be seamlessly integrated into a multi-lingual Text-to-speech (TTS) system to…”
    Get full text
    Journal Article
  9. 9

    Understanding Shared Speech-Text Representations by Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu

    Published 27-04-2023
    “…Recently, a number of approaches to train speech models by incorpo-rating text into end-to-end models have been developed, with Mae-stro advancing…”
    Get full text
    Journal Article
  10. 10

    Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

    Published 19-08-2024
    “…The keyword spotting (KWS) problem requires large amounts of real speech training data to achieve high accuracy across diverse populations. Utilizing large…”
    Get full text
    Journal Article
  11. 11

    Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model by Park, Hyun Jin, Agarwal, Dhruuv, Chen, Neng, Sun, Rentao, Partridge, Kurt, Chen, Justin, Zhang, Harry, Zhu, Pai, Bartel, Jacob, Kastner, Kyle, Wang, Gary, Rosenberg, Andrew, Wang, Quan

    Published 26-07-2024
    “…This paper explores the use of TTS synthesized training data for KWS (keyword spotting) task while minimizing development cost and time. Keyword spotting…”
    Get full text
    Journal Article
  12. 12

    Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data by Saeki, Takaaki, Wang, Gary, Morioka, Nobuyuki, Elias, Isaac, Kastner, Kyle, Biadsy, Fadi, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zen, Heiga, Beaufays, Françoise, Shemtov, Hadar

    Published 29-02-2024
    “…Collecting high-quality studio recordings of audio is challenging, which limits the language coverage of text-to-speech (TTS) systems. This paper proposes a…”
    Get full text
    Journal Article
  13. 13

    High-precision Voice Search Query Correction via Retrievable Speech-text Embedings by Li, Christopher, Wang, Gary, Kastner, Kyle, Su, Heng, Chen, Allen, Rosenberg, Andrew, Chen, Zhehuai, Wu, Zelin, Velikovich, Leonid, Rondon, Pat, Caseiro, Diamantino, Aleksic, Petar

    Published 08-01-2024
    “…Automatic speech recognition (ASR) systems can suffer from poor recall for various reasons, such as noisy audio, lack of sufficient training data, etc…”
    Get full text
    Journal Article
  14. 14

    MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling by Wu, Yusong, Manilow, Ethan, Deng, Yi, Swavely, Rigel, Kastner, Kyle, Cooijmans, Tim, Courville, Aaron, Huang, Cheng-Zhi Anna, Engel, Jesse

    Published 16-12-2021
    “…Musical expression requires control of both what notes are played, and how they are performed. Conventional audio synthesizers provide detailed expressive…”
    Get full text
    Journal Article
  15. 15

    Planning in Dynamic Environments with Conditional Autoregressive Models by Hansen, Johanna, Kastner, Kyle, Courville, Aaron, Dudek, Gregory

    Published 25-11-2018
    “…We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b)…”
    Get full text
    Journal Article
  16. 16

    Harmonic Recomposition using Conditional Autoregressive Modeling by Kastner, Kyle, Kumar, Rithesh, Cooijmans, Tim, Courville, Aaron

    Published 18-11-2018
    “…We demonstrate a conditional autoregressive pipeline for efficient music recomposition, based on methods presented in van den Oord et al.(2017). Recomposition…”
    Get full text
    Journal Article
  17. 17

    Representation Mixing for TTS Synthesis by Kastner, Kyle, Santos, João Felipe, Bengio, Yoshua, Courville, Aaron

    Published 17-11-2018
    “…Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice…”
    Get full text
    Journal Article
  18. 18

    Blindfold Baselines for Embodied QA by Anand, Ankesh, Belilovsky, Eugene, Kastner, Kyle, Larochelle, Hugo, Courville, Aaron

    Published 12-11-2018
    “…We explore blindfold (question-only) baselines for Embodied Question Answering. The EmbodiedQA task requires an agent to answer a question by intelligently…”
    Get full text
    Journal Article
  19. 19

    Learning Distributed Representations from Reviews for Collaborative Filtering by Almahairi, Amjad, Kastner, Kyle, Cho, Kyunghyun, Courville, Aaron

    Published 18-06-2018
    “…Recent work has shown that collaborative filter-based recommender systems can be improved by incorporating side information, such as natural language reviews,…”
    Get full text
    Journal Article
  20. 20

    Learning to Discover Sparse Graphical Models by Belilovsky, Eugene, Kastner, Kyle, Varoquaux, Gaël, Blaschko, Matthew

    Published 20-05-2016
    “…We consider structure discovery of undirected graphical models from observational data. Inferring likely structures from few examples is a complex task often…”
    Get full text
    Journal Article