Search Results - "Gulcehre, Caglar"

Refine Results
  1. 1
  2. 2

    On integrating a language model into neural machine translation by Gulcehre, Caglar, Firat, Orhan, Xu, Kelvin, Cho, Kyunghyun, Bengio, Yoshua

    Published in Computer speech & language (01-09-2017)
    “…Recent advances in end-to-end neural machine translation models have achieved promising results on high-resource language pairs such as En→ Fr and En→ De. One…”
    Get full text
    Journal Article
  3. 3
  4. 4

    Beyond Autoregression: Fast LLMs via Self-Distillation Through Time by Deschenaux, Justin, Gulcehre, Caglar

    Published 28-10-2024
    “…Autoregressive (AR) Large Language Models (LLMs) have demonstrated significant success across numerous tasks. However, the AR modeling paradigm presents…”
    Get full text
    Journal Article
  5. 5

    In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning by Terekhov, Mikhail, Gulcehre, Caglar

    Published 23-07-2024
    “…Multi-objective reinforcement learning (MORL) is essential for addressing the intricacies of real-world RL problems, which often require trade-offs between…”
    Get full text
    Journal Article
  6. 6

    Promises, Outlooks and Challenges of Diffusion Language Modeling by Deschenaux, Justin, Gulcehre, Caglar

    Published 17-06-2024
    “…The modern autoregressive Large Language Models (LLMs) have achieved outstanding performance on NLP benchmarks, and they are deployed in the real world…”
    Get full text
    Journal Article
  7. 7

    The Role of Deep Learning Regularizations on Actors in Offline RL by Tarasov, Denis, Surina, Anja, Gulcehre, Caglar

    Published 11-09-2024
    “…Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial…”
    Get full text
    Journal Article
  8. 8

    The Effect of Scheduling and Preemption on the Efficiency of LLM Inference Serving by Kim, Kyoungmin, Hong, Kijae, Gulcehre, Caglar, Ailamaki, Anastasia

    Published 11-11-2024
    “…The growing usage of Large Language Models (LLMs) highlights the demands and challenges in scalable LLM inference systems, affecting deployment and development…”
    Get full text
    Journal Article
  9. 9

    SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning by Adarsh, Shivam, Shridhar, Kumar, Gulcehre, Caglar, Monath, Nicholas, Sachan, Mrinmaya

    Published 24-10-2024
    “…Large Language Models (LLMs) can transfer their reasoning skills to smaller models by teaching them to generate the intermediate reasoning process required to…”
    Get full text
    Journal Article
  10. 10

    Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis by Wei, Xiuying, Moalla, Skander, Pascanu, Razvan, Gulcehre, Caglar

    Published 13-07-2024
    “…State-of-the-art LLMs often rely on scale with high computational costs, which has sparked a research agenda to reduce parameter counts and costs without…”
    Get full text
    Journal Article
  11. 11

    HiPPO-Prophecy: State-Space Models can Provably Learn Dynamical Systems in Context by Joseph, Federico Arangath, Haefeli, Kilian Konstantin, Liniger, Noah, Gulcehre, Caglar

    Published 12-07-2024
    “…This work explores the in-context learning capabilities of State Space Models (SSMs) and presents, to the best of our knowledge, the first theoretical…”
    Get full text
    Journal Article
  12. 12

    Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers by Wei, Xiuying, Moalla, Skander, Pascanu, Razvan, Gulcehre, Caglar

    Published 24-06-2024
    “…State-of-the-art results in large language models (LLMs) often rely on scale, which becomes computationally expensive. This has sparked a research agenda to…”
    Get full text
    Journal Article
  13. 13

    Aligning Large Language Models with Diverse Political Viewpoints by Stammbach, Dominik, Widmer, Philine, Cho, Eunjung, Gulcehre, Caglar, Ash, Elliott

    Published 20-06-2024
    “…Large language models such as ChatGPT exhibit striking political biases. If users query them about political information, they often take a normative stance…”
    Get full text
    Journal Article
  14. 14

    No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO by Moalla, Skander, Miele, Andrea, Pascanu, Razvan, Gulcehre, Caglar

    Published 01-05-2024
    “…Reinforcement learning (RL) is inherently rife with non-stationarity since the states and rewards the agent observes during training depend on its changing…”
    Get full text
    Journal Article
  15. 15

    Simple Hierarchical Planning with Diffusion by Chen, Chang, Deng, Fei, Kawaguchi, Kenji, Gulcehre, Caglar, Ahn, Sungjin

    Published 05-01-2024
    “…Diffusion-based generative methods have proven effective in modeling trajectories with offline datasets. However, they often face computational challenges and…”
    Get full text
    Journal Article
  16. 16

    Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models by Kim, Yeongbin, Singh, Gautam, Park, Junyeong, Gulcehre, Caglar, Ahn, Sungjin

    Published 15-11-2023
    “…Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a…”
    Get full text
    Journal Article
  17. 17

    Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders by Surkov, Viacheslav, Wendler, Chris, Terekhov, Mikhail, Deschenaux, Justin, West, Robert, Gulcehre, Caglar

    Published 28-10-2024
    “…Sparse autoencoders (SAEs) have become a core ingredient in the reverse engineering of large-language models (LLMs). For LLMs, they have been shown to…”
    Get full text
    Journal Article
  18. 18

    Self-Recognition in Language Models by Davidson, Tim R, Surkov, Viacheslav, Veselovsky, Veniamin, Russo, Giuseppe, West, Robert, Gulcehre, Caglar

    Published 09-07-2024
    “…A rapidly growing number of applications rely on a small set of closed-source language models (LMs). This dependency might introduce novel security risks if…”
    Get full text
    Journal Article
  19. 19

    PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer by Chen, Chang, Baek, Junyeob, Deng, Fei, Kawaguchi, Kenji, Gulcehre, Caglar, Ahn, Sungjin

    Published 10-06-2024
    “…Despite the recent advancements in offline RL, no unified algorithm could achieve superior performance across a broad range of tasks. Offline \textit{value…”
    Get full text
    Journal Article
  20. 20

    Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering by Arora, Akhil, Klein, Lars, Potamitis, Nearchos, Aydin, Roland, Gulcehre, Caglar, West, Robert

    Published 07-05-2024
    “…Large language models (LLMs) have significantly evolved, moving from simple output generation to complex reasoning and from stand-alone usage to being embedded…”
    Get full text
    Journal Article