Search Results - "Gulcehre, Caglar"
-
1
EmoNets: Multimodal deep learning approaches for emotion recognition in video
Published in Journal on multimodal user interfaces (01-06-2016)“…The task of the Emotion Recognition in the Wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style…”
Get full text
Journal Article -
2
On integrating a language model into neural machine translation
Published in Computer speech & language (01-09-2017)“…Recent advances in end-to-end neural machine translation models have achieved promising results on high-resource language pairs such as En→ Fr and En→ De. One…”
Get full text
Journal Article -
3
Grandmaster level in StarCraft II using multi-agent reinforcement learning
Published in Nature (London) (01-11-2019)“…Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal,…”
Get full text
Journal Article -
4
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Published 28-10-2024“…Autoregressive (AR) Large Language Models (LLMs) have demonstrated significant success across numerous tasks. However, the AR modeling paradigm presents…”
Get full text
Journal Article -
5
In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning
Published 23-07-2024“…Multi-objective reinforcement learning (MORL) is essential for addressing the intricacies of real-world RL problems, which often require trade-offs between…”
Get full text
Journal Article -
6
Promises, Outlooks and Challenges of Diffusion Language Modeling
Published 17-06-2024“…The modern autoregressive Large Language Models (LLMs) have achieved outstanding performance on NLP benchmarks, and they are deployed in the real world…”
Get full text
Journal Article -
7
The Role of Deep Learning Regularizations on Actors in Offline RL
Published 11-09-2024“…Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial…”
Get full text
Journal Article -
8
The Effect of Scheduling and Preemption on the Efficiency of LLM Inference Serving
Published 11-11-2024“…The growing usage of Large Language Models (LLMs) highlights the demands and challenges in scalable LLM inference systems, affecting deployment and development…”
Get full text
Journal Article -
9
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning
Published 24-10-2024“…Large Language Models (LLMs) can transfer their reasoning skills to smaller models by teaching them to generate the intermediate reasoning process required to…”
Get full text
Journal Article -
10
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis
Published 13-07-2024“…State-of-the-art LLMs often rely on scale with high computational costs, which has sparked a research agenda to reduce parameter counts and costs without…”
Get full text
Journal Article -
11
HiPPO-Prophecy: State-Space Models can Provably Learn Dynamical Systems in Context
Published 12-07-2024“…This work explores the in-context learning capabilities of State Space Models (SSMs) and presents, to the best of our knowledge, the first theoretical…”
Get full text
Journal Article -
12
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers
Published 24-06-2024“…State-of-the-art results in large language models (LLMs) often rely on scale, which becomes computationally expensive. This has sparked a research agenda to…”
Get full text
Journal Article -
13
Aligning Large Language Models with Diverse Political Viewpoints
Published 20-06-2024“…Large language models such as ChatGPT exhibit striking political biases. If users query them about political information, they often take a normative stance…”
Get full text
Journal Article -
14
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Published 01-05-2024“…Reinforcement learning (RL) is inherently rife with non-stationarity since the states and rewards the agent observes during training depend on its changing…”
Get full text
Journal Article -
15
Simple Hierarchical Planning with Diffusion
Published 05-01-2024“…Diffusion-based generative methods have proven effective in modeling trajectories with offline datasets. However, they often face computational challenges and…”
Get full text
Journal Article -
16
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Published 15-11-2023“…Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a…”
Get full text
Journal Article -
17
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Published 28-10-2024“…Sparse autoencoders (SAEs) have become a core ingredient in the reverse engineering of large-language models (LLMs). For LLMs, they have been shown to…”
Get full text
Journal Article -
18
Self-Recognition in Language Models
Published 09-07-2024“…A rapidly growing number of applications rely on a small set of closed-source language models (LMs). This dependency might introduce novel security risks if…”
Get full text
Journal Article -
19
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Published 10-06-2024“…Despite the recent advancements in offline RL, no unified algorithm could achieve superior performance across a broad range of tasks. Offline \textit{value…”
Get full text
Journal Article -
20
Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering
Published 07-05-2024“…Large language models (LLMs) have significantly evolved, moving from simple output generation to complex reasoning and from stand-alone usage to being embedded…”
Get full text
Journal Article