Search Results - "Gulcehre, Caglar"

1
EmoNets: Multimodal deep learning approaches for emotion recognition in video by Kahou, Samira Ebrahimi, Bouthillier, Xavier, Lamblin, Pascal, Gulcehre, Caglar, Michalski, Vincent, Konda, Kishore, Jean, Sébastien, Froumenty, Pierre, Dauphin, Yann, Boulanger-Lewandowski, Nicolas, Chandias Ferrari, Raul, Mirza, Mehdi, Warde-Farley, David, Courville, Aaron, Vincent, Pascal, Memisevic, Roland, Pal, Christopher, Bengio, Yoshua

Published in Journal on multimodal user interfaces (01-06-2016)
“…The task of the Emotion Recognition in the Wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style…”

Journal Article

2
On integrating a language model into neural machine translation by Gulcehre, Caglar, Firat, Orhan, Xu, Kelvin, Cho, Kyunghyun, Bengio, Yoshua

Published in Computer speech & language (01-09-2017)
“…Recent advances in end-to-end neural machine translation models have achieved promising results on high-resource language pairs such as En→Fr and En→De. One…”

Journal Article

3
Grandmaster level in StarCraft II using multi-agent reinforcement learning by Vinyals, Oriol, Babuschkin, Igor, Czarnecki, Wojciech M., Mathieu, Michaël, Dudzik, Andrew, Chung, Junyoung, Choi, David H., Powell, Richard, Ewalds, Timo, Georgiev, Petko, Oh, Junhyuk, Horgan, Dan, Kroiss, Manuel, Danihelka, Ivo, Huang, Aja, Sifre, Laurent, Cai, Trevor, Agapiou, John P., Jaderberg, Max, Vezhnevets, Alexander S., Leblond, Rémi, Pohlen, Tobias, Dalibard, Valentin, Budden, David, Sulsky, Yury, Molloy, James, Paine, Tom L., Gulcehre, Caglar, Wang, Ziyu, Pfaff, Tobias, Wu, Yuhuai, Ring, Roman, Yogatama, Dani, Wünsch, Dario, McKinney, Katrina, Smith, Oliver, Schaul, Tom, Lillicrap, Timothy, Kavukcuoglu, Koray, Hassabis, Demis, Apps, Chris, Silver, David

Published in Nature (London) (01-11-2019)
“…Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal,…”

Journal Article

4
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time by Deschenaux, Justin, Gulcehre, Caglar

Published 28-10-2024
“…Autoregressive (AR) Large Language Models (LLMs) have demonstrated significant success across numerous tasks. However, the AR modeling paradigm presents…”

Journal Article

5
In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning by Terekhov, Mikhail, Gulcehre, Caglar

Published 23-07-2024
“…Multi-objective reinforcement learning (MORL) is essential for addressing the intricacies of real-world RL problems, which often require trade-offs between…”

Journal Article

6
Promises, Outlooks and Challenges of Diffusion Language Modeling by Deschenaux, Justin, Gulcehre, Caglar

Published 17-06-2024
“…The modern autoregressive Large Language Models (LLMs) have achieved outstanding performance on NLP benchmarks, and they are deployed in the real world…”

Journal Article

7
The Role of Deep Learning Regularizations on Actors in Offline RL by Tarasov, Denis, Surina, Anja, Gulcehre, Caglar

Published 11-09-2024
“…Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial…”

Journal Article

8
The Effect of Scheduling and Preemption on the Efficiency of LLM Inference Serving by Kim, Kyoungmin, Hong, Kijae, Gulcehre, Caglar, Ailamaki, Anastasia

Published 11-11-2024
“…The growing usage of Large Language Models (LLMs) highlights the demands and challenges in scalable LLM inference systems, affecting deployment and development…”

Journal Article

9
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning by Adarsh, Shivam, Shridhar, Kumar, Gulcehre, Caglar, Monath, Nicholas, Sachan, Mrinmaya

Published 24-10-2024
“…Large Language Models (LLMs) can transfer their reasoning skills to smaller models by teaching them to generate the intermediate reasoning process required to…”

Journal Article

10
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis by Wei, Xiuying, Moalla, Skander, Pascanu, Razvan, Gulcehre, Caglar

Published 13-07-2024
“…State-of-the-art LLMs often rely on scale with high computational costs, which has sparked a research agenda to reduce parameter counts and costs without…”

Journal Article

11
HiPPO-Prophecy: State-Space Models can Provably Learn Dynamical Systems in Context by Joseph, Federico Arangath, Haefeli, Kilian Konstantin, Liniger, Noah, Gulcehre, Caglar

Published 12-07-2024
“…This work explores the in-context learning capabilities of State Space Models (SSMs) and presents, to the best of our knowledge, the first theoretical…”

Journal Article

12
Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers by Wei, Xiuying, Moalla, Skander, Pascanu, Razvan, Gulcehre, Caglar

Published 24-06-2024
“…State-of-the-art results in large language models (LLMs) often rely on scale, which becomes computationally expensive. This has sparked a research agenda to…”

Journal Article

13
Aligning Large Language Models with Diverse Political Viewpoints by Stammbach, Dominik, Widmer, Philine, Cho, Eunjung, Gulcehre, Caglar, Ash, Elliott

Published 20-06-2024
“…Large language models such as ChatGPT exhibit striking political biases. If users query them about political information, they often take a normative stance…”

Journal Article

14
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO by Moalla, Skander, Miele, Andrea, Pascanu, Razvan, Gulcehre, Caglar

Published 01-05-2024
“…Reinforcement learning (RL) is inherently rife with non-stationarity since the states and rewards the agent observes during training depend on its changing…”

Journal Article

15
Simple Hierarchical Planning with Diffusion by Chen, Chang, Deng, Fei, Kawaguchi, Kenji, Gulcehre, Caglar, Ahn, Sungjin

Published 05-01-2024
“…Diffusion-based generative methods have proven effective in modeling trajectories with offline datasets. However, they often face computational challenges and…”

Journal Article

16
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models by Kim, Yeongbin, Singh, Gautam, Park, Junyeong, Gulcehre, Caglar, Ahn, Sungjin

Published 15-11-2023
“…Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a…”

Journal Article

17
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders by Surkov, Viacheslav, Wendler, Chris, Terekhov, Mikhail, Deschenaux, Justin, West, Robert, Gulcehre, Caglar

Published 28-10-2024
“…Sparse autoencoders (SAEs) have become a core ingredient in the reverse engineering of large-language models (LLMs). For LLMs, they have been shown to…”

Journal Article

18
Self-Recognition in Language Models by Davidson, Tim R, Surkov, Viacheslav, Veselovsky, Veniamin, Russo, Giuseppe, West, Robert, Gulcehre, Caglar

Published 09-07-2024
“…A rapidly growing number of applications rely on a small set of closed-source language models (LMs). This dependency might introduce novel security risks if…”

Journal Article

19
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer by Chen, Chang, Baek, Junyeob, Deng, Fei, Kawaguchi, Kenji, Gulcehre, Caglar, Ahn, Sungjin

Published 10-06-2024
“…Despite the recent advancements in offline RL, no unified algorithm could achieve superior performance across a broad range of tasks. Offline value…”

Journal Article

20
Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering by Arora, Akhil, Klein, Lars, Potamitis, Nearchos, Aydin, Roland, Gulcehre, Caglar, West, Robert

Published 07-05-2024
“…Large language models (LLMs) have significantly evolved, moving from simple output generation to complex reasoning and from stand-alone usage to being embedded…”

Journal Article


