Search Results - "Oh, Junhyuk"

Refine Results
  1. 1

    Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network by Seunghoon Hong, Junhyuk Oh, Honglak Lee, Bohyung Han

    “…We propose a novel weakly-supervised semantic segmentation algorithm based on Deep Convolutional Neural Network (DCNN). Contrary to existing weakly-supervised…”
    Get full text
    Conference Proceeding
  2. 2

    A design testbed for eco-friendly corrugated setter tray packaging by Oh, Junhyuk, Choi, Woojin, Jee, Haeseong

    “…Plain and simple, a corrugated cardboard material provides the right level of protection for goods in transit, which helps keep shipping costs down. As made…”
    Get full text
    Journal Article
  3. 3
  4. 4

    Deep Learning‐Assisted Design of Bilayer Nanowire Gratings for High‐Performance MWIR Polarizers by Lee, Junghyun, Oh, Junhyuk, Chi, Hyung‐gun, Lee, Minseok, Hwang, Jehwan, Jeong, Seungjin, Kang, Sang‐Woo, Jee, Haeseong, Bae, Hagyoul, Hyun, Jae‐Sang, Kim, Jun Oh, Kim, Bongjoong

    Published in Advanced materials technologies (07-10-2024)
    “…Optical metamaterials have revolutionized imaging capabilities by manipulating light‐matter interactions at the nanoscale beyond the diffraction limit. Bilayer…”
    Get full text
    Journal Article
  5. 5

    Deep Reinforcement Learning with Plasticity Injection by Nikishin, Evgenii, Oh, Junhyuk, Ostrovski, Georg, Lyle, Clare, Pascanu, Razvan, Dabney, Will, Barreto, André

    Published 24-05-2023
    “…A growing body of evidence suggests that neural networks employed in deep reinforcement learning (RL) gradually lose their plasticity, the ability to learn…”
    Get full text
    Journal Article
  6. 6

    Preference Optimization as Probabilistic Inference by Abdolmaleki, Abbas, Piot, Bilal, Shahriari, Bobak, Springenberg, Jost Tobias, Hertweck, Tim, Joshi, Rishabh, Oh, Junhyuk, Bloesch, Michael, Lampe, Thomas, Heess, Nicolas, Buchli, Jonas, Riedmiller, Martin

    Published 05-10-2024
    “…Existing preference optimization methods are mainly designed for directly learning from human feedback with the assumption that paired examples (preferred vs…”
    Get full text
    Journal Article
  7. 7

    Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies by Sohn, Sungryull, Oh, Junhyuk, Lee, Honglak

    Published 19-07-2018
    “…We introduce a new RL problem where the agent is required to generalize to a previously-unseen environment characterized by a subtask graph which describes a…”
    Get full text
    Journal Article
  8. 8

    Many-Goals Reinforcement Learning by Veeriah, Vivek, Oh, Junhyuk, Singh, Satinder

    Published 22-06-2018
    “…All-goals updating exploits the off-policy nature of Q-learning to update all possible goals an agent could have from each transition in the world, and was…”
    Get full text
    Journal Article
  9. 9

    Introducing Symmetries to Black Box Meta Reinforcement Learning by Kirsch, Louis, Flennerhag, Sebastian, van Hasselt, Hado, Friesen, Abram, Oh, Junhyuk, Chen, Yutian

    Published 22-09-2021
    “…Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the…”
    Get full text
    Journal Article
  10. 10

    On Learning Intrinsic Rewards for Policy Gradient Methods by Zheng, Zeyu, Oh, Junhyuk, Singh, Satinder

    Published 17-04-2018
    “…In many sequential decision making tasks, it is challenging to design reward functions that help an RL agent efficiently learn behavior that is considered good…”
    Get full text
    Journal Article
  11. 11

    Value Prediction Network by Oh, Junhyuk, Singh, Satinder, Lee, Honglak

    Published 11-07-2017
    “…This paper proposes a novel deep reinforcement learning (RL) architecture, called Value Prediction Network (VPN), which integrates model-free and model-based…”
    Get full text
    Journal Article
  12. 12

    Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity by Garnelo, Marta, Czarnecki, Wojciech Marian, Liu, Siqi, Tirumala, Dhruva, Oh, Junhyuk, Gidel, Gauthier, van Hasselt, Hado, Balduzzi, David

    Published 08-10-2021
    “…Strategic diversity is often essential in games: in multi-player games, for example, evaluating a player against a diverse set of strategies will yield a more…”
    Get full text
    Journal Article
  13. 13

    Balancing Constraints and Rewards with Meta-Gradient D4PG by Calian, Dan A, Mankowitz, Daniel J, Zahavy, Tom, Xu, Zhongwen, Oh, Junhyuk, Levine, Nir, Mann, Timothy

    Published 13-10-2020
    “…Deploying Reinforcement Learning (RL) agents to solve real-world applications often requires satisfying complex system constraints. Often the constraint…”
    Get full text
    Journal Article
  14. 14

    Generative Adversarial Self-Imitation Learning by Guo, Yijie, Oh, Junhyuk, Singh, Satinder, Lee, Honglak

    Published 03-12-2018
    “…This paper explores a simple regularizer for reinforcement learning by proposing Generative Adversarial Self-Imitation Learning (GASIL), which encourages the…”
    Get full text
    Journal Article
  15. 15

    Discovering Reinforcement Learning Algorithms by Oh, Junhyuk, Hessel, Matteo, Czarnecki, Wojciech M, Xu, Zhongwen, van Hasselt, Hado, Singh, Satinder, Silver, David

    Published 17-07-2020
    “…Reinforcement learning (RL) algorithms update an agent's parameters according to one of several possible rules, discovered manually through years of research…”
    Get full text
    Journal Article
  16. 16

    Meta-Gradient Reinforcement Learning with an Objective Discovered Online by Xu, Zhongwen, van Hasselt, Hado, Hessel, Matteo, Oh, Junhyuk, Singh, Satinder, Silver, David

    Published 16-07-2020
    “…Deep reinforcement learning includes a broad family of algorithms that parameterise an internal representation, such as a value function or policy, by a deep…”
    Get full text
    Journal Article
  17. 17

    Discovery of Options via Meta-Learned Subgoals by Veeriah, Vivek, Zahavy, Tom, Hessel, Matteo, Xu, Zhongwen, Oh, Junhyuk, Kemaev, Iurii, van Hasselt, Hado, Silver, David, Singh, Satinder

    Published 12-02-2021
    “…Temporal abstractions in the form of options have been shown to help reinforcement learning (RL) agents learn faster. However, despite prior work on this…”
    Get full text
    Journal Article
  18. 18

    Self-Imitation Learning by Oh, Junhyuk, Guo, Yijie, Singh, Satinder, Lee, Honglak

    Published 14-06-2018
    “…This paper proposes Self-Imitation Learning (SIL), a simple off-policy actor-critic algorithm that learns to reproduce the agent's past good decisions. This…”
    Get full text
    Journal Article
  19. 19

    In-context Reinforcement Learning with Algorithm Distillation by Laskin, Michael, Wang, Luyu, Oh, Junhyuk, Parisotto, Emilio, Spencer, Stephen, Steigerwald, Richie, Strouse, DJ, Hansen, Steven, Filos, Angelos, Brooks, Ethan, Gazeau, Maxime, Sahni, Himanshu, Singh, Satinder, Mnih, Volodymyr

    Published 25-10-2022
    “…We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training…”
    Get full text
    Journal Article
  20. 20

    A Self-Tuning Actor-Critic Algorithm by Zahavy, Tom, Xu, Zhongwen, Veeriah, Vivek, Hessel, Matteo, Oh, Junhyuk, van Hasselt, Hado, Silver, David, Singh, Satinder

    Published 28-02-2020
    “…Reinforcement learning algorithms are highly sensitive to the choice of hyperparameters, typically requiring significant manual effort to identify…”
    Get full text
    Journal Article