Search Results - "Kotar, Klemen"
-
1
Interactron: Embodied Adaptive Object Detection
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-06-2022)“…Over the years various methods have been proposed for the problem of object detection. Recently, we have wit-nessed great strides in this domain owing to the…”
Get full text
Conference Proceeding -
2
Contrasting Contrastive Self-Supervised Representation Learning Pipelines
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…In the past few years, we have witnessed remarkable breakthroughs in self-supervised representation learning. Despite the success and adoption of…”
Get full text
Conference Proceeding -
3
Interactron: Embodied Adaptive Object Detection
Published 01-02-2022“…Over the years various methods have been proposed for the problem of object detection. Recently, we have witnessed great strides in this domain owing to the…”
Get full text
Journal Article -
4
ENTL: Embodied Navigation Trajectory Learner
Published 05-04-2023“…We propose Embodied Navigation Trajectory Learner (ENTL), a method for extracting long sequence representations for embodied navigation. Our approach unifies…”
Get full text
Journal Article -
5
WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
Published 05-12-2023“…Training on multiple modalities of input can augment the capabilities of a language model. Here, we ask whether such a training regime can improve the quality…”
Get full text
Journal Article -
6
Are These the Same Apple? Comparing Images Based on Object Intrinsics
Published 01-11-2023“…The human visual system can effortlessly recognize an object under different extrinsic factors such as lighting, object poses, and background, yet current…”
Get full text
Journal Article -
7
Contrasting Contrastive Self-Supervised Representation Learning Pipelines
Published 25-03-2021“…In the past few years, we have witnessed remarkable breakthroughs in self-supervised representation learning. Despite the success and adoption of…”
Get full text
Journal Article -
8
Break and Make: Interactive Structural Understanding Using LEGO Bricks
Published 27-07-2022“…Visual understanding of geometric structures with complex spatial relationships is a fundamental component of human intelligence. As children, we learn how to…”
Get full text
Journal Article -
9
Unifying (Machine) Vision via Counterfactual World Modeling
Published 02-06-2023“…Leading approaches in machine vision employ different architectures for different tasks, trained on costly task-specific labeled datasets. This complexity has…”
Get full text
Journal Article -
10
Understanding Physical Dynamics with Counterfactual World Modeling
Published 10-12-2023“…The ability to understand physical dynamics is critical for agents to act in the world. Here, we use Counterfactual World Modeling (CWM) to extract vision…”
Get full text
Journal Article -
11
AllenAct: A Framework for Embodied AI Research
Published 28-08-2020“…The domain of Embodied AI, in which agents learn to complete tasks through interaction with their environment from egocentric observations, has experienced…”
Get full text
Journal Article