Search Results - "Caron, Mathilde"
-
1
Unsupervised Pre-Training of Image Features on Non-Curated Data
Published in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2019)“…Pre-training general-purpose visual features with convolutional neural networks without relying on annotations is a challenging and important task. Most recent…”
Get full text
Conference Proceeding -
2
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training
Published in IEEE transactions on pattern analysis and machine intelligence (01-04-2023)“…We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a…”
Get full text
Journal Article -
3
Emerging Properties in Self-Supervised Vision Transformers
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) [16] that stand out compared to convolutional…”
Get full text
Conference Proceeding -
4
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Published in 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (03-01-2024)“…Pixel-level labels are particularly expensive to acquire. Hence, pretraining is a critical step to improve models on a task like semantic segmentation…”
Get full text
Conference Proceeding -
5
Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…This paper proposes a novel method of learning by predicting view assignments with support samples (PAWS). The method trains a model to minimize a consistency…”
Get full text
Conference Proceeding -
6
FlexiViT: One Model for All Patch Sizes
Published in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-06-2023)“…Vision Transformers convert images to sequences by slicing them into patches. The size of these patches controls a speed/accuracy tradeoff, with smaller…”
Get full text
Conference Proceeding -
7
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
Published 31-10-2024“…Web-scale visual entity recognition, the task of associating images with their corresponding entities within vast knowledge bases like Wikipedia, presents…”
Get full text
Journal Article -
8
Self-Masking Networks for Unsupervised Adaptation
Published 11-09-2024“…With the advent of billion-parameter foundation models, efficient fine-tuning has become increasingly important for the adaptation of models to downstream…”
Get full text
Journal Article -
9
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Published 05-12-2022“…Pixel-level labels are particularly expensive to acquire. Hence, pretraining is a critical step to improve models on a task like semantic segmentation…”
Get full text
Journal Article -
10
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16-06-2024)“…In this paper, we address web-scale visual entity recognition, specifically the task of mapping a given query image to one of the 6 million existing entities…”
Get full text
Conference Proceeding -
11
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Published 04-03-2024“…In this paper, we address web-scale visual entity recognition, specifically the task of mapping a given query image to one of the 6 million existing entities…”
Get full text
Journal Article -
12
Retrieval-Enhanced Contrastive Vision-Text Models
Published 12-06-2023“…Contrastive image-text models such as CLIP form the building blocks of many state-of-the-art systems. While they excel at recognizing common generic concepts,…”
Get full text
Journal Article -
13
Verbs in Action: Improving verb understanding in video-language models
Published 13-04-2023“…Understanding verbs is crucial to modelling how people and objects interact with each other and the environment through space and time. Recently,…”
Get full text
Journal Article -
14
A Memory Transformer Network for Incremental Learning
Published 10-10-2022“…We study class-incremental learning, a training setup in which new classes of data are observed over time for the model to learn from. Despite the…”
Get full text
Journal Article -
15
Guided Diffusion from Self-Supervised Diffusion Features
Published 14-12-2023“…Guidance serves as a key concept in diffusion models, yet its effectiveness is often limited by the need for extra data annotation or classifier pretraining…”
Get full text
Journal Article -
16
Weakly-Supervised Surgical Phase Recognition
Published 26-10-2023“…A key element of computer-assisted surgery systems is phase recognition of surgical videos. Existing phase recognition algorithms require frame-wise annotation…”
Get full text
Journal Article -
17
Self-Supervised Learning for Endoscopic Video Analysis
Published 23-08-2023“…MICCAI 2023 Self-supervised learning (SSL) has led to important breakthroughs in computer vision by allowing learning from large amounts of unlabeled data. As…”
Get full text
Journal Article -
18
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Published 16-02-2022“…Discriminative self-supervised learning allows training models on any random group of internet images, and possibly recover salient information that helps…”
Get full text
Journal Article -
19
FlexiViT: One Model for All Patch Sizes
Published 15-12-2022“…Vision Transformers convert images to sequences by slicing them into patches. The size of these patches controls a speed/accuracy tradeoff, with smaller…”
Get full text
Journal Article -
20
Unsupervised Dense Information Retrieval with Contrastive Learning
Published 16-12-2021“…Recently, information retrieval has seen the emergence of dense retrievers, using neural networks, as an alternative to classical sparse methods based on…”
Get full text
Journal Article