Search Results - "Han, Yizeng"
-
1
Dynamic Neural Networks: A Survey
Published in IEEE transactions on pattern analysis and machine intelligence (01-11-2022)“…Dynamic neural network is an emerging research topic in deep learning. Compared to static models which have fixed computational graphs and parameters at the…”
Get full text
Journal Article -
2
Resolution Adaptive Networks for Efficient Inference
Published in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-06-2020)“…Adaptive inference is an effective mechanism to achieve a dynamic tradeoff between accuracy and computational cost in deep networks. Existing works mainly…”
Get full text
Conference Proceeding -
3
Fine-grained Recognition with Learnable Semantic Data Augmentation
Published in IEEE transactions on image processing (01-01-2024)“…Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories…”
Get full text
Journal Article -
4
Latency-Aware Unified Dynamic Networks for Efficient Image Recognition
Published in IEEE transactions on pattern analysis and machine intelligence (01-12-2024)“…Dynamic networks have become a pivotal area of study in deep learning due to their ability to selectively activate computing units (such as layers or channels)…”
Get full text
Journal Article -
5
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training
Published in IEEE transactions on pattern analysis and machine intelligence (01-12-2024)“…The superior performance of modern computer vision backbones (e.g., vision Transformers learned on ImageNet-1 K/22 K) usually comes with a costly training…”
Get full text
Journal Article -
6
Spatially Adaptive Feature Refinement for Efficient Inference
Published in IEEE transactions on image processing (2021)“…Spatial redundancy commonly exists in the learned representations of convolutional neural networks (CNNs), leading to unnecessary computation on…”
Get full text
Journal Article -
7
Adaptive Focus for Efficient Video Recognition
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…In this paper, we explore the spatial redundancy in video recognition with the aim to improve the computational efficiency. It is observed that the most…”
Get full text
Conference Proceeding -
8
OStr-DARTS: Differentiable Neural Architecture Search Based on Operation Strength
Published in IEEE transactions on cybernetics (01-11-2024)“…Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to…”
Get full text
Journal Article -
9
Towards Learning Spatially Discriminative Feature Representations
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…The backbone of traditional CNN classifier is generally considered as a feature extractor, followed by a linear layer which performs the classification. We…”
Get full text
Conference Proceeding -
10
SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning
Published 20-02-2024“…Recent advancements in semi-supervised learning have focused on a more realistic yet challenging task: addressing imbalances in labeled data while the class…”
Get full text
Journal Article -
11
FLatten Transformer: Vision Transformer using Focused Linear Attention
Published 01-08-2023“…The quadratic computation complexity of self-attention has been a persistent challenge when applying Transformer models to vision tasks. Linear attention, on…”
Get full text
Journal Article -
12
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Published 11-11-2024“…Recently, token-based generation have demonstrated their effectiveness in image synthesis. As a representative example, non-autoregressive Transformers (NATs)…”
Get full text
Journal Article -
13
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Published 04-11-2024“…MLLMs have demonstrated remarkable comprehension and reasoning capabilities with complex language and visual data. These advances have spurred the vision of…”
Get full text
Journal Article -
14
Exploring contextual modeling with linear complexity for point cloud segmentation
Published 28-10-2024“…Point cloud segmentation is an important topic in 3D understanding that has traditionally has been tackled using either the CNN or Transformer. Recently, Mamba…”
Get full text
Journal Article -
15
Dynamic Diffusion Transformer
Published 04-10-2024“…Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational…”
Get full text
Journal Article -
16
Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
Published 24-09-2024“…Open-vocabulary panoptic segmentation is an emerging task aiming to accurately segment the image into semantically meaningful masks based on a set of texts…”
Get full text
Journal Article -
17
OStr-DARTS: Differentiable Neural Architecture Search based on Operation Strength
Published 22-09-2024“…Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to…”
Get full text
Journal Article -
18
UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation
Published 29-07-2024“…Test-Time Adaptation (TTA) aims to adapt pre-trained models to the target domain during testing. In reality, this adaptability can be influenced by multiple…”
Get full text
Journal Article -
19
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Published 03-07-2024“…Recent proposed neural network-based Temporal Action Detection (TAD) models are inherently limited to extracting the discriminative representations and…”
Get full text
Journal Article -
20
Mask Grounding for Referring Image Segmentation
Published in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (16-06-2024)“…Referring Image Segmentation (RIS) is a challenging task that requires an algorithm to segment objects referred by free-form language expressions. Despite…”
Get full text
Conference Proceeding