Search Results - "Lei, Stan Weixian"
-
1
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Published in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (01-10-2021)“…This paper presents a novel task together with a new benchmark for detecting generic, taxonomy-free event boundaries that segment a whole video into chunks…”
Get full text
Conference Proceeding -
2
Too Large; Data Reduction for Vision-Language Pre-Training
Published 31-05-2023“…This paper examines the problems of severe image-text misalignment and high redundancy in the widely-used large-scale Vision-Language Pre-Training (VLP)…”
Get full text
Journal Article -
3
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval
Published 01-04-2022“…Cognitive science has shown that humans perceive videos in terms of events separated by the state changes of dominant subjects. State changes trigger new…”
Get full text
Journal Article -
4
AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
Published 08-03-2022“…A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can…”
Get full text
Journal Article -
5
AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
Published 29-11-2021“…It is still a pipe dream that personal AI assistants on the phone and AR glasses can assist our daily life in addressing our questions like ``how to adjust the…”
Get full text
Journal Article -
6
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Published 24-08-2022“…VQA is an ambitious task aiming to answer any image-related question. However, in reality, it is hard to build such a system once for all since the needs of…”
Get full text
Journal Article -
7
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Published 25-01-2021“…This paper presents a novel task together with a new benchmark for detecting generic, taxonomy-free event boundaries that segment a whole video into chunks…”
Get full text
Journal Article