Search Results - "Lei, Stan Weixian"

  • Showing 1 - 7 results of 7
Refine Results
  1. 1

    Generic Event Boundary Detection: A Benchmark for Event Segmentation by Shou, Mike Zheng, Lei, Stan Weixian, Wang, Weiyao, Ghadiyaram, Deepti, Feiszli, Matt

    “…This paper presents a novel task together with a new benchmark for detecting generic, taxonomy-free event boundaries that segment a whole video into chunks…”
    Get full text
    Conference Proceeding
  2. 2

    Too Large; Data Reduction for Vision-Language Pre-Training by Wang, Alex Jinpeng, Lin, Kevin Qinghong, Zhang, David Junhao, Lei, Stan Weixian, Shou, Mike Zheng

    Published 31-05-2023
    “…This paper examines the problems of severe image-text misalignment and high redundancy in the widely-used large-scale Vision-Language Pre-Training (VLP)…”
    Get full text
    Journal Article
  3. 3

    GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval by Wang, Yuxuan, Gao, Difei, Yu, Licheng, Lei, Stan Weixian, Feiszli, Matt, Shou, Mike Zheng

    Published 01-04-2022
    “…Cognitive science has shown that humans perceive videos in terms of events separated by the state changes of dominant subjects. State changes trigger new…”
    Get full text
    Journal Article
  4. 4

    AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant by Wong, Benita, Chen, Joya, Wu, You, Lei, Stan Weixian, Mao, Dongxing, Gao, Difei, Shou, Mike Zheng

    Published 08-03-2022
    “…A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can…”
    Get full text
    Journal Article
  5. 5

    AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant by Lei, Stan Weixian, Gao, Difei, Wang, Yuxuan, Mao, Dongxing, Liang, Zihan, Ran, Lingmin, Shou, Mike Zheng

    Published 29-11-2021
    “…It is still a pipe dream that personal AI assistants on the phone and AR glasses can assist our daily life in addressing our questions like ``how to adjust the…”
    Get full text
    Journal Article
  6. 6

    Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task by Lei, Stan Weixian, Gao, Difei, Wu, Jay Zhangjie, Wang, Yuxuan, Liu, Wei, Zhang, Mengmi, Shou, Mike Zheng

    Published 24-08-2022
    “…VQA is an ambitious task aiming to answer any image-related question. However, in reality, it is hard to build such a system once for all since the needs of…”
    Get full text
    Journal Article
  7. 7

    Generic Event Boundary Detection: A Benchmark for Event Segmentation by Shou, Mike Zheng, Lei, Stan Weixian, Wang, Weiyao, Ghadiyaram, Deepti, Feiszli, Matt

    Published 25-01-2021
    “…This paper presents a novel task together with a new benchmark for detecting generic, taxonomy-free event boundaries that segment a whole video into chunks…”
    Get full text
    Journal Article