Search Results - "Cun, Xiaodong"

Refine Results
  1. 1

    A shell dataset, for shell features extraction and recognition by Zhang, Qi, Zhou, Jianhang, He, Jing, Cun, Xiaodong, Zeng, Shaoning, Zhang, Bob

    Published in Scientific data (22-10-2019)
    “…Shells are very common objects in the world, often used for decorations, collections, academic research, etc. With tens of thousands of species, shells are not…”
    Get full text
    Journal Article
  2. 2

    Uformer: A General U-Shaped Transformer for Image Restoration by Wang, Zhendong, Cun, Xiaodong, Bao, Jianmin, Zhou, Wengang, Liu, Jianzhuang, Li, Houqiang

    “…In this paper, we present Uformer, an effective and efficient Transformer-based architecture for image restoration, in which we build a hierarchical…”
    Get full text
    Conference Proceeding
  3. 3

    Improving the Harmony of the Composite Image by Spatial-Separated Attention Module by Cun, Xiaodong, Pun, Chi-Man

    Published in IEEE transactions on image processing (01-01-2020)
    “…Image composition is one of the most important applications in image processing. However, the inharmonious appearance between the spliced region and background…”
    Get full text
    Journal Article
  4. 4

    DH-GAN: Image manipulation localization via a dual homology-aware generative adversarial network by Liu, Weihuang, Cun, Xiaodong, Pun, Chi-Man

    Published in Pattern recognition (01-11-2024)
    “…Image manipulation localization is a binary segmentation task that sensitive to the tampered artifacts other than awareness of the object. Thus, both…”
    Get full text
    Journal Article
  5. 5

    Generating Human Motion from Textual Descriptions with Discrete Representations by Zhang, Jianrong, Zhang, Yangsong, Cun, Xiaodong, Zhang, Yong, Zhao, Hongwei, Lu, Hongtao, Shen, Xi, Ying, Shan

    “…In this work, we investigate a simple and must-known conditional generative framework based on Vector Quantised-Variational AutoEncoder (VQ-VAE) and Generative…”
    Get full text
    Conference Proceeding
  6. 6

    Explicit Visual Prompting for Low-Level Structure Segmentations by Liu, Weihuang, Shen, Xi, Pun, Chi-Man, Cun, Xiaodong

    “…We consider the generic problem of detecting low-level structures in images, which includes segmenting the manipulated parts, identifying out-of-focus pixels,…”
    Get full text
    Conference Proceeding
  7. 7

    SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation by Zhang, Wenxuan, Cun, Xiaodong, Wang, Xuan, Zhang, Yong, Shen, Xi, Guo, Yu, Shan, Ying, Wang, Fei

    “…Generating talking head videos through a face image and a piece of speech audio still contains many challenges. i.e., unnatural head movement, distorted…”
    Get full text
    Conference Proceeding
  8. 8

    Sketch Video Synthesis by Zheng, Yudian, Cun, Xiaodong, Xia, Menghan, Pun, Chi‐Man

    Published in Computer graphics forum (01-05-2024)
    “…Understanding semantic intricacies and high‐level concepts is essential in image sketch generation, and this challenge becomes even more formidable when…”
    Get full text
    Journal Article
  9. 9

    CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior by Xing, Jinbo, Xia, Menghan, Zhang, Yuechen, Cun, Xiaodong, Wang, Jue, Wong, Tien-Tsin

    “…Speech-driven 3D facial animation has been widely studied, yet there is still a gap to achieving realism and vividness due to the highly ill-posed nature and…”
    Get full text
    Conference Proceeding
  10. 10

    Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance by Xing, Jinbo, Xia, Menghan, Liu, Yuxin, Zhang, Yuechen, Zhang, Yong, He, Yingqing, Liu, Hanyuan, Chen, Haoxin, Cun, Xiaodong, Wang, Xintao, Shan, Ying, Wong, Tien-Tsin

    “…Creating a vivid video from the event or scenario in our imagination is a truly fascinating experience. Recent advancements in text-to-video synthesis have…”
    Get full text
    Journal Article
  11. 11

    Applying stochastic second-order entropy images to multi-modal image registration by Cun, Xiaodong, Pun, Chi-Man, Gao, Hao

    Published in Signal processing. Image communication (01-07-2018)
    “…Which metric to use for multi-modal image registration is still a nontrivial research problem. Recently, some methods have used structural representations of…”
    Get full text
    Journal Article
  12. 12

    3D GAN Inversion with Facial Symmetry Prior by Yin, Fei, Zhang, Yong, Wang, Xuan, Wang, Tengfei, Li, Xiaoyu, Gong, Yuan, Fan, Yanbo, Cun, Xiaodong, Shan, Ying, Oztireli, Cengiz, Yang, Yujiu

    “…Recently, a surge of high-quality 3D-aware GANs have been proposed, which leverage the generative power of neural rendering. It is natural to associate 3D GANs…”
    Get full text
    Conference Proceeding
  13. 13

    Shadocnet: Learning Spatial-Aware Tokens in Transformer for Document Shadow Removal by Chen, Xuhang, Cun, Xiaodong, Pun, Chi-Man, Wang, Shuqiang

    “…Shadow removal improves the visual quality and legibility of digital copies of documents. However, document shadow removal remains an unresolved subject…”
    Get full text
    Conference Proceeding
  14. 14

    DPE: Disentanglement of Pose and Expression for General Video Portrait Editing by Pang, Youxin, Zhang, Yong, Quan, Weize, Fan, Yanbo, Cun, Xiaodong, Shan, Ying, Yan, Dong-Ming

    “…One-shot video-driven talking face generation aims at producing a synthetic talking video by transferring the facial motion from a video to an arbitrary…”
    Get full text
    Conference Proceeding
  15. 15

    ForgeryTTT: Zero-Shot Image Manipulation Localization with Test-Time Training by Liu, Weihuang, Shen, Xi, Pun, Chi-Man, Cun, Xiaodong

    Published 05-10-2024
    “…Social media is increasingly plagued by realistic fake images, making it hard to trust content. Previous algorithms to detect these fakes often fail in new,…”
    Get full text
    Journal Article
  16. 16

    ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation by Yang, Shaoshu, Zhang, Yong, Cun, Xiaodong, Shan, Ying, He, Ran

    Published 02-06-2024
    “…Video generation has made remarkable progress in recent years, especially since the advent of the video diffusion models. Many video generation models can…”
    Get full text
    Journal Article
  17. 17

    Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal by Cun, Xiaodong, Pun, Chi-Man

    Published 13-12-2020
    “…Digital watermark is a commonly used technique to protect the copyright of medias. Simultaneously, to increase the robustness of watermark, attacking…”
    Get full text
    Journal Article
  18. 18

    Sketch Video Synthesis by Zheng, Yudian, Cun, Xiaodong, Xia, Menghan, Pun, Chi-Man

    Published 26-11-2023
    “…Understanding semantic intricacies and high-level concepts is essential in image sketch generation, and this challenge becomes even more formidable when…”
    Get full text
    Journal Article
  19. 19

    Defocus Blur Detection via Depth Distillation by Cun, Xiaodong, Pun, Chi-Man

    Published 16-07-2020
    “…Defocus Blur Detection(DBD) aims to separate in-focus and out-of-focus regions from a single image pixel-wisely. This task has been paid much attention since…”
    Get full text
    Journal Article
  20. 20

    High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net by Li, Zinuo, Chen, Xuhang, Pun, Chi-Man, Cun, Xiaodong

    Published 27-08-2023
    “…Shadows often occur when we capture the documents with casual equipment, which influences the visual quality and readability of the digital copies. Different…”
    Get full text
    Journal Article