Search Results - "Mukhopadhyay, Rudrabha"
1
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Published in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (01-01-2020)“…Humans involuntarily tend to infer parts of the conversation from lip movements when the speech is absent or corrupted by external noise. In this work, we…”
Conference Proceeding
2
Audio-Visual Face Reenactment
Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)“…This work proposes a novel method to generate realistic talking head videos using audio and visual streams. We animate a source image by transferring head…”
Conference Proceeding
3
FaceOff: A Video-to-Video Face Swapping System
Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)“…Doubles play an indispensable role in the movie industry. They take the place of the actors in dangerous stunt scenes or scenes where the same actor plays…”
Conference Proceeding
4
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)“…Many people with some form of hearing loss consider lipreading as their primary mode of day-to-day communication. However, finding resources to learn or…”
Conference Proceeding
5
Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronization
Published in 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (01-01-2023)“…Talking-face video generation works have achieved state-of-the-art results in synthesizing videos with lip synchronization. However, most of the previous works…”
Conference Proceeding
6
Visual Speech Enhancement Without A Real Visual Stream
Published in 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) (01-01-2021)“…In this work, we re-think the task of speech enhancement in unconstrained real-world environments. Current state-of-the-art methods use only the audio stream…”
Conference Proceeding
7
2D-3D CNN Based Architectures for Spectral Reconstruction from RGB Images
Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2018)“…Hyperspectral cameras are used to preserve fine spectral details of scenes that are not captured by traditional RGB cameras that comprehensively quantizes…”
Conference Proceeding
8
NTIRE 2018 Challenge on Spectral Reconstruction from RGB Images
Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2018)“…This paper reviews the first challenge on spectral image reconstruction from RGB images, i.e., the recovery of whole-scene hyperspectral (HS) information from…”
Conference Proceeding
9
Audio-Visual Face Reenactment
Published 06-10-2022“…This work proposes a novel method to generate realistic talking head videos using audio and visual streams. We animate a source image by transferring head…”
Journal Article
10
Towards Accurate Lip-to-Speech Synthesis in-the-Wild
Published 02-03-2024“…In Proceedings of the 31st ACM International Conference on Multimedia, 2023. In this paper, we introduce a novel approach to address the task of synthesizing…”
Journal Article
11
Compressing Video Calls using Synthetic Talking Heads
Published 07-10-2022“…We leverage the modern advancements in talking head generation to propose an end-to-end system for talking head video compression. Our algorithm transmits…”
Journal Article
12
FaceOff: A Video-to-Video Face Swapping System
Published 20-08-2022“…Doubles play an indispensable role in the movie industry. They take the place of the actors in dangerous stunt scenes or scenes where the same actor plays…”
Journal Article
13
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Published 20-08-2022“…Many people with some form of hearing loss consider lipreading as their primary mode of day-to-day communication. However, finding resources to learn or…”
Journal Article
14
Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors
Published 17-08-2022“…In this paper, we explore an interesting question of what can be obtained from an $8\times8$ pixel video sequence. Surprisingly, it turns out to be quite a…”
Journal Article
15
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
Published 23-08-2020“…In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at…”
Journal Article
16
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Published 17-05-2020“…Humans involuntarily tend to infer parts of the conversation from lip movements when the speech is absent or corrupted by external noise. In this work, we…”
Journal Article
17
Personalized One-Shot Lipreading for an ALS Patient
Published 02-11-2021“…BMVC 2021. Lipreading or visually recognizing speech from the mouth movements of a speaker is a challenging and mentally taxing task. Unfortunately, multiple…”
Journal Article
18
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
Published 01-09-2022“…In this work, we address the problem of generating speech from silent lip videos for any speaker in the wild. In stark contrast to previous works, our method…”
Journal Article
19
Towards Automatic Speech to Sign Language Generation
Published 24-06-2021“…We aim to solve the highly challenging task of generating continuous sign language videos solely from speech segments for the first time. Recent efforts in…”
Journal Article
20
NTIRE 2019 Challenge on Video Super-Resolution: Methods and Results
Published in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-06-2019)“…This paper reviews the first NTIRE challenge on video super-resolution (restoration of rich details in low-resolution video frames) with focus on proposed…”
Conference Proceeding