Search Results - "Gu, Rongzhi"
-
1
The Sound Demixing Challenge 2023 – Cinematic Demixing Track
Published in Transactions of the International Society for Music Information Retrieval (17-04-2024)“…This paper summarizes the cinematic demixing (CDX) track of the Sound Demixing Challenge 2023 (SDX’23). We provide a comprehensive summary of the challenge…”
Get full text
Journal Article -
2
ReZero: Region-Customizable Sound Extraction
Published in IEEE/ACM transactions on audio, speech, and language processing (2024)“…We introduce region-customizable sound extraction (ReZero), a general and flexible framework for the multi-channel region-wise sound extraction (R-SE) task…”
Get full text
Journal Article -
3
Improving Music Source Separation with Simo Stereo Band-Split Rnn
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)“…With the recent developments of novel neural network designs, the state-of-the-art of music source separation systems has been significantly advanced. For…”
Get full text
Conference Proceeding -
4
Multi-Modal Multi-Channel Target Speech Separation
Published in IEEE journal of selected topics in signal processing (01-03-2020)“…Target speech separation refers to extracting a target speaker's voice from an overlapped audio of simultaneous talkers. Previously the use of visual modality…”
Get full text
Journal Article -
5
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain
Published in IEEE signal processing letters (2021)“…To date, mainstream target speech separation (TSS) approaches are formulated to estimate the complex ratio mask (cRM) of target speech in time-frequency domain…”
Get full text
Journal Article -
6
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Hand-crafted spatial features, such as inter-channel intensity difference (IID) and inter-channel phase difference (IPD), play a fundamental role in recent…”
Get full text
Conference Proceeding -
7
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)“…Hand-crafted spatial features (e.g., inter-channel phase difference, IPD) play a fundamental role in recent deep learning based multi-channel speech separation…”
Get full text
Conference Proceeding -
8
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Published in IEEE/ACM transactions on audio, speech, and language processing (2023)“…Recently, frequency domain all-neural beamforming methods have achieved remarkable progress for multichannel speech separation. In parallel, the integration of…”
Get full text
Journal Article -
9
Interest degree of products analysis by RFID technology for offline shops marketing optimization
Published in 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) (01-10-2016)“…With the booming of e-commerce, salerooms of online shops have affected the offline shops marketing severely. It requires a smart system to help dealers to…”
Get full text
Conference Proceeding -
10
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…This report presents the development of Tencent AI Lab's personalized speech enhancement system for the 2023 ICASSP Signal Processing Grand Challenge - deep…”
Get full text
Conference Proceeding -
11
Luminescence property, energy transfer and thermal property of color tunable phosphor Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2
Published in Journal of alloys and compounds (15-02-2019)“…Series of Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2+ phosphors are synthesized by a high temperature solid state method. The spectral property and…”
Get full text
Journal Article -
12
Luminescence property, energy transfer and thermal property of color tunable phosphor Ca^sub 9-w^Ce^sub 0.5^Y^sub 0.5-x-y-z^(PO^sub 4^)^sub 7^:xTb^sup 3+^, yEu^sup 3+^, zSm^sup 3+^, wMn^sup 2
Published in Journal of alloys and compounds (15-02-2019)“…Series of Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2+ phosphors are synthesized by a high temperature solid state method. The spectral property and…”
Get full text
Journal Article -
13
-
14
Learning Decoupling Features Through Orthogonality Regularization
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)“…Keyword spotting (KWS) and speaker verification (SV) are two important tasks in speech applications. Research shows that the state-of-art KWS and SV models are…”
Get full text
Conference Proceeding -
15
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”
Get full text
Conference Proceeding -
16
Fast Random Approximation of Multi-Channel Room Impulse Response
Published in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (14-04-2024)“…The training of modern neural-network-based speech processing systems typically requires a large amount of reverberant data to make the systems robust against…”
Get full text
Conference Proceeding -
17
ReZero: Region-customizable Sound Extraction
Published 31-08-2023“…We introduce region-customizable sound extraction (ReZero), a general and flexible framework for the multi-channel region-wise sound extraction (R-SE) task…”
Get full text
Journal Article -
18
Fast Random Approximation of Multi-channel Room Impulse Response
Published 17-04-2023“…Modern neural-network-based speech processing systems are typically required to be robust against reverberation, and the training of such systems thus needs a…”
Get full text
Journal Article -
19
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Published 26-02-2023“…Multi-channel speech separation using speaker's directional information has demonstrated significant gains over blind speech separation. However, it has two…”
Get full text
Journal Article -
20
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Published 02-01-2020“…Target speech separation refers to extracting the target speaker's speech from mixed signals. Despite the recent advances in deep learning based close-talk…”
Get full text
Journal Article