Search Results - "Gu, Rongzhi"

1
The Sound Demixing Challenge 2023 – Cinematic Demixing Track by Uhlich, Stefan, Fabbro, Giorgio, Hirano, Masato, Takahashi, Shusuke, Wichern, Gordon, Le Roux, Jonathan, Chakraborty, Dipam, Mohanty, Sharada, Li, Kai, Luo, Yi, Yu, Jianwei, Gu, Rongzhi, Solovyev, Roman, Stempkovskiy, Alexander, Habruseva, Tatiana, Sukhovei, Mikhail, Mitsufuji, Yuki

Published in Transactions of the International Society for Music Information Retrieval (17-04-2024)
“…This paper summarizes the cinematic demixing (CDX) track of the Sound Demixing Challenge 2023 (SDX’23). We provide a comprehensive summary of the challenge…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
ReZero: Region-Customizable Sound Extraction by Gu, Rongzhi, Luo, Yi

Published in IEEE/ACM transactions on audio, speech, and language processing (2024)
“…We introduce region-customizable sound extraction (ReZero), a general and flexible framework for the multi-channel region-wise sound extraction (R-SE) task…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Improving Music Source Separation with Simo Stereo Band-Split Rnn by Luo, Yi, Gu, Rongzhi

Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14-04-2024)
“…With the recent developments of novel neural network designs, the state-of-the-art of music source separation systems has been significantly advanced. For…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
4
Multi-Modal Multi-Channel Target Speech Separation by Gu, Rongzhi, Zhang, Shi-Xiong, Xu, Yong, Chen, Lianwu, Zou, Yuexian, Yu, Dong

Published in IEEE journal of selected topics in signal processing (01-03-2020)
“…Target speech separation refers to extracting a target speaker's voice from an overlapped audio of simultaneous talkers. Previously the use of visual modality…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain by Gu, Rongzhi, Zhang, Shi-Xiong, Zou, Yuexian, Yu, Dong

Published in IEEE signal processing letters (2021)
“…To date, mainstream target speech separation (TSS) approaches are formulated to estimate the complex ratio mask (cRM) of target speech in time-frequency domain…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention by Xu, Xinmeng, Gu, Rongzhi, Zou, Yuexian

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Hand-crafted spatial features, such as inter-channel intensity difference (IID) and inter-channel phase difference (IPD), play a fundamental role in recent…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
7
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning by Gu, Rongzhi, Zhang, Shi-Xiong, Chen, Lianwu, Xu, Yong, Yu, Meng, Su, Dan, Zou, Yuexian, Yu, Dong

Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01-05-2020)
“…Hand-crafted spatial features (e.g., inter-channel phase difference, IPD) play a fundamental role in recent deep learning based multi-channel speech separation…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
8
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation by Gu, Rongzhi, Zhang, Shi-Xiong, Zou, Yuexian, Yu, Dong

Published in IEEE/ACM transactions on audio, speech, and language processing (2023)
“…Recently, frequency domain all-neural beamforming methods have achieved remarkable progress for multichannel speech separation. In parallel, the integration of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Interest degree of products analysis by RFID technology for offline shops marketing optimization by Xin Su, Rongzhi Gu, Can Qi, Xuewu Zhang, Dongmin Choi

Published in 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) (01-10-2016)
“…With the booming of e-commerce, salerooms of online shops have affected the offline shops marketing severely. It requires a smart system to help dealers to…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
10
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge by Yu, Jianwei, Chen, Hangting, Luo, Yi, Gu, Rongzhi, Li, Weihua, Weng, Chao

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…This report presents the development of Tencent AI Lab's personalized speech enhancement system for the 2023 ICASSP Signal Processing Grand Challenge - deep…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
11
Luminescence property, energy transfer and thermal property of color tunable phosphor Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2 by Gu, Rongzhi, Guan, Mengyuan, Jiang, Na, Yuan, Tianyu, Ma, Guoquan, Wang, Chao, Yang, Zhiping, Li, Panlai, Wang, Zhijun

Published in Journal of alloys and compounds (15-02-2019)
“…Series of Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2+ phosphors are synthesized by a high temperature solid state method. The spectral property and…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Luminescence property, energy transfer and thermal property of color tunable phosphor Ca^sub 9-w^Ce^sub 0.5^Y^sub 0.5-x-y-z^(PO^sub 4^)^sub 7^:xTb^sup 3+^, yEu^sup 3+^, zSm^sup 3+^, wMn^sup 2 by Gu, Rongzhi, Guan, Mengyuan, Jiang, Na, Yuan, Tianyu, Ma, Guoquan, Wang, Chao, Yang, Zhiping, Li, Panlai, Wang, Zhijun

Published in Journal of alloys and compounds (15-02-2019)
“…Series of Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2+ phosphors are synthesized by a high temperature solid state method. The spectral property and…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
Luminescence property, energy transfer and thermal property of color tunable phosphor Ca9-Ce0.5Y0.5---(PO4)7:xTb3+, yEu3+, zSm3+, wMn2 by Gu, Rongzhi, Guan, Mengyuan, Jiang, Na, Yuan, Tianyu, Ma, Guoquan, Wang, Chao, Yang, Zhiping, Li, Panlai, Wang, Zhijun

Published in Journal of alloys and compounds (01-02-2019)

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
Learning Decoupling Features Through Orthogonality Regularization by Wang, Li, Gu, Rongzhi, Zhuang, Weiji, Gao, Peng, Wang, Yujun, Zou, Yuexian

Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23-05-2022)
“…Keyword spotting (KWS) and speaker verification (SV) are two important tasks in speech applications. Research shows that the state-of-art KWS and SV models are…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
15
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04-06-2023)
“…Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
Fast Random Approximation of Multi-Channel Room Impulse Response by Luo, Yi, Gu, Rongzhi

Published in 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (14-04-2024)
“…The training of modern neural-network-based speech processing systems typically requires a large amount of reverberant data to make the systems robust against…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
ReZero: Region-customizable Sound Extraction by Gu, Rongzhi, Luo, Yi

Published 31-08-2023
“…We introduce region-customizable sound extraction (ReZero), a general and flexible framework for the multi-channel region-wise sound extraction (R-SE) task…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Fast Random Approximation of Multi-channel Room Impulse Response by Luo, Yi, Gu, Rongzhi

Published 17-04-2023
“…Modern neural-network-based speech processing systems are typically required to be robust against reverberation, and the training of such systems thus needs a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty by Gu, Rongzhi, Zhang, Shi-Xiong, Yu, Dong

Published 26-02-2023
“…Multi-channel speech separation using speaker's directional information has demonstrated significant gains over blind speech separation. However, it has two…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation by Gu, Rongzhi, Zou, Yuexian

Published 02-01-2020
“…Target speech separation refers to extracting the target speaker's speech from mixed signals. Despite the recent advances in deep learning based close-talk…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Gu, Rongzhi"

ReZero: Region-Customizable Sound Extraction by Gu, Rongzhi, Luo, Yi

Improving Music Source Separation with Simo Stereo Band-Split Rnn by Luo, Yi, Gu, Rongzhi

Multi-Modal Multi-Channel Target Speech Separation by Gu, Rongzhi, Zhang, Shi-Xiong, Xu, Yong, Chen, Lianwu, Zou, Yuexian, Yu, Dong

Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain by Gu, Rongzhi, Zhang, Shi-Xiong, Zou, Yuexian, Yu, Dong

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention by Xu, Xinmeng, Gu, Rongzhi, Zou, Yuexian

Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning by Gu, Rongzhi, Zhang, Shi-Xiong, Chen, Lianwu, Xu, Yong, Yu, Meng, Su, Dan, Zou, Yuexian, Yu, Dong

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation by Gu, Rongzhi, Zhang, Shi-Xiong, Zou, Yuexian, Yu, Dong

Interest degree of products analysis by RFID technology for offline shops marketing optimization by Xin Su, Rongzhi Gu, Can Qi, Xuewu Zhang, Dongmin Choi

TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge by Yu, Jianwei, Chen, Hangting, Luo, Yi, Gu, Rongzhi, Li, Weihua, Weng, Chao

Luminescence property, energy transfer and thermal property of color tunable phosphor Ca9-wCe0.5Y0.5-x-y-z(PO4)7:xTb3+, yEu3+, zSm3+, wMn2 by Gu, Rongzhi, Guan, Mengyuan, Jiang, Na, Yuan, Tianyu, Ma, Guoquan, Wang, Chao, Yang, Zhiping, Li, Panlai, Wang, Zhijun

Luminescence property, energy transfer and thermal property of color tunable phosphor Ca9-Ce0.5Y0.5---(PO4)7:xTb3+, yEu3+, zSm3+, wMn2 by Gu, Rongzhi, Guan, Mengyuan, Jiang, Na, Yuan, Tianyu, Ma, Guoquan, Wang, Chao, Yang, Zhiping, Li, Panlai, Wang, Zhijun

Learning Decoupling Features Through Orthogonality Regularization by Wang, Li, Gu, Rongzhi, Zhuang, Weiji, Gao, Peng, Wang, Yujun, Zou, Yuexian

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters by Peng, Junyi, Stafylakis, Themos, Gu, Rongzhi, Plchot, Oldrich, Mosner, Ladislav, Burget, Lukas, Cernocky, Jan

Fast Random Approximation of Multi-Channel Room Impulse Response by Luo, Yi, Gu, Rongzhi

ReZero: Region-customizable Sound Extraction by Gu, Rongzhi, Luo, Yi

Fast Random Approximation of Multi-channel Room Impulse Response by Luo, Yi, Gu, Rongzhi

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty by Gu, Rongzhi, Zhang, Shi-Xiong, Yu, Dong

Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation by Gu, Rongzhi, Zou, Yuexian

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication