A Hybrid Approach for Co-Channel Speech Segregation based on CASA, HMM Multipitch Tracking, and Medium Frame Harmonic Model
International Journal of Advanced Computer Science and Applications (IJACSA)Volume 4 Issue 7, 2013 This paper proposes a hybrid approach for co-channel speech segregation. HMM (hidden Markov model) is used to track the pitches of 2 talkers. The resulting pitch tracks are then enriched with the promi...
Saved in:
Main Authors: | , |
---|---|
Format: | Journal Article |
Language: | English |
Published: |
15-12-2013
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | International Journal of Advanced Computer Science and
Applications (IJACSA)Volume 4 Issue 7, 2013 This paper proposes a hybrid approach for co-channel speech segregation. HMM
(hidden Markov model) is used to track the pitches of 2 talkers. The resulting
pitch tracks are then enriched with the prominent pitch. The enriched tracks
are correctly grouped using pitch continuity. Medium frame harmonics are used
to extract the second pitch for frames with only one pitch deduced using the
previous steps. Finally, the pitch tracks are input to CASA (computational
auditory scene analysis) to segregate the mixed speech. The center frequency
range of the gamma tone filter banks is maximized to reduce the overlap between
the channels filtered for better segregation. Experiments were conducted using
this hybrid approach on the speech separation challenge database and compared
to the single (non-hybrid) approaches, i.e. signal processing and CASA. Results
show that using the hybrid approach outperforms the single approaches. |
---|---|
DOI: | 10.48550/arxiv.1312.4127 |