AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Wearable devices like smart glasses are approaching the compute capability to seamlessly generate real-time closed captions for live conversations. We build on our recently introduced directional Automatic Speech Recognition (ASR) for smart glasses that have microphone arrays, which fuses multi-chan...

Full description

Saved in:
Bibliographic Details
Published in:ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 11951 - 11955
Main Authors: Lin, Ju, Moritz, Niko, Huang, Yiteng, Xie, Ruiming, Sun, Ming, Fuegen, Christian, Seide, Frank
Format: Conference Proceeding
Language:English
Published: IEEE 14-04-2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first