ConferencingSpeech 2021
Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing
Introduction
Welcome to Far-field Multi-channel Speech Enhancement Challenge for Video Conferencing (ConferencingSpeech 2021).
With the advances in video conferencing technology, we are able to seamlessly connect with people of our choice anytime anywhere in the world. Video conferencing helps break barriers of distance among people. However, during video conference, the speech quality will be significantly affected by background noise, reverberation, number of recording microphones, the layout of microphone array, the acoustic and circuit design of microphone arrays, and so on. Effective speech enhancement plays an important role in the video conferencing system. Although the performance of speech enhancement has been improved dramatically in the past several decades, there are still a set of open research problems that should be further addressed in the far-field and complex meeting room environments, which includes but not limited to:
- Multi-channel speech enhancement with single microphone array
- Multi-channel speech enhancement with multiple distributed microphone arrays
- Multi-channel speech enhancement with noisy and reverberant environments
- Multi-channel speech enhancement with low-latency and zero look-ahead (casual system)
ConferencingSpeech 2021 challenge is proposed to stimulate research in the areas mentioned above. Targeting the real video conferencing room application, the challenge database is recorded from real speakers. Number of speakers and distances between speakers and microphone arrays vary according to the sizes of meeting rooms. Multiple microphone arrays with different geometric topologies are allocated in each recording environment. The challenge will provide the list of training set, development set, scripts for simulation, and baseline systems for participants to develop their systems. The final ranking of the challenge will be decided by the subjective evaluation. The subjective evaluation will be performed using Absolute Category Ratings (ACR) to estimate a Mean Opinion Score (MOS) through Tencent Online Media Subjective Evaluation platform.
The challenge has two tasks:
Task 1
Multi-channel speech enhancement with single microphone array
Task 2
Multi-channel speech enhancement with multiple distributed microphone arrays
Each registered team could participate in any one or both tasks. Participating in both tasks are encouraged. Top two winning teams from each task will be awarded the prizes provided by Tencent Ethereal Audio Lab. The First Prize is 1500 USD. The Second Prize is 800 USD. The internship opportunities at Tencent Ethereal Audio Lab will be provided to the students with excellent performance. More details about the data and ConferencingSpeech 2021 challenge can be found here.
Evaluation PlanTimelinePacific Time:
22 Jan 2021
Release of evaluation plan, the list of training data, noise dataset, development test set, and scripts for simulation
05 Feb 2021
Release of baseline system
07 Mar 2021
Deadline of challenge registration
08 Mar 2021
Release of evaluation test set
16 Mar 2021
Deadline of submitting the results for subjective evaluation on the evaluation test set
25 Mar 2021
Notification of the results of participants
26 Mar 2021
Interspeech paper submission deadline
02 Apr 2021
Interspeech paper update deadline
02 Jun 2021
Paper acceptance/rejection notification
05 Jun 2021
Notification of the winners
31 Aug 2021
Opening of Interspeech 2021
Registration & Submission
Registration
Please register by the following entrance. Once the registration is completed, you will receive our confirmation email.
RegistrationSubmission
Please submit your processed evaluation audio sets and system description by the following link.
SubmissionDownload
Participants will receive the download link/password when their registrations are confirmed.
Organizers
Wei Rao
Senior Researcher
Tencent Ethereal Audio Lab
China
Tencent Ethereal Audio Lab
China
Lei Xie
Professor
Northwestern Polytechnical
University
China
Northwestern Polytechnical
University
China
Yannan Wang
Senior Researcher
Tencent Ethereal Audio Lab
China
Tencent Ethereal Audio Lab
China
Tao Yu
Principal Research Engineer
& Manager
Tencent Ethereal Audio Lab
USA
& Manager
Tencent Ethereal Audio Lab
USA
Shinji Watanabe
Associate Professor
Carnegie Mellon University/
Johns Hopkins University
USA
Carnegie Mellon University/
Johns Hopkins University
USA
Zheng-Hua Tan
Professor
Aalborg University
Denmark
Aalborg University
Denmark
Hui Bu
CEO
AISHELL foundation
China
AISHELL foundation
China
Shidong Shang
Senior Director
Tencent Ethereal Audio Lab
China
Tencent Ethereal Audio Lab
China
Rankings
RankingsContact Us
If you have questions about this challenge, please email us at ConferencingSpeech@tencent.com. If you haven’t received the reply in three days, please send email to ConferencingSpeech@gmail.com.