ConferencingSpeech 2021

Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing

Introduction

Welcome to the Far-field Multi-channel Speech Enhancement Challenge for Video Conferencing (ConferencingSpeech 2021).

With advances in video conferencing technology, we can seamlessly connect with people anytime, anywhere in the world. Video conferencing helps break down the barriers of distance between people. However, during a video conference, speech quality is significantly affected by background noise, reverberation, the number of recording microphones, the layout of the microphone array, the acoustic and circuit design of the microphone arrays, and so on. Effective speech enhancement therefore plays an important role in video conferencing systems. Although the performance of speech enhancement has improved dramatically over the past several decades, a set of open research problems remains to be addressed in far-field and complex meeting room environments, including but not limited to:

Multi-channel speech enhancement with single microphone array
Multi-channel speech enhancement with multiple distributed microphone arrays
Multi-channel speech enhancement with noisy and reverberant environments
Multi-channel speech enhancement with low latency and zero look-ahead (causal system)

The ConferencingSpeech 2021 challenge is proposed to stimulate research in the areas mentioned above. Targeting real video conferencing room applications, the challenge database is recorded from real speakers. The number of speakers and the distances between speakers and microphone arrays vary according to the sizes of the meeting rooms. Multiple microphone arrays with different geometric topologies are placed in each recording environment. The challenge will provide the training set list, the development set, simulation scripts, and baseline systems for participants to develop their systems. The final ranking will be decided by subjective evaluation, performed using Absolute Category Ratings (ACR) to estimate a Mean Opinion Score (MOS) through the Tencent Online Media Subjective Evaluation platform.
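As a rough illustration of how a MOS is obtained from ACR votes, the sketch below averages listener ratings on the standard five-point ACR scale (1 = Bad, 5 = Excellent) and reports a 95% confidence half-width. The function name `mean_opinion_score`, the example ratings, and the normal-approximation confidence interval are assumptions made for illustration; the exact procedure used on the evaluation platform is not described here.

```python
from math import sqrt
from statistics import mean, stdev


def mean_opinion_score(ratings):
    """Return (MOS, 95% CI half-width) for a list of ACR ratings.

    Hypothetical helper: ratings are integers on the five-point ACR
    scale (1 = Bad, 2 = Poor, 3 = Fair, 4 = Good, 5 = Excellent).
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r not in (1, 2, 3, 4, 5) for r in ratings):
        raise ValueError("ACR ratings must be integers from 1 to 5")

    mos = mean(ratings)
    # Assumption: normal approximation for the confidence interval;
    # with a single rating no spread can be estimated.
    if len(ratings) > 1:
        half_width = 1.96 * stdev(ratings) / sqrt(len(ratings))
    else:
        half_width = 0.0
    return mos, half_width


# Made-up votes from five listeners for one enhanced clip.
mos, ci = mean_opinion_score([4, 5, 3, 4, 4])
print(f"MOS = {mos:.2f} +/- {ci:.2f}")
```

Systems could then be compared by their MOS averaged over all clips, with the confidence interval indicating whether a difference between two systems is likely to be meaningful.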

The challenge has two tasks:

Task 1

Multi-channel speech enhancement with single microphone array

Task 2

Multi-channel speech enhancement with multiple distributed microphone arrays

Each registered team may participate in either or both tasks. Participating in both tasks is encouraged. The top two teams in each task will be awarded prizes provided by Tencent Ethereal Audio Lab. The First Prize is 1500 USD. The Second Prize is 800 USD. Internship opportunities at Tencent Ethereal Audio Lab will be offered to students with excellent performance. More details about the data and the ConferencingSpeech 2021 challenge can be found here.

Evaluation Plan

Timeline (Pacific Time):

22 Jan 2021

Release of evaluation plan, the list of training data, noise dataset, development test set, and scripts for simulation

05 Feb 2021

Release of baseline system

07 Mar 2021

Deadline of challenge registration

08 Mar 2021

Release of evaluation test set

16 Mar 2021

Deadline for submitting results for subjective evaluation on the evaluation test set

25 Mar 2021

Notification of results to participants

26 Mar 2021

Interspeech paper submission deadline

02 Apr 2021

Interspeech paper update deadline

02 Jun 2021

Paper acceptance/rejection notification

05 Jun 2021

Notification of the winners

31 Aug 2021

Opening of Interspeech 2021

Registration & Submission

Registration

Please register using the link below. Once registration is complete, you will receive a confirmation email.

Registration

Submission

Please submit your processed evaluation audio sets and system description using the link below.

Submission

Download

Participants will receive the download link and password once their registration is confirmed.

Contest Rules

FAQ

List of training set and development test set

List of training set and development test set (for participants in China)

Scripts for simulation

Baseline systems

Instruction of submission

Evaluation test set

Organizers

Wei Rao

Senior Researcher
Tencent Ethereal Audio Lab
China

Lei Xie

Professor
Northwestern Polytechnical University
China

Yannan Wang

Senior Researcher
Tencent Ethereal Audio Lab
China

Tao Yu

Principal Research Engineer & Manager
Tencent Ethereal Audio Lab
USA

Shinji Watanabe

Associate Professor
Carnegie Mellon University / Johns Hopkins University
USA

Zheng-Hua Tan

Professor
Aalborg University
Denmark

Hui Bu

CEO
AISHELL foundation
China

Shidong Shang

Senior Director
Tencent Ethereal Audio Lab
China

Rankings

If you have questions about this challenge, please email us at ConferencingSpeech@tencent.com. If you have not received a reply within three days, please email ConferencingSpeech@gmail.com.