Schrödinger Bridge Mamba for One-Step Speech Enhancement

Jing Yang, Sirui Wang, Chao Wu, Fan Fan
Central Media Technology Institute, Huawei

Abstract

We propose Schrödinger Bridge Mamba (SBM), a new concept of training-inference framework motivated by the inherent compatibility between Schrödinger Bridge (SB) training paradigm and selective state-space model architecture. We exemplify the concept of SBM with an implementation for speech enhancement. Experiments on a joint denoising and dereverberation task using four benchmark datasets demonstrate that SBM, with only 1-step inference, outperforms strong baselines with 1-step or iterative inference and achieves the best real-time factor (RTF). The integration of SB paradigm and state-space model indicates a promising direction for exploring new deep generative models, with strong potential for application in a broad range of generative tasks beyond audio.

DNS Real Recordings

Group 1
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 2
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 3
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 4
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 5
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 6
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 7
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 8
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H

DNS With Reverb

Group 1
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 2
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 3
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 4
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 5
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 6
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 7
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 8
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H

DNS No Reverb

Group 1
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 2
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 3
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 4
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 5
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 6
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 7
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 8
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H

VoiceBand-Demand

Group 1
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 2
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 3
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 4
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 5
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 6
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 7
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H
Group 8
Input
Spectrogram A
SBM
Spectrogram B
SB-NCSN++(50)
Spectrogram C
SB-NCSN++(10)
Spectrogram D
SB-NCSN++(1)
Spectrogram E
SBCTM
Spectrogram F
SB-UFOGen
Spectrogram G
Mamba-base
Spectrogram H