VLB Suche

Suche in den Daten des Verzeichnisses lieferbarer Bücher (VLB)

Drucken

Suchergebnisse

Produktdetails

Deep Neural Network-based Approaches for Single-channel Speaker-conditioned Target Speaker Extraction

Autor
Ragini Sinha

Deep Neural Network-based Approaches for Single-channel Speaker-conditioned Target Speaker Extraction

Beschreibung

In everyday communication scenarios, such as meetings and social gatherings, undesired interfering speakers and background noise often degrade the quality and intelligibility of the desired target speaker. Various approaches have been developed to address this issue, such as blind source separation and speaker-conditioned target speaker extraction (SC-TSE). SC-TSE algorithms aim at extracting the desired speaker from the mixture by utilizing auxiliary information about the target speaker, such as reference speech, visual information, directional information, or speaker activity. A typical SC-TSE system consists of a speaker embedder network and a speaker separator network. The speaker embedder network generates target speaker-specific discriminative features from the auxiliary information, which guides the speaker separator network to extract the target speaker from the mixture. The aim of this thesis is to develop and evaluate novel DNN-based architectures, both objectively and subjectively to enhance the reliability, efficiency and robustness of single-channel SC-TSE algorithms utilizing reference speech as auxiliary information.

Verlag
Dr. Hut
ISBN/EAN
978-3-8439-5689-5
Preis
84,00 EUR
Status
lieferbar