The Speed Submission to DIHARD II: Contributions & Lessons Learned - Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

The Speed Submission to DIHARD II: Contributions & Lessons Learned

Résumé

This paper describes the speaker diarization systems developed for the Second DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides describing the system, which considerably outperformed the challenge baselines, we also focus on the lessons learned from numerous approaches that we tried for single and multi-channel systems. We present several components of our diarization system, including categorization of domains, speech enhancement, speech activity detection, speaker embeddings, clustering methods, resegmentation, and system fusion. We analyze and discuss the effect of each such component on the overall diarization performance within the realistic settings of the challenge.
Fichier principal
Vignette du fichier
DIHARDSpeed.pdf (263.68 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02352840 , version 1 (07-11-2019)
hal-02352840 , version 2 (30-06-2020)

Identifiants

  • HAL Id : hal-02352840 , version 1

Citer

Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, et al.. The Speed Submission to DIHARD II: Contributions & Lessons Learned. 2019. ⟨hal-02352840v1⟩
160 Consultations
462 Téléchargements

Partager

Gmail Facebook X LinkedIn More