Abstract : This paper proposes a method for segmenting and clustering an audio flow on the basis of speaker turns. This process, also known as speaker diarization, is of major importance in multimedia indexation. Here, we propose to realize this process online and without any prior knowledge on the number of speakers. This is done thanks to a statistical modelling of speakers based on a size-monitored growing neural gas algorithm.
Document type :
Conference papers
Complete list of metadatas
https://hal-supelec.archives-ouvertes.fr/hal-00552988
Contributor : Sébastien van Luchene <>
Submitted on : Thursday, January 6, 2011 - 12:01:14 PM Last modification on : Monday, December 14, 2020 - 2:10:02 PM