TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Auto-Tuning Spectral Clustering for Speaker Diarization Us...

Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

Tae Jin Park, Kyu J. Han, Manoj Kumar, Shrikanth Narayanan

2020-03-05ClusteringSpeaker Diarization
PaperPDFCode(official)

Abstract

In this study, we propose a new spectral clustering framework that can auto-tune the parameters of the clustering algorithm in the context of speaker diarization. The proposed framework uses normalized maximum eigengap (NME) values to estimate the number of clusters and the parameters for the threshold of the elements of each row in an affinity matrix during spectral clustering, without the use of parameter tuning on the development set. Even through this hands-off approach, we achieve a comparable or better performance across various evaluation sets than the results found using traditional clustering methods that apply careful parameter tuning and development data. A relative improvement of 17% in the speaker error rate on the well-known CALLHOME evaluation set shows the effectiveness of our proposed spectral clustering with auto-tuning.

Results

TaskDatasetMetricValueModel
Speaker DiarizationCALLHOMEDER(%)21.13COS+AHC (Oracle SAD)
Speaker DiarizationCALLHOMEDER(%)24.05COS+NJW-SC (Oracle SAD)
Speaker DiarizationCALLHOMEDER(ig olp)7.29COS+NME-SC (Oracle SAD)
Speaker DiarizationCALLHOMEDER(ig olp)8.39PLDA+AHC (Oracle SAD)
Speaker DiarizationCALLHOMEDER(ig olp)8.78COS+B-SC (Oracle SAD)

Related Papers

Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18Ranking Vectors Clustering: Theory and Applications2025-07-16Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework2025-07-11GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09Consistency and Inconsistency in $K$-Means Clustering2025-07-08MC-INR: Efficient Encoding of Multivariate Scientific Simulation Data using Meta-Learning and Clustered Implicit Neural Representations2025-07-03Supercm: Revisiting Clustering for Semi-Supervised Learning2025-06-30Temporal Rate Reduction Clustering for Human Motion Segmentation2025-06-26