TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom...

Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation

Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello

2022-04-04Sound Event Localization and Detection
PaperPDFCode(official)

Abstract

Spatial audio methods are gaining a growing interest due to the spread of immersive audio experiences and applications, such as virtual and augmented reality. For these purposes, 3D audio signals are often acquired through arrays of Ambisonics microphones, each comprising four capsules that decompose the sound field in spherical harmonics. In this paper, we propose a dual quaternion representation of the spatial sound field acquired through an array of two First Order Ambisonics (FOA) microphones. The audio signals are encapsulated in a dual quaternion that leverages quaternion algebra properties to exploit correlations among them. This augmented representation with 6 degrees of freedom (6DOF) involves a more accurate coverage of the sound field, resulting in a more precise sound localization and a more immersive audio experience. We evaluate our approach on a sound event localization and detection (SELD) benchmark. We show that our dual quaternion SELD model with temporal convolution blocks (DualQSELD-TCN) achieves better results with respect to real and quaternion-valued baselines thanks to our augmented representation of the sound field. Full code is available at: https://github.com/ispamm/DualQSELD-TCN.

Results

TaskDatasetMetricValueModel
Sound Event Localization and DetectionL3DAS21SELD score0.324DualQSELD-TCN (parallel)

Related Papers

Spatial and Semantic Embedding Integration for Stereo Sound Event Localization and Detection in Regular Videos2025-07-07Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling2025-06-16CST-former: Multidimensional Attention-based Transformer for Sound Event Localization and Detection in Real Scenes2025-04-17Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation2025-04-11An Experimental Study on Joint Modeling for Sound Event Localization and Detection with Source Distance Estimation2025-01-18MVANet: Multi-Stage Video Attention Network for Sound Event Localization and Detection with Source Distance Estimation2024-11-21Class-Incremental Learning for Sound Event Localization and Detection2024-11-19PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection2024-11-10