TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hybrid Neural Networks for On-device Directional Hearing

Hybrid Neural Networks for On-device Directional Hearing

Anran Wang, Maruchi Kim, Hao Zhang, Shyamnath Gollakota

2021-12-11AAAI 2022 2Directional HearingAudio Source SeparationReal-time Directional HearingCausal Inference
PaperPDFCode(official)

Abstract

On-device directional hearing requires audio source separation from a given direction while achieving stringent human-imperceptible latency requirements. While neural nets can achieve significantly better performance than traditional beamformers, all existing models fall short of supporting low-latency causal inference on computationally-constrained wearables. We present DeepBeam, a hybrid model that combines traditional beamformers with a custom lightweight neural net. The former reduces the computational burden of the latter and also improves its generalizability, while the latter is designed to further reduce the memory and computational overhead to enable real-time and low-latency operations. Our evaluation shows comparable performance to state-of-the-art causal inference models on synthetic data while achieving a 5x reduction of model size, 4x reduction of computation per second, 5x reduction in processing time and generalizing better to real hardware data. Further, our real-time hybrid model runs in 8 ms on mobile CPUs designed for low-power wearable devices and achieves an end-to-end latency of 17.5 ms.

Results

TaskDatasetMetricValueModel
Audio Source SeparationVCTKSI-SDRi13.3HybridBeam+
Audio Source SeparationVCTKSI-SDRi13.3HybridBeam+
Directional HearingVCTKSI-SDRi13.3HybridBeam+
Directional HearingVCTKSI-SDRi13.3HybridBeam+

Related Papers

Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models2025-07-15Estimating Interventional Distributions with Uncertain Causal Graphs through Meta-Learning2025-07-07Causal-Aware Intelligent QoE Optimization for VR Interaction with Adaptive Keyframe Extraction2025-06-24Quantum Neural Networks for Propensity Score Estimation and Survival Analysis in Observational Biomedical Studies2025-06-24Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition2025-06-23T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent2025-06-23An Empirical Comparison of Weak-IV-Robust Procedures in Just-Identified Models2025-06-22Causal Interventions in Bond Multi-Dealer-to-Client Platforms2025-06-22