TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Time Masking: Leveraging Temporal Information in Spoken Di...

Time Masking: Leveraging Temporal Information in Spoken Dialogue Systems

Rylan Conway, Lambert Mathias

2019-07-25WS 2019 9Video Salient Object DetectionSpoken Dialogue Systems
PaperPDF

Abstract

In a spoken dialogue system, dialogue state tracker (DST) components track the state of the conversation by updating a distribution of values associated with each of the slots being tracked for the current user turn, using the interactions until then. Much of the previous work has relied on modeling the natural order of the conversation, using distance based offsets as an approximation of time. In this work, we hypothesize that leveraging the wall-clock temporal difference between turns is crucial for finer-grained control of dialogue scenarios. We develop a novel approach that applies a {\it time mask}, based on the wall-clock time difference, to the associated slot embeddings and empirically demonstrate that our proposed approach outperforms existing approaches that leverage distance offsets, on both an internal benchmark dataset as well as DSTC2.

Results

TaskDatasetMetricValueModel
VideoSegTrack v2AVERAGE MAE0.116TIMP
VideoSegTrack v2S-Measure0.644TIMP
VideoSegTrack v2max E-measure0.768TIMP
VideoMCLAVERAGE MAE0.113TIMP
VideoMCLMAX E-MEASURE0.76TIMP
VideoMCLS-Measure0.642TIMP
Object DetectionSegTrack v2AVERAGE MAE0.116TIMP
Object DetectionSegTrack v2S-Measure0.644TIMP
Object DetectionSegTrack v2max E-measure0.768TIMP
Object DetectionMCLAVERAGE MAE0.113TIMP
Object DetectionMCLMAX E-MEASURE0.76TIMP
Object DetectionMCLS-Measure0.642TIMP
3DSegTrack v2AVERAGE MAE0.116TIMP
3DSegTrack v2S-Measure0.644TIMP
3DSegTrack v2max E-measure0.768TIMP
3DMCLAVERAGE MAE0.113TIMP
3DMCLMAX E-MEASURE0.76TIMP
3DMCLS-Measure0.642TIMP
Video Object SegmentationSegTrack v2AVERAGE MAE0.116TIMP
Video Object SegmentationSegTrack v2S-Measure0.644TIMP
Video Object SegmentationSegTrack v2max E-measure0.768TIMP
Video Object SegmentationMCLAVERAGE MAE0.113TIMP
Video Object SegmentationMCLMAX E-MEASURE0.76TIMP
Video Object SegmentationMCLS-Measure0.642TIMP
RGB Salient Object DetectionSegTrack v2AVERAGE MAE0.116TIMP
RGB Salient Object DetectionSegTrack v2S-Measure0.644TIMP
RGB Salient Object DetectionSegTrack v2max E-measure0.768TIMP
RGB Salient Object DetectionMCLAVERAGE MAE0.113TIMP
RGB Salient Object DetectionMCLMAX E-MEASURE0.76TIMP
RGB Salient Object DetectionMCLS-Measure0.642TIMP
2D ClassificationSegTrack v2AVERAGE MAE0.116TIMP
2D ClassificationSegTrack v2S-Measure0.644TIMP
2D ClassificationSegTrack v2max E-measure0.768TIMP
2D ClassificationMCLAVERAGE MAE0.113TIMP
2D ClassificationMCLMAX E-MEASURE0.76TIMP
2D ClassificationMCLS-Measure0.642TIMP
2D Object DetectionSegTrack v2AVERAGE MAE0.116TIMP
2D Object DetectionSegTrack v2S-Measure0.644TIMP
2D Object DetectionSegTrack v2max E-measure0.768TIMP
2D Object DetectionMCLAVERAGE MAE0.113TIMP
2D Object DetectionMCLMAX E-MEASURE0.76TIMP
2D Object DetectionMCLS-Measure0.642TIMP
16kSegTrack v2AVERAGE MAE0.116TIMP
16kSegTrack v2S-Measure0.644TIMP
16kSegTrack v2max E-measure0.768TIMP
16kMCLAVERAGE MAE0.113TIMP
16kMCLMAX E-MEASURE0.76TIMP
16kMCLS-Measure0.642TIMP

Related Papers

Prompt-Guided Turn-Taking Prediction2025-06-26Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model2025-06-04Towards a Japanese Full-duplex Spoken Dialogue System2025-06-03Chain-of-Thought Training for Open E2E Spoken Dialogue Systems2025-05-31WavReward: Spoken Dialogue Models With Generalist Reward Evaluators2025-05-14Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation2025-04-08Speculative End-Turn Detector for Efficient Speech Chatbot Assistant2025-03-30ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems2025-03-11