Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

ReLER@ZJU-Alibaba Submission to the Ego4D Natural Language Queries Challenge 2022

Naiyuan Liu, Xiaohan Wang, Xiaobo Li, Yi Yang, Yueting Zhuang

Date: 2022-07-01
Tasks: Data Augmentation, Natural Language Queries

Abstract

In this report, we present the ReLER@ZJU-Alibaba submission to the Ego4D Natural Language Queries (NLQ) Challenge in CVPR 2022. Given a video clip and a text query, the goal of this challenge is to locate a temporal moment of the video clip where the answer to the query can be obtained. To tackle this task, we propose a multi-scale cross-modal transformer and a video frame-level contrastive loss to fully uncover the correlation between language queries and video clips. Besides, we propose two data augmentation strategies to increase the diversity of training samples. The experimental results demonstrate the effectiveness of our method. The final submission ranked first on the leaderboard.

Results

Task | Dataset | Metric | Value | Model
--- | --- | --- | --- | ---
Natural Language Queries | Ego4D | R@1 IoU=0.3 | 12.89 | ReLER@ZJU-Alibaba
Natural Language Queries | Ego4D | R@1 IoU=0.5 | 8.14 | ReLER@ZJU-Alibaba
Natural Language Queries | Ego4D | R@1 Mean(0.3 and 0.5) | 10.52 | ReLER@ZJU-Alibaba
Natural Language Queries | Ego4D | R@5 IoU=0.3 | 15.41 | ReLER@ZJU-Alibaba
Natural Language Queries | Ego4D | R@5 IoU=0.5 | 9.94 | ReLER@ZJU-Alibaba
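
For readers unfamiliar with the metrics above, the following is a minimal sketch of how "R@k IoU=θ" is commonly computed for temporal localization: a query counts as a hit if any of its top-k predicted time spans overlaps the ground-truth span with temporal IoU at or above θ. This is not the challenge's official evaluation code. The function names, the `(start, end)` span representation in seconds, the `>=` comparison, and the toy data are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: a moment is a (start_sec, end_sec) pair.
Span = Tuple[float, float]


def temporal_iou(pred: Span, gt: Span) -> float:
    """Intersection over union of two time intervals."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def recall_at_k(
    ranked_preds: List[List[Span]],
    gts: List[Span],
    k: int,
    iou_threshold: float,
) -> float:
    """Percentage of queries whose top-k predictions contain at least one
    span with IoU >= iou_threshold against the ground-truth span."""
    if not gts:
        return 0.0
    hits = 0
    for preds, gt in zip(ranked_preds, gts):
        if any(temporal_iou(p, gt) >= iou_threshold for p in preds[:k]):
            hits += 1
    return 100.0 * hits / len(gts)


# Toy data (invented for illustration): two queries, two ranked predictions each.
toy_preds = [[(0.0, 10.0), (5.0, 15.0)], [(20.0, 30.0), (0.0, 4.0)]]
toy_gts = [(4.0, 14.0), (0.0, 5.0)]

r1_03 = recall_at_k(toy_preds, toy_gts, k=1, iou_threshold=0.3)
r1_05 = recall_at_k(toy_preds, toy_gts, k=1, iou_threshold=0.5)
mean_r1 = (r1_03 + r1_05) / 2  # analogous to the "Mean(0.3 and 0.5)" row
```

Under this definition, the "R@1 Mean(0.3 and 0.5)" row is the average of the two R@1 rows: (12.89 + 8.14) / 2 = 10.515, which matches the reported 10.52 after rounding.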
