TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unleash the Potential of CLIP for Video Highlight Detection

Unleash the Potential of CLIP for Video Highlight Detection

Donghoon Han, Seunghyeon Seo, Eunhwan Park, Seong-Uk Nam, Nojun Kwak

2024-04-02Highlight Detection
PaperPDFCode

Abstract

Multimodal and large language models (LLMs) have revolutionized the utilization of open-world knowledge, unlocking novel potentials across various tasks and applications. Among these domains, the video domain has notably benefited from their capabilities. In this paper, we present Highlight-CLIP (HL-CLIP), a method designed to excel in the video highlight detection task by leveraging the pre-trained knowledge embedded in multimodal models. By simply fine-tuning the multimodal encoder in combination with our innovative saliency pooling technique, we have achieved the state-of-the-art performance in the highlight detection task, the QVHighlight Benchmark, to the best of our knowledge.

Results

TaskDatasetMetricValueModel
Highlight DetectionQVHighlightsHit@170.6HL-CLIP
Highlight DetectionQVHighlightsmAP41.94HL-CLIP
16kQVHighlightsHit@170.6HL-CLIP
16kQVHighlightsmAP41.94HL-CLIP

Related Papers

Unsupervised Transcript-assisted Video Summarization and Highlight Detection2025-05-29Rhapsody: A Dataset for Highlight Detection in Podcasts2025-05-26Gameplay Highlights Generation2025-05-12TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action2025-05-02Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention2025-04-13LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection2025-01-18Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection2025-01-18Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection2025-01-05