
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


TransNet V2: An effective deep network architecture for fast shot transition detection

Tomáš Souček, Jakub Lokoč

2020-08-11 | Boundary Detection | Camera shot boundary detection

Abstract

Although automatic shot transition detection approaches have been investigated for more than two decades, an effective universal human-level model has not yet been proposed. Even for common shot transitions like hard cuts or simple gradual changes, the potential diversity of analyzed video content may still lead to both false hits and false dismissals. Recently, deep learning-based approaches have significantly improved the accuracy of shot transition detection using 3D convolutional architectures and artificially created training data. Nevertheless, one hundred percent accuracy is still an unreachable ideal. In this paper, we share the current version of our deep network TransNet V2, which reaches state-of-the-art performance on respected benchmarks. A trained instance of the model is provided so that the community can use it immediately for highly efficient analysis of large video archives. Furthermore, the network architecture and our experience with the training process are detailed, including simple code snippets for convenient usage of the proposed model and visualization of results.

Results

Task | Dataset | Metric | Value | Model
Video Segmentation | ClipShots | F1 score | 77.9 | TransNet V2
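
For readers unfamiliar with the metric, the following is a minimal sketch of how an F1 score for transition detection can be computed. It is not the benchmark's official evaluation code. The helper names `count_matches` and `f1_score`, the greedy matching rule, and the `tolerance` parameter are assumptions made for illustration, and the frame indices in the usage lines are invented. The function returns a fraction in [0, 1]; the value in the table above appears to be reported on a 0 to 100 scale.

```python
from typing import List, Tuple


def count_matches(predicted: List[int], ground_truth: List[int],
                  tolerance: int = 0) -> Tuple[int, int, int]:
    """Greedily match predicted transition frames to ground-truth frames.

    Hypothetical matching rule: a prediction counts as a true positive if an
    unmatched ground-truth transition lies within `tolerance` frames of it.
    Returns (true_positives, false_positives, false_negatives).
    """
    unmatched = sorted(ground_truth)
    true_positives = 0
    for frame in sorted(predicted):
        # Take the first still-unmatched ground-truth frame within tolerance.
        hit = next((g for g in unmatched if abs(g - frame) <= tolerance), None)
        if hit is not None:
            unmatched.remove(hit)
            true_positives += 1
    false_positives = len(predicted) - true_positives
    false_negatives = len(unmatched)
    return true_positives, false_positives, false_negatives


def f1_score(true_positives: int, false_positives: int,
             false_negatives: int) -> float:
    """Harmonic mean of precision and recall, as a fraction in [0, 1]."""
    detected = true_positives + false_positives
    actual = true_positives + false_negatives
    if detected == 0 or actual == 0:
        return 0.0
    precision = true_positives / detected
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Invented example data: frame indices of detected and annotated transitions.
tp, fp, fn = count_matches([10, 50, 90], [10, 52, 120, 200], tolerance=2)
score = f1_score(tp, fp, fn)
```

Because F1 is the harmonic mean of precision and recall, it penalizes both false hits (spurious detections) and false dismissals (missed transitions), the two error types named in the abstract.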
