TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

TextVidBench: A Benchmark for Long Video Scene Text Understanding

Yangyang Zhong, Ji Qi, Yuan YAO, Pengxin Luo, Yunfeng Yan et al.

2025-06-05Question AnsweringPrompt EngineeringVideo Understanding+1
Paper
Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery

Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick et al.

2025-06-05Semantic SegmentationInstance Segmentation
Paper
FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation

Huihan Wang, Zhiwen Yang, HUI ZHANG, Dan Zhao, Bingzheng Wei et al.

2025-06-05DenoisingVideo Generation
PaperCode
APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval

Hong Gao, Yiming Bao, Xuezhan Tu, Bin Zhong, MinLing Zhang et al.

2025-06-05Information RetrievalVideo UnderstandingRetrieval+1
Paper
Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations

Igor Meleshin, Anna Chistyakova, Anastasia Antsiferova, Dmitriy Vatolin

2025-06-05Image Quality Assessment
Paper
Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining

Yong Sun, Yipeng Wang, Junyu Shi, Zhiyuan Zhang, Yanmei Xiao et al.

2025-06-05
Paper
CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx

Lukas Picek, Elisa Belotti, Michal Bojda, Ludek Bufka, Vojtech Cermak et al.

2025-06-05BenchmarkingSemantic SegmentationPose Estimation+2
Paper
Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Mus{é}e d'Arch{é}ologie Nationale

Antoine Laurent, Jean Mélou, Catherine Schwab, Rolande Simon-Millot, Sophie Féret et al.

2025-06-05
Paper
Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer

Filip Slezak, Magnus K. Gjerde, Joakim B. Haurum, Ivan Nikolov, Morten S. Laursen et al.

2025-06-05Zero-shot GeneralizationTransfer Learning3DGS
Paper
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes

Tianxu Wang, Zhuofan Zhang, Ziyu Zhu, Yue Fan, Jing Xiong et al.

2025-06-05Spatial ReasoningVisual GroundingReferring Expression+1
Paper
Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study

Andrew Hamara, Greg Hamerly, Pablo Rivas, Andrew C. Freeman

2025-06-05Contrastive Learning
PaperCode
Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking

Yu-Feng Chen, Tzuhsuan Huang, Pin-Yen Chiu, Jun-Cheng Chen

2025-06-05Image Generation
PaperCode
Geological Field Restoration through the Lens of Image Inpainting

Vladislav Trifonov, Ivan Oseledets, Ekaterina Muravleva

2025-06-05Image Inpainting
Paper
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model

Kunshen Zhang

2025-06-05SegmentationSemantic SegmentationLarge Language Model+3
PaperCode
DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation

Shuo Cao, Yihao Liu, Xiaohui Li, Yuanting Gao, Yu Zhou et al.

2025-06-05Super-ResolutionMotion CompensationOptical Flow Estimation+3
Paper
Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors

Svetlana Pavlitska, Jamie Robb, Nikolai Polley, Melih Yazgan, J. Marius Zöllner et al.

2025-06-05Autonomous Vehicles
PaperCode
Spike-TBR: a Noise Resilient Neuromorphic Event Representation

Gabriele Magrini, Federico Becattini, Luca Cultrera, Lorenzo Berlincioni, Pietro Pala et al.

2025-06-05
Paper
MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Yuyi Zhang, Yongxin Shi, Peirong Zhang, Yixin Zhao, Zhenhua Yang et al.

2025-06-05BenchmarkingZero-Shot LearningOptical Character Recognition (OCR)
PaperCode
SupeRANSAC: One RANSAC to Rule Them All

Daniel Barath

2025-06-05Pose EstimationAllSimultaneous Localization and Mapping
PaperCode
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table

Yusuke Matsui

2025-06-05CVPR 2025 1RAG
PaperCode
PreviousPage 333 of 28782Next