Papers

575,626 papers

TextVidBench: A Benchmark for Long Video Scene Text Understanding

Yangyang Zhong, Ji Qi, Yuan YAO, Pengxin Luo, Yunfeng Yan et al.

2025-06-05Question AnsweringPrompt EngineeringVideo Understanding+1

Paper

Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery

Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick et al.

2025-06-05Semantic SegmentationInstance Segmentation

Paper

FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation

Huihan Wang, Zhiwen Yang, HUI ZHANG, Dan Zhao, Bingzheng Wei et al.

2025-06-05DenoisingVideo Generation

Paper Code

APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval

Hong Gao, Yiming Bao, Xuezhan Tu, Bin Zhong, MinLing Zhang et al.

2025-06-05Information RetrievalVideo UnderstandingRetrieval+1

Paper

Robustness as Architecture: Designing IQA Models to Withstand Adversarial Perturbations

Igor Meleshin, Anna Chistyakova, Anastasia Antsiferova, Dmitriy Vatolin

2025-06-05Image Quality Assessment

Paper

Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining

Yong Sun, Yipeng Wang, Junyu Shi, Zhiyuan Zhang, Yanmei Xiao et al.

2025-06-05

Paper

CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx

Lukas Picek, Elisa Belotti, Michal Bojda, Ludek Bufka, Vojtech Cermak et al.

2025-06-05BenchmarkingSemantic SegmentationPose Estimation+2

Paper

Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Mus{é}e d'Arch{é}ologie Nationale

Antoine Laurent, Jean Mélou, Catherine Schwab, Rolande Simon-Millot, Sophie Féret et al.

2025-06-05

Paper

Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer

Filip Slezak, Magnus K. Gjerde, Joakim B. Haurum, Ivan Nikolov, Morten S. Laursen et al.

2025-06-05Zero-shot GeneralizationTransfer Learning3DGS

Paper

From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes

Tianxu Wang, Zhuofan Zhang, Ziyu Zhu, Yue Fan, Jing Xiong et al.

2025-06-05Spatial ReasoningVisual GroundingReferring Expression+1

Paper

Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study

Andrew Hamara, Greg Hamerly, Pablo Rivas, Andrew C. Freeman

2025-06-05Contrastive Learning

Paper Code

Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking

Yu-Feng Chen, Tzuhsuan Huang, Pin-Yen Chiu, Jun-Cheng Chen

2025-06-05Image Generation

Paper Code

Geological Field Restoration through the Lens of Image Inpainting

Vladislav Trifonov, Ivan Oseledets, Ekaterina Muravleva

2025-06-05Image Inpainting

Paper

OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model

Kunshen Zhang

2025-06-05SegmentationSemantic SegmentationLarge Language Model+3

Paper Code

DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation

Shuo Cao, Yihao Liu, Xiaohui Li, Yuanting Gao, Yu Zhou et al.

2025-06-05Super-ResolutionMotion CompensationOptical Flow Estimation+3

Paper

Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors

Svetlana Pavlitska, Jamie Robb, Nikolai Polley, Melih Yazgan, J. Marius Zöllner et al.

2025-06-05Autonomous Vehicles

Paper Code

Spike-TBR: a Noise Resilient Neuromorphic Event Representation

Gabriele Magrini, Federico Becattini, Luca Cultrera, Lorenzo Berlincioni, Pietro Pala et al.

2025-06-05

Paper

MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories

Yuyi Zhang, Yongxin Shi, Peirong Zhang, Yixin Zhao, Zhenhua Yang et al.

2025-06-05BenchmarkingZero-Shot LearningOptical Character Recognition (OCR)

Paper Code

SupeRANSAC: One RANSAC to Rule Them All

Daniel Barath

2025-06-05Pose EstimationAllSimultaneous Localization and Mapping

Paper Code

LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table

Yusuke Matsui

2025-06-05CVPR 2025 1RAG

Paper Code

PreviousPage 333 of 28782Next