Papers

575,626 papers

Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis

Neeraj Kumar, Swaraj Nanda, Siddharth Singi, Jamal Benhamida, David Kim et al.

2025-06-05 | whole slide images, Multiple Instance Learning, Multi-Label Classification

Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline

Yuzhi Huang, Chenxin Li, HaiTao Zhang, Zixu Lin, Yunlong Lin et al.

2025-06-05 | Anomaly Localization, Video Anomaly Detection, Anomaly Detection, +1

Through-the-Wall Radar Human Activity Recognition WITHOUT Using Neural Networks

Weicheng Gao

2025-06-05 | Human Activity Recognition, Activity Recognition

Code available

FRED: The Florence RGB-Event Drone Dataset

Gabriele Magrini, Niccolò Marini, Federico Becattini, Lorenzo Berlincioni, Niccolò Biondi et al.

2025-06-05 | Benchmarking, Trajectory Forecasting

CIVET: Systematic Evaluation of Understanding in VLMs

Massimo Rizzoli, Simone Alghisi, Olha Khomyn, Gabriel Roccabruna, Seyed Mahed Mousavi et al.

2025-06-05

Practical Manipulation Model for Robust Deepfake Detection

Benedikt Hopf, Radu Timofte

2025-06-05 | Super-Resolution, DeepFake Detection, Image Super-Resolution, +1

DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models

Revant Teotia, Candace Ross, Karen Ullrich, Sumit Chopra, Adriana Romero-Soriano et al.

2025-06-05 | Benchmarking, Large Language Model, Specificity

Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers

Haosong Liu, Yuge Cheng, Zihan Liu, Aiyue Chen, Jing Lin et al.

2025-06-05 | Text-to-Video Generation, Video Generation

FG 2025 TrustFAA: the First Workshop on Towards Trustworthy Facial Affect Analysis: Advancing Insights of Fairness, Explainability, and Safety (TrustFAA)

Jiaee Cheong, Yang Liu, Harold Soh, Hatice Gunes

2025-06-05 | Fairness, Depression Detection, Facial Action Unit Detection, +4

Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics

Haotian Lan

2025-06-05 | Large Language Model

SeedEdit 3.0: Fast and High-Quality Generative Image Editing

Peng Wang, Yichun Shi, Xiaochen Lian, Zhonghua Zhai, Xin Xia et al.

2025-06-05 | Instruction Following

A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions

Anh Le, Thanh Lam, Dung Nguyen

2025-06-05 | document understanding, Information Retrieval, Optical Character Recognition (OCR), +1

FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing

Guangzhao Li, Yanming Yang, Chenxi Song, Chi Zhang

2025-06-05 | Video Editing, Text-to-Video Editing

Physical Annotation for Automated Optical Inspection: A Concept for In-Situ, Pointer-Based Trainingdata Generation

Oliver Krumpek, Oliver Heimann, Jörg Krüger

2025-06-05

UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting

Jaehoon Choi, Dongki Jung, Christopher Maxey, Yonghan Lee, Sungmin Eum et al.

2025-06-05 | Neural Rendering, Novel View Synthesis

Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting

Alfred T. Christiansen, Andreas H. Højrup, Morten K. Stephansen, Md Ibtihaj A. Sakib, Taman S. Poojary et al.

2025-06-05 | Semantic Segmentation, Point Cloud Segmentation, 3DGS

Structure-Aware Radar-Camera Depth Estimation

Fuyi Zhang, Zhu Yu, Chunhao Li, Runmin Zhang, Xiaokai Bai et al.

2025-06-05 | regression, Depth Estimation, Monocular Depth Estimation

Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts

Gengluo Li, Huawen Shen, Yu Zhou

2025-06-05 | Text Retrieval, Retrieval

PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment

Edoardo Bianchi, Antonio Liotta

2025-06-05

Multi-scale Image Super Resolution with a Single Auto-Regressive Model

Enrique Sanchez, Isma Hadji, Adrian Bulat, Christos Tzelepis, Brais Martinez et al.

2025-06-05 | Super-Resolution, Image Super-Resolution
