Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis

Neeraj Kumar, Swaraj Nanda, Siddharth Singi, Jamal Benhamida, David Kim et al.

2025-06-05 | whole slide images, Multiple Instance Learning, Multi-Label Classification

Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline

Yuzhi Huang, Chenxin Li, HaiTao Zhang, Zixu Lin, Yunlong Lin et al.

2025-06-05 | Anomaly Localization, Video Anomaly Detection, Anomaly Detection, +1 more

Through-the-Wall Radar Human Activity Recognition WITHOUT Using Neural Networks

Weicheng Gao

2025-06-05 | Human Activity Recognition, Activity Recognition | Code available

FRED: The Florence RGB-Event Drone Dataset

Gabriele Magrini, Niccolò Marini, Federico Becattini, Lorenzo Berlincioni, Niccolò Biondi et al.

2025-06-05 | Benchmarking, Trajectory Forecasting

CIVET: Systematic Evaluation of Understanding in VLMs

Massimo Rizzoli, Simone Alghisi, Olha Khomyn, Gabriel Roccabruna, Seyed Mahed Mousavi et al.

2025-06-05

Practical Manipulation Model for Robust Deepfake Detection

Benedikt Hopf, Radu Timofte

2025-06-05 | Super-Resolution, DeepFake Detection, Image Super-Resolution, +1 more

DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models

Revant Teotia, Candace Ross, Karen Ullrich, Sumit Chopra, Adriana Romero-Soriano et al.

2025-06-05 | Benchmarking, Large Language Model, Specificity

Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers

Haosong Liu, Yuge Cheng, Zihan Liu, Aiyue Chen, Jing Lin et al.

2025-06-05 | Text-to-Video Generation, Video Generation

FG 2025 TrustFAA: the First Workshop on Towards Trustworthy Facial Affect Analysis: Advancing Insights of Fairness, Explainability, and Safety (TrustFAA)

Jiaee Cheong, Yang Liu, Harold Soh, Hatice Gunes

2025-06-05 | Fairness, Depression Detection, Facial Action Unit Detection, +4 more

Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics

Haotian Lan

2025-06-05 | Large Language Model

SeedEdit 3.0: Fast and High-Quality Generative Image Editing

Peng Wang, Yichun Shi, Xiaochen Lian, Zhonghua Zhai, Xin Xia et al.

2025-06-05 | Instruction Following

A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions

Anh Le, Thanh Lam, Dung Nguyen

2025-06-05 | document understanding, Information Retrieval, Optical Character Recognition (OCR), +1 more

FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing

Guangzhao Li, Yanming Yang, Chenxi Song, Chi Zhang

2025-06-05 | Video Editing, Text-to-Video Editing

Physical Annotation for Automated Optical Inspection: A Concept for In-Situ, Pointer-Based Trainingdata Generation

Oliver Krumpek, Oliver Heimann, Jörg Krüger

2025-06-05

UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting

Jaehoon Choi, Dongki Jung, Christopher Maxey, Yonghan Lee, Sungmin Eum et al.

2025-06-05 | Neural Rendering, Novel View Synthesis

Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting

Alfred T. Christiansen, Andreas H. Højrup, Morten K. Stephansen, Md Ibtihaj A. Sakib, Taman S. Poojary et al.

2025-06-05 | Semantic Segmentation, Point Cloud Segmentation, 3DGS

Structure-Aware Radar-Camera Depth Estimation

Fuyi Zhang, Zhu Yu, Chunhao Li, Runmin Zhang, Xiaokai Bai et al.

2025-06-05 | regression, Depth Estimation, Monocular Depth Estimation

Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts

Gengluo Li, Huawen Shen, Yu Zhou

2025-06-05 | Text Retrieval, Retrieval

PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment

Edoardo Bianchi, Antonio Liotta

2025-06-05

Multi-scale Image Super Resolution with a Single Auto-Regressive Model

Enrique Sanchez, Isma Hadji, Adrian Bulat, Christos Tzelepis, Brais Martinez et al.

2025-06-05 | Super-Resolution, Image Super-Resolution