Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Kuanning Wang, Yuqian Fu, Tianyu Wang, Yanwei Fu, Longfei Liang et al.

2025-06-23 | Object Localization, Pose Estimation, Retrieval, +2
Paper
Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtyping

Pablo Meseguer, Rocío del Amor, Valery Naranjo

2025-06-23 | Benchmarking, Multiple Instance Learning, Transfer Learning
Paper | Code
SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification

Youcef Sklab, Hanane Ariouat, Eric Chenin, Edi Prifti, Jean-Daniel Zucker et al.

2025-06-23 | Image Classification, Classification, Point Cloud Generation
Paper
USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways

Shanliang Yao, Runwei Guan, Yi Ni, Sen Xu, Yong Yue et al.

2025-06-23 | Autonomous Driving, Object Tracking
Paper
TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models

Ce Li, Xiaofan Liu, Zhiyan Song, Ce Chi, Chen Zhao et al.

2025-06-23
Paper
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions

Vineet Kumar Rakesh, Soumya Mazumdar, Research Pratim Maity, Sarbajit Pal, Amitabha Das et al.

2025-06-23 | Talking Head Generation
Paper | Code
Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability

Jongoh Jeong, Hunmin Yang, Jaeseok Jeong, Kuk-Jin Yoon

2025-06-23
Paper
YouTube-Occ: Learning Indoor 3D Semantic Occupancy Prediction from YouTube Videos

Haoming Chen, Lichen Yuan, Tianfang Sun, Jingyu Gong, Xin Tan et al.

2025-06-23 | Representation Learning, 3D Semantic Occupancy Prediction
Paper
MinD: Unified Visual Imagination and Control via Hierarchical World Models

Xiaowei Chi, Kuangzhi Ge, Jiaming Liu, Siyuan Zhou, Peidong Jia et al.

2025-06-23 | Video Prediction, Video Generation
Paper
RDPO: Real Data Preference Optimization for Physics Consistency Video Generation

Wenxu Qian, Chaoyue Wang, Hou Peng, Zhiyu Tan, Hao Li et al.

2025-06-23 | Video Generation
Paper
PrITTI: Primitive-based Generation of Controllable and Editable 3D Semantic Scenes

Christina Ourania Tze, Daniel Dauner, Yiyi Liao, Dzmitry Tsishkou, Andreas Geiger et al.

2025-06-23 | Scene Generation
Paper
AI Agents-as-Judge: Automated Assessment of Accuracy, Consistency, Completeness and Clarity for Enterprise Documents

Sudip Dasgupta, Himanshu Shankar

2025-06-23 | AI Agent
Paper
TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge

Zhiyuan Zhang, Xiaosong Jia, GuanYu Chen, QiFeng Li, Junchi Yan et al.

2025-06-23
Paper
Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey

Xinyao Li, Jingjing Li, Fengling Li, Lei Zhu, Yang Yang et al.

2025-06-23 | Benchmarking, Transfer Learning
Paper
Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning

Yue Li, Meng Tian, Dechang Zhu, Jiangtong Zhu, Zhenyu Lin et al.

2025-06-23 | Motion Planning, Autonomous Driving
Paper
ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs

Haseeb Ullah Khan Shinwari, Muhammad Usama

2025-06-23 | Parameter-Efficient Fine-Tuning
Paper
DIP: Unsupervised Dense In-Context Post-training of Visual Representations

Sophia Sirko-Galouchenko, Spyros Gidaris, Antonin Vobecky, Andrei Bursuc, Nicolas Thome et al.

2025-06-23 | Meta-Learning, Scene Understanding
Paper | Code
A Comment On "The Illusion of Thinking": Reframing the Reasoning Cliff as an Agentic Gap

Sheraz Khan, Subha Madhavan, Kannan Natarajan

2025-06-23
Paper
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection

Anja Delić, Matej Grcić, Siniša Šegvić

2025-06-23 | Video Anomaly Detection, Anomaly Detection
Paper | Code
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition

Dustin Aganian, Erik Franze, Markus Eisenbach, Horst-Michael Gross

2025-06-23 | Skeleton Based Action Recognition, Word Embeddings, Action Recognition, +1
Paper