TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models

Xuanchi Ren, Yifan Lu, Tianshi Cao, Ruiyuan Gao, Shengyu Huang et al.

2025-06-10Synthetic Data Generation3D Lane Detectionobject-detection+4
PaperCode
Princeton365: A Diverse Dataset with Accurate Camera Pose

Karhan Kayan, Stamatis Alexandropoulos, Rishabh Jain, Yiming Zuo, Erich Liang et al.

2025-06-10Novel View SynthesisOptical Flow EstimationPose Estimation+1
PaperCode
Do MIL Models Transfer?

Daniel Shao, Richard J. Chen, Andrew H. Song, Joel Runevic, Ming Y. Lu et al.

2025-06-10Multiple Instance LearningTransfer Learning
PaperCode
SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map Construction

Fabian Immel, Jan-Hendrik Pauls, Richard Fehler, Frank Bieder, Jonas Merkert et al.

2025-06-10Autonomous Vehicles
PaperCode
Do Concept Replacement Techniques Really Erase Unacceptable Concepts?

Anudeep Das, Gurjot Singh, Prach Chantasantitam, N. Asokan

2025-06-10
Paper
Rethinking Range-View LiDAR Segmentation in Adverse Weather

Longyu Yang, Ping Hu, Lu Zhang, Jun Liu, Yap-Peng Tan et al.

2025-06-10Segmentation
Paper
ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations

Amirreza Rouhi, Solmaz Arezoomandan, Knut Peterson, Joseph T. Woods, David K. Han et al.

2025-06-10Re-Rankingobject-detectionObject Detection
Paper
ORIDa: Object-centric Real-world Image Composition Dataset

Jinwoo Kim, Sangmin Han, Jinho Jeong, Jiwoo Choi, Dongyoung Kim et al.

2025-06-10CVPR 2025 1
Paper
Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF

Anirudh Nanduri, Siyuan Huang, Rama Chellappa

2025-06-10Occlusion HandlingPerson Re-Identification
Paper
SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation

Hongjie Zhu, Xiwei Liu, Rundong Xue, Zeyu Zhang, Yong Xu et al.

2025-06-10Data AugmentationTransfer LearningSemantic Segmentation+4
PaperCode
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities

Wendong Bu, Yang Wu, Qifan Yu, Minghe Gao, Bingchen Miao et al.

2025-06-10
Paper
SkipVAR: Accelerating Visual Autoregressive Modeling via Adaptive Frequency-Aware Skipping

Jiajun Li, Yue Ma, Xinyu Zhang, Qingyan Wei, Songhua Liu et al.

2025-06-10SSIMImage Generation
PaperCode
Hyperbolic Dual Feature Augmentation for Open-Environment

Peilin Yu, Yuwei Wu, Zhi Gao, Xiaomeng Fan, Shuo Yang et al.

2025-06-10Few-Shot LearningOpen Set LearningMeta-Learning+5
Paper
MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis

José Morano, Botond Fazekas, Emese Sükei, Ronald Fecso, Taha Emre et al.

2025-06-10Segmentation
PaperCode
WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos

Negin Ghamsarian, Raphael Sznitman, Klaus Schoeffmann, Jens Kowal

2025-06-10
Paper
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval

Leqi Shen, Guoqiang Gong, Tianxiang Hao, Tao He, Yifeng Zhang et al.

2025-06-10CVPR 2025 1Video-Text RetrievalText RetrievalImage Captioning+2
PaperCode
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis

Jingguo Qu, Xinyang Han, Tonghuan Xiao, Jia Ai, Juan Wu et al.

2025-06-10Large Language ModelDomain Adaptation
PaperCode
Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

Shuyi Zhang, Xiaoshuai Hao, Yingbo Tang, Lingfeng Zhang, Pengwei Wang et al.

2025-06-10
Paper
HiSin: Efficient High-Resolution Sinogram Inpainting via Resolution-Guided Progressive Inference

Jiaze E, Srutarshi Banerjee, Tekin Bicer, Guannan Wang, yanfu Zhang et al.

2025-06-10Diagnostic
Paper
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation

Ziyao Huang, Zixiang Zhou, Juan Cao, Yifeng Ma, Yi Chen et al.

2025-06-10Human-Object Interaction DetectionVideo Generation
Paper
PreviousPage 260 of 28782Next