TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images

Sungjune Park, Hyunjun Kim, Beomchan Park, Yong Man Ro

2025-05-29Object Detection In Aerial ImagesNovel Object Detectionobject-detection+1
Paper
HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image

Junyi Guo, JingXuan Zhang, Fangyu Wu, Huanda Lu, Qiufeng Wang et al.

2025-05-29
Paper
Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging

Ping Wang, Lishun Wang, Gang Qu, Xiaodong Wang, Yulun Zhang et al.

2025-05-29CVPR 2025 1
PaperCode
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes

Sungjune Park, Hyunjun Kim, Junho Kim, Seongho Kim, Yong Man Ro et al.

2025-05-29Reinforcement LearningDecision Making
Paper
RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Liu Liu, XiaoFeng Wang, Guosheng Zhao, Keyu Li, Wenkang Qin et al.

2025-05-29Imitation LearningVideo Generation
Paper
Implicit Inversion turns CLIP into a Decoder

Antonio D'Orazio, Maria Rosaria Briglia, Donato Crisostomi, Dario Loi, Emanuele RodolĂ  et al.

2025-05-29Text-to-Image GenerationStyle TransferImage Reconstruction+2
PaperCode
LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Jonas Kulhanek, Marie-Julie Rakotosaona, Fabian Manhardt, Christina Tsalicoglou, Michael Niemeyer et al.

2025-05-293DGS
Paper
PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Xiao Yu, Yan Fang, Xiaojie Jin, Yao Zhao, Yunchao Wei et al.

2025-05-29Video Understanding
PaperCode
FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

Jeongsol Kim, Yeobin Hong, Jong Chul Ye

2025-05-29
PaperCode
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning

Jinquan Guan, Qi Chen, Lizhou Liang, Yuhang Liu, Vu Minh Hieu Phan et al.

2025-05-29Question AnsweringDiagnosticVisual Question Answering (VQA)+1
PaperCode
Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing

Tongtong Su, Chengyu Wang, Jun Huang, Dongming Lu

2025-05-29Video EditingOptical Flow EstimationVideo Restoration
PaperCode
PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

Haoyu Chen, Keda Tao, Yizao Wang, Xinlei Wang, Lei Zhu et al.

2025-05-29Photo RetouchingLanguage Modelling
Paper
HMAD: Advancing E2E Driving with Anchored Offset Proposals and Simulation-Supervised Multi-target Scoring

Bin Wang, Pingjun Li, Jinkun Liu, Jun Cheng, Hailong Lei et al.

2025-05-29Autonomous Driving
Paper
MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

Siyuan Wang, Jiawei Liu, Wei Wang, Yeying Jin, Jinsong Du et al.

2025-05-29Motion GenerationVideo Generation
PaperCode
TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance

Keren Ye, Ignacio Garcia Dorado, Michalis Raptis, Mauricio Delbracio, Irene Zhu et al.

2025-05-29Super-ResolutionImage Super-ResolutionOptical Character Recognition (OCR)
Paper
Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving

Yunshen Wang, Yicheng Liu, Tianyuan Yuan, Yucheng Mao, Yingshi Liang et al.

2025-05-29Autonomous Driving
Paper
Identification of Patterns of Cognitive Impairment for Early Detection of Dementia

Anusha A. S., Uma Ranjan, Medha Sharma, Siddharth Dutt

2025-05-29feature selection
Paper
EAD: An EEG Adapter for Automated Classification

Pushapdeep Singh, Jyoti Nigam, Medicherla Vamsi Krishna, Arnav Bhavsar, Aditya Nigam et al.

2025-05-29EEG Signal ClassificationClassificationEEG
Paper
CURVE: CLIP-Utilized Reinforcement Learning for Visual Image Enhancement via Simple Image Processing

Yuka Ogino, Takahiro Toizumi, Atsushi Ito

2025-05-29Image EnhancementReinforcement Learningreinforcement-learning+1
Paper
LeMoRe: Learn More Details for Lightweight Semantic Segmentation

Mian Muhammad Naeem Abid, Nancy Mehta, Zongwei Wu, Radu Timofte

2025-05-29Representation LearningSemantic Segmentation
PaperCode
PreviousPage 452 of 28782Next