TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation

XiMing Xing, Yandong Guan, Jing Zhang, Dong Xu, Qian Yu et al.

2025-05-30Reinforcement LearningVector Graphics
Paper
ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation

Jing Huang, Yongkang Zhao, Yuhan Li, Zhitao Dai, Cheng Chen et al.

2025-05-30Semantic SegmentationMedical Image SegmentationImage Segmentation
Paper
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model

Yuting Zhang, Hao Lu, Qingyong Hu, Yin Wang, Kaishen Yuan et al.

2025-05-30CVPR 2025 1Multimodal Large Language ModelLarge Language ModelLanguage Modelling
PaperCode
SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds

Cheng Zeng, Xiatian Qi, Chi Chen, Kai Sun, Wangle Zhang et al.

2025-05-30Plane Instance SegmentationData AugmentationSemantic Segmentation+1
Paper
SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking

Yingjia Xu, Jinlin Wu, Zhen Chen, Daming Gao, Yang Yang et al.

2025-05-30Cross-Modal RetrievalText-based Person RetrievalPerson Retrieval+3
Paper
SORCE: Small Object Retrieval in Complex Environments

Chunxu Liu, Chi Xie, Xiaxu Chen, Wei Li, Feng Zhu et al.

2025-05-30BenchmarkingRetrievalImage Retrieval
PaperCode
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation

Bozhong Zheng, Jinye Gan, Xiaohao Xu, Wenqiao Li, Xiaonan Huang et al.

2025-05-30Anomaly Localization3D Anomaly DetectionAnomaly Detection
Paper
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

Runnan Lu, Yuxuan Zhang, Jiaming Liu, Haofan Wang, Yiren Song et al.

2025-05-30Denoising
PaperCode
PCIE_Pose Solution for EgoExo4D Pose and Proficiency Estimation Challenge

Feng Chen, Kanokphan Lertniphonphan, Qiancheng Yan, Xiaohui Fan, Jun Xie et al.

2025-05-30Pose Estimation
Paper
IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

Hanting Wang, Tao Jin, Wang Lin, Shulei Wang, Hai Huang et al.

2025-05-30Image Restoration
PaperCode
PCIE_Interaction Solution for Ego4D Social Interaction Challenge

Kanokphan Lertniphonphan, Feng Chen, Junda Xu, Fengbu Lan, Jun Xie et al.

2025-05-30
PaperCode
Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing

Mika Feng, Koichi Ito, Takafumi Aoki, Tetsushi Ohki, Masakatsu Nishigaki et al.

2025-05-30Face RecognitionData AugmentationFace Anti-Spoofing
Paper
S3CE-Net: Spike-guided Spatiotemporal Semantic Coupling and Expansion Network for Long Sequence Event Re-Identification

Xianheng Ma, Hongchen Tan, Xiuping Liu, Yi Zhang, Huasheng Wang et al.

2025-05-30Person Re-Identification
PaperCode
Leadership Assessment in Pediatric Intensive Care Unit Team Training

Liangyang Ouyang, Yuki Sakai, Ryosuke Furuta, Hisataka Nozawa, Hikoro Matsui et al.

2025-05-30Contact Detectionobject-detectionObject Detection
Paper
Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification

Maciej Wielgosz, Simon berg, Heikki Korpunen, Stephan Hoffmann

2025-05-30Video ClassificationActivity Recognition
Paper
D2AF: A Dual-Driven Annotation and Filtering Framework for Visual Grounding

Yichi Zhang, Gongwei Chen, Jun Zhu, Jia Wan

2025-05-30Visual Grounding
Paper
Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation

Roger Ferrod, Cássio F. Dantas, Luigi di Caro, Dino Ienco

2025-05-30DisentanglementData AugmentationAutonomous Driving+3
PaperCode
VUDG: A Dataset for Video Understanding Domain Generalization

Ziyi Wang, Zhi Gao, Boxuan Yu, Zirui Dai, Yuxiang Song et al.

2025-05-30Question AnsweringDomain GeneralizationVideo Question Answering+4
Paper
KairosAD: A SAM-Based Model for Industrial Anomaly Detection on Embedded Devices

Uzair Khan, Franco Fummi, Luigi Capogrosso

2025-05-30Anomaly Detection
PaperCode
DisTime: Distribution-based Time Representation for Video Large Language Models

Yingsen Zeng, Zepeng Huang, Yujie Zhong, Chengjian Feng, Jie Hu et al.

2025-05-30Temporal LocalizationVideo Understanding
PaperCode
PreviousPage 412 of 28782Next