TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Listener-Rewarded Thinking in VLMs for Image Preferences

Alexander Gambashidze, Li Pengyi, Matvey Skripkin, Andrey Galichin, Anton Gusarov et al.

2025-06-28Reinforcement LearningMemorization
Paper
FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition

Hongyan An, Kuan Zhu, Xin He, Haiyun Guo, Chaoyang Zhao et al.

2025-06-28Pedestrian Attribute RecognitionAttributeContrastive Learning
Paper
Point Cloud Compression and Objective Quality Assessment: A Survey

Yiling Xu, Yujie Zhang, Shuting Xia, Kaifa Yang, He Huang et al.

2025-06-28BenchmarkingAutonomous DrivingPoint Cloud Quality Assessment
Paper
Prompt Mechanisms in Medical Imaging: A Comprehensive Survey

Hao Yang, Xinlong Liang, Zhang Li, Yue Sun, Zheyu Hu et al.

2025-06-28Feature EngineeringPrompt EngineeringImage Generation
Paper
Deterministic Object Pose Confidence Region Estimation

Jinghao Wang, Zhang Li, Zi Wang, Banglei Guan, Yang Shang et al.

2025-06-28Uncertainty QuantificationPose Estimation
Paper
SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds

Shashikant Verma, Shanmuganathan Raman

2025-06-28Disentanglement
Paper
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding

Minchao Jiang, Shunyu Jia, Jiaming Gu, Xiaoyuan Lu, Guangming Zhu et al.

2025-06-28Novel View SynthesisScene UnderstandingSemantic Segmentation+3
Paper
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder

Dang Jisheng, Wu Xudong, Wang Bimei, Lv Ning, Chen Jiayu et al.

2025-06-28Question AnsweringSegmentationVideo Question Answering+6
PaperCode
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval

Li-Cheng Shen, Jih-Kang Hsieh, Wei-Hua Li, Chu-Song Chen

2025-06-28Cross-Modal RetrievalRerankingReferring Expression+9
Paper
Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models

Atharv Mittal, Agam Pandey, Amritanshu Tiwari, Sukrit Jindal, Swadesh Swain et al.

2025-06-28Question AnsweringImage ClassificationVisual Question Answering
PaperCode
Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models

Younwoo Choi, Changling Li, Yongjin Yang, Zhijing Jin

2025-06-28
Paper
Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems

Yucheng Cai, Yuxuan Wu, Yi Huang, Junlan Feng, Zhijian Ou et al.

2025-06-28RAGResponse Generation
Paper
Potential Customer Lifetime Value in Financial Institutions: The Usage Of Open Banking Data to Improve CLV Estimation

João B. G. de Brito, Rodrigo Heldt, Cleo S. Silveira, Matthias Bogaert, Guilherme B. Bucco et al.

2025-06-28Marketing
Paper
Positioning AI Tools to Support Online Harm Reduction Practice: Applications and Design Directions

Kaixuan Wang, Jason T. Jacques, Chenxin Diao

2025-06-28
Paper
Sensing Security Oriented OFDM-ISAC Against Multi-Intercept Threats

Lingyun Xu, Bowen Wang, Huiyong Li, Ziyang Cheng

2025-06-28
Paper
Attention to Burstiness: Low-Rank Bilinear Prompt Tuning

Yuzhu Wang, Manni Duan, Shu Kong

2025-06-28Visual Prompt Tuning
PaperCode
ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment

Amir Aghdam, Vincent Tao Hu

2025-06-28Open Set Learningtext similarityLarge Language Model+2
PaperCode
DAPFAM: A Domain-Aware Patent Retrieval Dataset Aggregated at the Family Level

Iliass Ayaou, Denis Cavallucci, Hicham Chibane

2025-06-27Patent classificationRetrieval
Paper
Visual Structures Helps Visual Reasoning: Addressing the Binding Problem in VLMs

Amirmohammad Izadi, Mohammad Ali Banayeeanzade, Fatemeh Askari, Ali Rahimiakbar, Mohammad Mahdi Vahedi et al.

2025-06-27Visual Reasoning
Paper
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

Xi Chen, Mingkang Zhu, Shaoteng Liu, Xiaoyang Wu, Xiaogang Xu et al.

2025-06-27Representation LearningLogical ReasoningVisual Reasoning
Paper
PreviousPage 65 of 28782Next