Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Prompt2SegCXR: Prompt to Segment All Organs and Diseases in Chest X-rays

Abduz Zami, Shadman Sobhan, Rounaq Hossain, Md. Sawran Sorker, Mohiuddin Ahmed et al.

2025-07-01 | Segmentation, Semantic Segmentation, All, +1
Paper
HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao, Ting Xiao, Yugang Jiang et al.

2025-07-01
Paper
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

GLM-V Team, Wenyi Hong, Wenmeng Yu, Xiaotao Gu et al.

2025-07-01 | document understanding, Multimodal Reasoning, Video Understanding
Paper | Code
ATSTrack: Enhancing Visual-Language Tracking by Aligning Temporal and Spatial Scales

Yihao Zhen, Qiang Wang, Yu Qiao, Liangqiong Qu, Huijie Fan et al.

2025-07-01
Paper
CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs

Jiaming Zhang, Rui Hu, Qing Guo, Wei Yang Bryan Lim

2025-07-01 | Text Generation, Video Understanding
Paper
Why Multi-Interest Fairness Matters: Hypergraph Contrastive Multi-Interest Learning for Fair Conversational Recommender System

Yongsen Zheng, Zongxuan Xie, Guohua Wang, Ziyao Liu, Liang Lin et al.

2025-07-01 | Fairness, Contrastive Learning, Recommendation Systems
Paper
LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling

Huaqiu Li, Yong Wang, Tongwen Huang, Hailang Huang, Haoqian Wang et al.

2025-07-01 | Image Restoration, Unified Image Restoration
Paper | Code
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing, Qichao Zhang, Yuhang Zheng et al.

2025-07-01 | Self-Supervised Learning, Autonomous Driving
Paper
Enhancing LLM Agent Safety via Causal Influence Prompting

Dongyoon Hahm, Woogyeol Jin, June Suk Choi, Sungsoo Ahn, Kimin Lee et al.

2025-07-01 | Decision Making
Paper | Code
TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality

Carlos Vonessen, Charles Harris, Miruna Cretu, Pietro Liò

2025-07-01
Paper | Code
Empirical Analysis Of Heuristic and Approximation Algorithms for the Mutual-Visibility Problem

Vanja Stojanović, Bor Pangeršič

2025-07-01
Paper | Code
A Diagrammatic Calculus for a Functional Model of Natural Language Semantics

Matthieu Pierre Boyer

2025-07-01
Paper
Instant Particle Size Distribution Measurement Using CNNs Trained on Synthetic Data

Yasser El Jarida, Youssef Iraqi, Loubna Mekouar

2025-07-01
Paper | Code
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Nikolai Lund Kühne, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan

2025-07-01 | Speech Recognition, Automatic Speech Recognition, speech-recognition, +1
Paper | Code
Geometric Gaussian Approximations of Probability Distributions

Nathaël Da Costa, Bálint Mucsányi, Philipp Hennig

2025-07-01
Paper
Understanding Generalization in Node and Link Prediction

Antonis Vasileiou, Timo Stoll, Christopher Morris

2025-07-01 | Prediction, Link Prediction
Paper
Process-aware and high-fidelity microstructure generation using stable diffusion

Hoang Cuong Phan, Minh Tien Tran, Chihun Lee, Hoheok Kim, Sehyok Oh et al.

2025-07-01 | Semantic Segmentation, Image Generation
Paper
A Unified Transformer-Based Framework with Pretraining For Whole Body Grasping Motion Generation

Edward Effendy, Kuan-Wei Tseng, Rei Kawakami

2025-07-01 | Grasp Generation, Motion Generation
Paper | Code
Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment

Kai Zhou, Shuhai Zhang, Zeng You, Jinwu Hu, Mingkui Tan et al.

2025-07-01 | One-Shot 3D Action Recognition, Skeleton Based Action Recognition, Zero Shot Skeletal Action Recognition, +2
Paper | Code
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs

Haoran Lou, Chunxiao Fan, Ziyan Liu, Yuexin Wu, Xinxiang Wang et al.

2025-07-01 | Large Language Model
Paper | Code