Papers

575,626 papers

Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays

Abduz Zami, Shadman Sobhan, Rounaq Hossain, Md. Sawran Sorker, Mohiuddin Ahmed et al.

2025-07-01SegmentationSemantic SegmentationAll+1

Paper

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao, Ting Xiao, Yugang Jiang et al.

2025-07-01

Paper

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

GLM-V Team, :, Wenyi Hong, Wenmeng Yu, Xiaotao Gu et al.

2025-07-01document understandingMultimodal ReasoningVideo Understanding

Paper Code

ATSTrack: Enhancing Visual-Language Tracking by Aligning Temporal and Spatial Scales

Yihao Zhen, Qiang Wang, Yu Qiao, Liangqiong Qu, Huijie Fan et al.

2025-07-01

Paper

CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs

Jiaming Zhang, Rui Hu, Qing Guo, Wei Yang Bryan Lim

2025-07-01Text GenerationVideo Understanding

Paper

Why Multi-Interest Fairness Matters: Hypergraph Contrastive Multi-Interest Learning for Fair Conversational Recommender System

Yongsen Zheng, Zongxuan Xie, Guohua Wang, Ziyao Liu, Liang Lin et al.

2025-07-01FairnessContrastive LearningRecommendation Systems

Paper

LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling

Huaqiu Li, Yong Wang, Tongwen Huang, Hailang Huang, Haoqian Wang et al.

2025-07-01Image RestorationUnified Image Restoration

Paper Code

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model

Yupeng Zheng, Pengxuan Yang, Zebin Xing, Qichao Zhang, Yuhang Zheng et al.

2025-07-01Self-Supervised LearningAutonomous Driving

Paper

Enhancing LLM Agent Safety via Causal Influence Prompting

Dongyoon Hahm, Woogyeol Jin, June Suk Choi, Sungsoo Ahn, Kimin Lee et al.

2025-07-01Decision Making

Paper Code

TABASCO: A Fast, Simplified Model for Molecular Generation with Improved Physical Quality

Carlos Vonessen, Charles Harris, Miruna Cretu, Pietro Liò

2025-07-01

Paper Code

Empirical Analysis Of Heuristic and Approximation Algorithms for the The Mutual-Visibility Problem

Vanja Stojanović, Bor Pangeršič

2025-07-01

Paper Code

A Diagrammatic Calculus for a Functional Model of Natural Language Semantics

Matthieu Pierre Boyer

2025-07-01

Paper

Instant Particle Size Distribution Measurement Using CNNs Trained on Synthetic Data

Yasser El Jarida, Youssef Iraqi, Loubna Mekouar

2025-07-01

Paper Code

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Nikolai Lund Kühne, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan

2025-07-01Speech RecognitionAutomatic Speech Recognitionspeech-recognition+1

Paper Code

Geometric Gaussian Approximations of Probability Distributions

Nathaël Da Costa, Bálint Mucsányi, Philipp Hennig

2025-07-01

Paper

Understanding Generalization in Node and Link Prediction

Antonis Vasileiou, Timo Stoll, Christopher Morris

2025-07-01PredictionLink Prediction

Paper

Process-aware and high-fidelity microstructure generation using stable diffusion

Hoang Cuong Phan, Minh Tien Tran, Chihun Lee, Hoheok Kim, Sehyok Oh et al.

2025-07-01Semantic SegmentationImage Generation

Paper

A Unified Transformer-Based Framework with Pretraining For Whole Body Grasping Motion Generation

Edward Effendy, Kuan-Wei Tseng, Rei Kawakami

2025-07-01Grasp GenerationMotion Generation

Paper Code

Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment

Kai Zhou, Shuhai Zhang, Zeng You, Jinwu Hu, Mingkui Tan et al.

2025-07-01One-Shot 3D Action RecognitionSkeleton Based Action RecognitionZero Shot Skeletal Action Recognition+2

Paper Code

LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs

Haoran Lou, Chunxiao Fan, Ziyan Liu, Yuexin Wu, Xinxiang Wang et al.

2025-07-01Large Language Model

Paper Code

PreviousPage 60 of 28782Next