TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

GroundingDINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models

Hamza Rasaee, Taha Koleilat, Hassan Rivaz

2025-06-30SegmentationSemantic SegmentationOrgan Segmentation
Paper
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Xiangtai Li, Tao Zhang, Yanwei Li, Haobo Yuan, Shihao Chen et al.

2025-06-30Visual GroundingCaption Generation
PaperCode
Visual and Memory Dual Adapter for Multi-Modal Object Tracking

Boyue Xu, Ruichao Hou, Tongwei Ren, Gangshan Wu

2025-06-30Object Tracking
PaperCode
Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking

Shiao Wang, Ju Huang, Qingchuan Ma, Jinfeng Gao, Chunyi Xu et al.

2025-06-30Visual Object TrackingObject Tracking
PaperCode
Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

Haoji Zhang, Yiqin Wang, Yansong Tang, Yong liu, Jiashi Feng et al.

2025-06-30cross-modal alignmentVideo Understanding
PaperCode
Graft: Integrating the Domain Knowledge via Efficient Parameter Synergy for MLLMs

Yang Dai, Jianxiang An, Tianwei Lin, Hongyang He, Hongzhe Huang et al.

2025-06-30
Paper
Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning

Anton Andreychuk, Konstantin Yakovlev, Aleksandr Panov, Alexey Skrynnik

2025-06-30Trajectory PlanningImitation Learning
PaperCode
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments

Xuan Yao, Junyu Gao, Changsheng Xu

2025-06-30Vision and Language NavigationDecision Making
PaperCode
Towards foundational LiDAR world models with efficient latent flow matching

Tianran Liu, Shengwen Zhao, Nicholas Rhinehart

2025-06-30
Paper
Epona: Autoregressive Diffusion World Model for Autonomous Driving

Kaiwen Zhang, Zhenyu Tang, Xiaotao Hu, Xingang Pan, Xiaoyang Guo et al.

2025-06-30Trajectory PlanningMotion PlanningVideo Prediction+2
PaperCode
Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model

Bowen Ding, Yuhan Chen, Futing Wang, Lingfeng Ming, Tao Lin et al.

2025-06-30Math
Paper
LLM Agents Are the Antidote to Walled Gardens

Samuele Marro, Philip Torr

2025-06-30
Paper
Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning

Seungjun Yi, Joakim Nguyen, Huimin Xu, Terence Lim, Andrew Well et al.

2025-06-30Large Language ModelLanguage Modelling
Paper
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Bo Liu, Leon Guertler, Simon Yu, Zichen Liu, Penghui Qi et al.

2025-06-30MathMulti-agent Reinforcement Learning
PaperCode
Ella: Embodied Social Agents with Lifelong Memory

Hongxin Zhang, Zheyuan Zhang, Zeyuan Wang, Zunzhe Zhang, Lixing Fang et al.

2025-06-30
Paper
Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent

Haocheng Yu, Yaxiong Wu, Hao Wang, Wei Guo, Yong liu et al.

2025-06-30User SimulationLarge Language ModelRecommendation Systems
PaperCode
L0: Reinforcement Learning to Become General Agents

Junjie Zhang, Jingyi Xi, Zhuoyang Song, Junyu Lu, Yuhua Ke et al.

2025-06-30Question AnsweringReinforcement Learningreinforcement-learning
PaperCode
Flow-Through Tensors: A Unified Computational Graph Architecture for Multi-Layer Transportation Network Optimization

Xuesong, Zhou, Taehooie Kim, Mostafa Ameli, Henan et al.

2025-06-30
Paper
Real-World En Call Center Transcripts Dataset with PII Redaction

Ha Dao, Gaurav Chawla, Raghu Banda, Caleb DeLeeuw

2025-06-30
PaperCode
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism

Zhiwen Tan, Jiaming Huang, Qintong Wu, Hongxuan Zhang, Chenyi Zhuang et al.

2025-06-30Question AnsweringReinforcement LearningRetrieval+1
PaperCode
PreviousPage 62 of 28782Next