TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Heterogeneous Temporal Hypergraph Neural Network

Huan Liu, Pengfei Jiao, Mengzhou Gao, Chaochao Chen, Di Jin et al.

2025-06-18Graph Representation LearningContrastive Learning
Paper
A family of graph GOSPA metrics for graphs with different sizes

Jinhao Gu, Ángel F. García-Fernández, Robert E. Firth, Lennart Svensson

2025-06-18Attribute
Paper
Uncovering Intention through LLM-Driven Code Snippet Description Generation

Yusuf Sulistyo Nugroho, Farah Danisha Salam, Brittany Reid, Raula Gaikovina Kula, Kazumasa Shimari et al.

2025-06-18Descriptive
Paper
An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW

Prateek Mehta, Anasuya Patil

2025-06-18Speech SynthesisOptical Character Recognition (OCR)
Paper
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models

Changli Tang, Yixuan Li, Yudong Yang, Jimin Zhuang, Guangzhi Sun et al.

2025-06-18Question AnsweringVideo Question AnsweringAudio captioning+3
PaperCode
Factorized RVQ-GAN For Disentangled Speech Tokenization

Sameer Khurana, Dominik Klement, Antoine Laurent, Dominik Bobos, Juraj Novosad et al.

2025-06-18DisentanglementKnowledge Distillation
Paper
Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models

Teysir Baoueb, Xiaoyu Bie, Xi Wang, Gaël Richard

2025-06-18Music GenerationText-to-Music Generation
Paper
PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction

Shufan Li, Aditya Grover

2025-06-18Text to Speechtext-to-speech
Paper
Early Attentive Sparsification Accelerates Neural Speech Transcription

Zifei Xu, Sayeh Sharify, Hesham Mostafa, Tristan Webb, Wanzin Yazar et al.

2025-06-18
Paper
HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models

Trishna Chakraborty, Udita Ghosh, Xiaopan Zhang, Fahim Faisal Niloy, Yue Dong et al.

2025-06-18Hallucination
Paper
Probabilistic Trajectory GOSPA: A Metric for Uncertainty-Aware Multi-Object Tracking Performance Evaluation

Yuxuan Xia, Ángel F. García-Fernández, Johan Karlsson, Yu Ge, Lennart Svensson et al.

2025-06-18Multi-Object TrackingObject Tracking
Paper
Correspondence-Free Multiview Point Cloud Registration via Depth-Guided Joint Optimisation

Yiran Zhou, YingYu Wang, Shoudong Huang, Liang Zhao

2025-06-18Point Cloud Registration
Paper
Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation

Hanbit Oh, Andrea M. Salcedo-Vázquez, Ixchel G. Ramirez-Alpizar, Yukiyasu Domae

2025-06-18Imitation LearningHallucinationRobot Manipulation
Paper
Context-Aware Deep Lagrangian Networks for Model Predictive Control

Lucas Schulze, Jan Peters, Oleg Arenz

2025-06-18
Paper
MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System

Miaoxin Pan, Jinnan Li, Yaowen Zhang, Yi Yang, Yufeng Yue et al.

2025-06-18Object SLAM
Paper
Model Predictive Path-Following Control for a Quadrotor

David Leprich, Mario Rosenfelder, Mario Hermle, Jingshan Chen, Peter Eberhard et al.

2025-06-18
Paper
RaCalNet: Radar Calibration Network for Sparse-Supervised Metric Depth Estimation

Xingrui Qin, Wentao Zhao, Chuan Cao, Yihe Niu, Houcheng Jiang et al.

2025-06-18Depth PredictionDepth Estimation
Paper
FindingDory: A Benchmark to Evaluate Memory in Embodied Agents

Karmesh Yadav, Yusuf Ali, Gunshi Gupta, Yarin Gal, Zsolt Kira et al.

2025-06-18
Paper
Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos

Kaifeng Zhang, Baoyu Li, Kris Hauser, Yunzhu Li

2025-06-18
Paper
Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning

Emanuele Musumeci, Michele Brienza, Francesco Argenziano, Vincenzo Suriani, Daniele Nardi et al.

2025-06-18
Paper
PreviousPage 150 of 28782Next