Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data

Jinlu Zhang, Zixi Kang, Yizhou Wang

2025-06-09
Paper
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Peiyan Li, Yixiang Chen, Hongtao Wu, Xiao Ma, Xiangnan Wu et al.

2025-06-09 | Robot Manipulation, Vision-Language-Action
Paper
Reinforcing Multimodal Understanding and Generation with Dual Self-rewards

Jixiang Hong, Yiran Zhang, Guanzhong Wang, Yi Liu, Ji-Rong Wen et al.

2025-06-09
Paper
PairEdit: Learning Semantic Variations for Exemplar-based Image Editing

Haoguang Lu, Jiacheng Chen, Zhenguo Yang, Aurele Tohokantche Gnanha, Fu Lee Wang et al.

2025-06-09
Paper | Code
MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

JunHao Chen, Yulia Tsvetkov, Xiaochuang Han

2025-06-09 | multimodal generation, Image Generation
Paper
ZeroVO: Visual Odometry with Minimal Assumptions

Lei Lai, Zekai Yin, Eshed Ohn-Bar

2025-06-09 | CVPR 2025 1 | Zero-shot Generalization, Visual Odometry, Camera Calibration, +1
Paper
Hidden in plain sight: VLMs overlook their visual representations

Stephanie Fu, Tyler Bonnen, Devin Guillory, Trevor Darrell

2025-06-09 | Depth Estimation
Paper
Vision Transformers Don't Need Trained Registers

Nick Jiang, Amil Dravid, Alexei Efros, Yossi Gandelsman

2025-06-09
Paper | Code
A Temporal FRBR/FRBRoo-Based Model for Component-Level Versioning of Legal Norms

Hudson de Martim

2025-06-09 | Knowledge Graphs
Paper
A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit

Andrea Tiranti, Francesco Wanderlingh, Enrico Simetti, Marco Baglietto, Giovanni Indiveri et al.

2025-06-09 | Motion Planning
Paper
EgoM2P: Egocentric Multimodal Multitask Pretraining

Gen Li, Yutong Chen, Yiqian Wu, Kaifeng Zhao, Marc Pollefeys et al.

2025-06-09 | Depth Estimation, Gaze Prediction, Monocular Depth Estimation
Paper
Stone Soup: ADS-B-based Multi-Target Tracking with Stochastic Integration Filter

John Hiles, Jakub Matousek, Erik Blasch, Ruixin Niu, Ondrej Straka et al.

2025-06-09
Paper
LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement

Dimitris Panagopoulos, Adolfo Perrusquia, Weisi Guo

2025-06-09 | Reinforcement Learning, Decision Making
Paper
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor

Rishit Dagli, Yushi Guan, Sankeerth Durvasula, Mohammadreza Mofayezi, Nandita Vijaykumar et al.

2025-06-09 | 3D Generation
Paper
Neural Tangent Kernel Analysis to Probe Convergence in Physics-informed Neural Solvers: PIKANs vs. PINNs

Salah A. Faroughi, Farinaz Mostajeran

2025-06-09
Paper
HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization

Hongzheng Chen, Yingheng Wang, Yaohui Cai, Hins Hu, Jiajie Li et al.

2025-06-09 | Combinatorial Optimization, Memorization
Paper | Code
Real-time Localization of a Soccer Ball from a Single Camera

Dmitrii Vorobev, Artem Prosvetov, Karim Elhadji Daou

2025-06-09
Paper
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Zhengyao Lv, Tianlin Pan, Chenyang Si, Zhaoxi Chen, WangMeng Zuo et al.

2025-06-09 | Attribute
Paper | Code
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References

Ming-Feng Li, Xin Yang, Fu-En Wang, Hritam Basak, Yuyin Sun et al.

2025-06-09 | CVPR 2025 1 | Image to 3D, Pose Estimation, 6D Pose Estimation using RGB
Paper
Reparameterized LLM Training via Orthogonal Equivalence Transformation

Zeju Qiu, Simon Buchholz, Tim Z. Xiao, Maximilian Dax, Bernhard Schölkopf et al.

2025-06-09
Paper