TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers

575,626 papers

Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models

Ling Li, Yao Zhou, Yuxuan Liang, Fugee Tsung, Jiaheng Wei et al.

2025-06-17geo-localization
Paper
DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification

Matt Poyser, Toby P. Breckon

2025-06-17Image ClassificationNeural Architecture Search
Paper
3DGS-IEval-15K: A Large-scale Image Quality Evaluation Database for 3D Gaussian-Splatting

Yuke Xing, Jiarui Wang, Peizhi Niu, Wenjie Huang, Guangtao Zhai et al.

2025-06-17Novel View SynthesisImage Quality Assessment3DGS
PaperCode
VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning

Md. Adnanul Islam, Md. Faiyaz Abdullah Sayeedi, Md. Asaduzzaman Shuvo, Muhammad Ziaur Rahman, Shahanur Rahman Bappy et al.

2025-06-17Segmentationobject-detectionObject Detection
PaperCode
Unsupervised Imaging Inverse Problems with Diffusion Distribution Matching

Giacomo Meanti, Thomas Ryckeboer, Michael Arbel, Julien Mairal

2025-06-17Super-ResolutionDeblurringImage Restoration+1
PaperCode
Align Your Flow: Scaling Continuous-Time Flow Map Distillation

Amirmojtaba Sabour, Sanja Fidler, Karsten Kreis

2025-06-17Image Generation
Paper
PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation

Ming Xu, Xu Zhang

2025-06-173D Human Pose EstimationMonocular 3D Human Pose EstimationPose Estimation+1
PaperCode
Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet's Performance with Automatically Generated Document Images

Krishna Sahukara, Zineddine Bettouche, Andreas Fischer

2025-06-17Table DetectionData Augmentation
Paper
Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images

David Butler, Adrian Hilton, Gustavo Carneiro

2025-06-17Image GenerationInterpretable Machine Learning
Paper
DreamLight: Towards Harmonious and Consistent Image Relighting

Yong liu, Wenpeng Xiao, Qianqian Wang, Junlin Chen, Shiyin Wang et al.

2025-06-17DisentanglementImage Relighting
Paper
Exploring Diffusion with Test-Time Training on Efficient Image Restoration

Rongchang Lu, Tianduo Luo, Yunzhi Jiang, Conghan Yue, Pei Yang et al.

2025-06-17DenoisingSuper-ResolutionSSIM+1
Paper
SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks

Zijian Song, Xiaoxin Lin, Qiuming Huang, Guangrun Wang, Liang Lin et al.

2025-06-17Spatial ReasoningMath
Paper
MOL: Joint Estimation of Micro-Expression, Optical Flow, and Landmark via Transformer-Graph-Style Convolution

Zhiwen Shao, Yifan Cheng, Feiran Li, Yong Zhou, Xuequan Lu et al.

2025-06-17Optical Flow EstimationMicro Expression RecognitionMicro-Expression Recognition+1
PaperCode
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs

Yu Qi, Lipeng Gu, Honghua Chen, Liangliang Nan, Mingqiang Wei et al.

2025-06-17Visual GroundingSpeech-to-TextContrastive Learning+1
Paper
Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection

Zhijing Wan, Zhixiang Wang, Zheng Wang, Xin Xu, Shin'ichi Satoh et al.

2025-06-17
PaperCode
Dense360: Dense Understanding from Omnidirectional Panoramas

Yikang Zhou, Tao Zhang, Dizhe Zhang, Shunping Ji, Xiangtai Li et al.

2025-06-17
Paper
Adapting Lightweight Vision Language Models for Radiological Visual Question Answering

Aditya Shourya, Michel Dumontier, Chang Sun

2025-06-17Question AnsweringDiagnosticVisual Question Answering (VQA)+1
PaperCode
Model compression using knowledge distillation with integrated gradients

David E. Hernandez, Jose Chang, Torbjörn E. M. Nordling

2025-06-17Model CompressionData AugmentationKnowledge Distillation
Paper
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Hongyu Wang, Jiayu Xu, Ruiping Wang, Yan Feng, Yitao Zhai et al.

2025-06-17Quantization
Paper
Toward Rich Video Human-Motion2D Generation

Ruihao Xi, Xuekuan Wang, Yongcheng Li, Shuhua Li, Zichen Wang et al.

2025-06-17
Paper
PreviousPage 166 of 28782Next