TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/self-prompting analogical reasoning for uav object detection

self-prompting analogical reasoning for uav object detection

Nianxin Li, Mao Ye, Lihua Zhou, Song Tang, Yan Gan, Zizhuo Liang, Xiatian Zhu

2025-04-11Proceedings of the AAAI Conference on Artificial Intelligence 2025 4Object Detection In Aerial Imagesgraph constructionobject-detectionObject DetectionSmall Object Detection
PaperPDFCode

Abstract

Unmanned Aerial Vehicle Object Detection (UAVOD) presents unique challenges due to varying altitudes, dynamic backgrounds, and the small size of objects. Traditional de- tection methods often struggle with these challenges, as they typically rely on visual features only and fail to ex- tract the semantic relations between the objects. To address these limitations,we propose a novel approach named Self- Prompting Analogical Reasoning (SPAR). Ourmethod uti- lizes the vision-languagemodel (CLIP) to generate context- aware prompts based on image features, providing rich se- mantic information that guides analogical reasoning. SPAR includes two main modules: self-prompting and analogi- cal reasoning. Self-prompting module based on learnable description and CLIP-text encoder generates context-aware prompt by combining specific image feature; then an object- ness prompt scoremap is produced by computing the simi- laritybetweenpixel-level features andcontext-awareprompt. Withthis scoremap,multi-scale image features are enhanced and pixel-level features are chosen for graph construction. While for analogical reasoningmodule, graph nodes consist of category-level prompt nodes and pixel-level image feature nodes.Analogical inference is based on graph convolution. Under the guidance of category-level nodes, different-scale object features have been enhanced, which helps achieve more accuratedetectionof challengingobjects.Extensive ex- periments illustrate that SPARoutperforms traditionalmeth- ods,offeringamorerobustandaccuratesolutionforUAVOD.

Related Papers

Efficiently Constructing Sparse Navigable Graphs2025-07-17NGTM: Substructure-based Neural Graph Topic Model for Interpretable Graph Generation2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15