TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SeeDS: Semantic Separable Diffusion Synthesizer for Zero-s...

SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

Pengfei Zhou, Weiqing Min, Yang Zhang, Jiajun Song, Ying Jin, Shuqiang Jiang

2023-10-07DenoisingFood recommendationZero-Shot Object DetectionGeneralized Zero-Shot Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Food detection is becoming a fundamental task in food computing that supports various multimedia applications, including food recommendation and dietary monitoring. To deal with real-world scenarios, food detection needs to localize and recognize novel food objects that are not seen during training, demanding Zero-Shot Detection (ZSD). However, the complexity of semantic attributes and intra-class feature diversity poses challenges for ZSD methods in distinguishing fine-grained food classes. To tackle this, we propose the Semantic Separable Diffusion Synthesizer (SeeDS) framework for Zero-Shot Food Detection (ZSFD). SeeDS consists of two modules: a Semantic Separable Synthesizing Module (S$^3$M) and a Region Feature Denoising Diffusion Model (RFDDM). The S$^3$M learns the disentangled semantic representation for complex food attributes from ingredients and cuisines, and synthesizes discriminative food features via enhanced semantic information. The RFDDM utilizes a novel diffusion model to generate diversified region features and enhances ZSFD via fine-grained synthesized features. Extensive experiments show the state-of-the-art ZSFD performance of our proposed method on two food datasets, ZSFooD and UECFOOD-256. Moreover, SeeDS also maintains effectiveness on general ZSD datasets, PASCAL VOC and MS COCO. The code and dataset can be found at https://github.com/LanceZPF/SeeDS.

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCORecall64SeeDS
Object DetectionMS-COCOmAP20.6SeeDS
Object DetectionPASCAL VOC'07mAP68.9SeeDS
3DMS-COCORecall64SeeDS
3DMS-COCOmAP20.6SeeDS
3DPASCAL VOC'07mAP68.9SeeDS
2D ClassificationMS-COCORecall64SeeDS
2D ClassificationMS-COCOmAP20.6SeeDS
2D ClassificationPASCAL VOC'07mAP68.9SeeDS
2D Object DetectionMS-COCORecall64SeeDS
2D Object DetectionMS-COCOmAP20.6SeeDS
2D Object DetectionPASCAL VOC'07mAP68.9SeeDS
16kMS-COCORecall64SeeDS
16kMS-COCOmAP20.6SeeDS
16kPASCAL VOC'07mAP68.9SeeDS

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16