TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Robust Adaptation of Large Multimodal Models for Retrieval...

Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection

Jingbiao Mei, Jinghong Chen, Guangyu Yang, Weizhe Lin, Bill Byrne

2025-02-18Hateful Meme ClassificationDomain GeneralizationContrastive LearningRetrievalMeme Classification
PaperPDFCode(official)

Abstract

Hateful memes have become a significant concern on the Internet, necessitating robust automated detection systems. While LMMs have shown promise in hateful meme detection, they face notable challenges like sub-optimal performance and limited out-of-domain generalization capabilities. Recent studies further reveal the limitations of both SFT and in-context learning when applied to LMMs in this setting. To address these issues, we propose a robust adaptation framework for hateful meme detection that enhances in-domain accuracy and cross-domain generalization while preserving the general vision-language capabilities of LMMs. Experiments on six meme classification datasets show that our approach achieves state-of-the-art performance, outperforming larger agentic systems. Moreover, our method generates higher-quality rationales for explaining hateful content compared to standard SFT, enhancing model interpretability.

Results

TaskDatasetMetricValueModel
Meme ClassificationHateful MemesAccuracy0.821RA-HMD (Qwen2-VL-7B)
Meme ClassificationHateful MemesROC-AUC0.911RA-HMD (Qwen2-VL-7B)
Meme ClassificationHateful MemesAccuracy0.809RA-HMD (LLaVA-1.5-7B)
Meme ClassificationHateful MemesROC-AUC0.897RA-HMD (LLaVA-1.5-7B)
Meme ClassificationHateful MemesAccuracy0.791RA-HMD (Qwen2-VL-2B)
Meme ClassificationHateful MemesROC-AUC0.884RA-HMD (Qwen2-VL-2B)
Meme ClassificationMultiOFFAccuracy71.1RA-HMD (Qwen2-VL-7B)
Meme ClassificationMultiOFFF164.8RA-HMD (Qwen2-VL-7B)
Meme ClassificationHateful MemesAUROC91.1RA-HMD (Qwen2-VL-7B)
Meme ClassificationHarMemeAUROC93.2RA-HMD (Qwen2VL-7B)
Meme ClassificationHarMemeAccuracy88.1RA-HMD (Qwen2VL-7B)
Meme ClassificationHarMemeAUROC92.9RA-HMD (Qwen2VL-2B)
Meme ClassificationHarMemeAccuracy87.7RA-HMD (Qwen2VL-2B)
Meme ClassificationHarm-PAccuracy91.6RA-HMD (Qwen2-VL-7B)
Meme ClassificationHarm-PF191.1RA-HMD (Qwen2-VL-7B)
Meme ClassificationPrideMMAccuracy78.1RA-HMD (Qwen2-VL-7B)
Meme ClassificationPrideMMF178.4RA-HMD (Qwen2-VL-7B)
Meme ClassificationPrideMMAccuracy76RA-HMD (Qwen2-VL-2B)
Meme ClassificationPrideMMF176.7RA-HMD (Qwen2-VL-2B)

Related Papers

Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17