TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Scalable and Reliable Capsule Networks for Challen...

Towards Scalable and Reliable Capsule Networks for Challenging NLP Applications

Wei Zhao, Haiyun Peng, Steffen Eger, Erik Cambria, Min Yang

2019-06-06ACL 2019 7Text ClassificationQuestion AnsweringGeneral ClassificationMulti-Label Text Classification
PaperPDFCodeCodeCodeCodeCode

Abstract

Obstacles hindering the development of capsule networks for challenging NLP applications include poor scalability to large output spaces and less reliable routing processes. In this paper, we introduce: 1) an agreement score to evaluate the performance of routing processes at instance level; 2) an adaptive optimizer to enhance the reliability of routing; 3) capsule compression and partial routing to improve the scalability of capsule networks. We validate our approach on two NLP tasks, namely: multi-label text classification and question answering. Experimental results show that our approach considerably improves over strong competitors on both tasks. In addition, we gain the best results in low-resource settings with few training instances.

Results

TaskDatasetMetricValueModel
Question AnsweringTrecQAMAP0.7773NLP-Capsule
Question AnsweringTrecQAMRR0.7416NLP-Capsule
Multi-Label Text ClassificationEUR-LexP@180.2NLP-Cap
Multi-Label Text ClassificationEUR-LexP@365.48NLP-Cap
Multi-Label Text ClassificationEUR-LexP@552.83NLP-Cap
Multi-Label Text ClassificationEUR-LexnDCG@180.2NLP-Cap
Multi-Label Text ClassificationEUR-LexnDCG@371.11NLP-Cap
Multi-Label Text ClassificationEUR-LexnDCG@568.8NLP-Cap
Text ClassificationRCV1P@197.05NLP-Cap
Text ClassificationRCV1P@381.27NLP-Cap
Text ClassificationRCV1P@556.33NLP-Cap
Text ClassificationRCV1nDCG@197.05NLP-Cap
Text ClassificationRCV1nDCG@392.47NLP-Cap
Text ClassificationRCV1nDCG@593.11NLP-Cap
Text ClassificationEUR-LexP@180.2NLP-Cap
Text ClassificationEUR-LexP@365.48NLP-Cap
Text ClassificationEUR-LexP@552.83NLP-Cap
Text ClassificationEUR-LexnDCG@180.2NLP-Cap
Text ClassificationEUR-LexnDCG@371.11NLP-Cap
Text ClassificationEUR-LexnDCG@568.8NLP-Cap
ClassificationRCV1P@197.05NLP-Cap
ClassificationRCV1P@381.27NLP-Cap
ClassificationRCV1P@556.33NLP-Cap
ClassificationRCV1nDCG@197.05NLP-Cap
ClassificationRCV1nDCG@392.47NLP-Cap
ClassificationRCV1nDCG@593.11NLP-Cap
ClassificationEUR-LexP@180.2NLP-Cap
ClassificationEUR-LexP@365.48NLP-Cap
ClassificationEUR-LexP@552.83NLP-Cap
ClassificationEUR-LexnDCG@180.2NLP-Cap
ClassificationEUR-LexnDCG@371.11NLP-Cap
ClassificationEUR-LexnDCG@568.8NLP-Cap

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14