
Neural Topic Model via Optimal Transport

He Zhao, Dinh Phung, Viet Huynh, Trung Le, Wray Buntine

2020-08-12 | ICLR 2021 | Topic Models

Abstract

Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have attracted increasing research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, their performance often degrades severely on short documents. The requirement of reparameterisation could also compromise their training quality and model flexibility. To address these shortcomings, we present a new neural topic model via the theory of optimal transport (OT). Specifically, we propose to learn the topic distribution of a document by directly minimising its OT distance to the document's word distribution. Importantly, the cost matrix of the OT distance models the weights between topics and words, and is constructed from the distances between topics and words in an embedding space. Our proposed model can be trained efficiently with a differentiable loss. Extensive experiments show that our framework significantly outperforms the state-of-the-art NTMs on discovering more coherent and diverse topics and deriving better document representations for both regular and short texts.
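
The core idea in the abstract, an OT distance between a document's topic distribution and its word distribution with a cost matrix built from embedding distances, can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes cosine distance for the cost matrix and entropy-regularised Sinkhorn iterations as the differentiable approximation of the OT distance; the function names `cosine_cost` and `sinkhorn_distance` and the parameters `reg` and `n_iters` are hypothetical choices made for illustration.

```python
import numpy as np


def cosine_cost(topic_emb, word_emb):
    """Cost matrix M[k, v] = 1 - cosine similarity of topic k and word v.

    Assumption: cosine distance in a shared embedding space.
    """
    t = topic_emb / np.linalg.norm(topic_emb, axis=1, keepdims=True)
    w = word_emb / np.linalg.norm(word_emb, axis=1, keepdims=True)
    return 1.0 - t @ w.T


def sinkhorn_distance(a, b, cost, reg=0.1, n_iters=200):
    """Entropy-regularised OT cost between histograms a (K,) and b (V,).

    a: topic distribution (strictly positive, sums to 1).
    b: normalised word distribution of the document (sums to 1).
    Returns the transport cost and the transport plan of shape (K, V).
    """
    kernel = np.exp(-cost / reg)          # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)            # scale columns towards b
        u = a / (kernel @ v)              # scale rows towards a
    plan = u[:, None] * kernel * v[None, :]
    return float(np.sum(plan * cost)), plan


# Toy example: 2 topics and 4 words embedded in 2 dimensions.
topics = np.array([[1.0, 0.0], [0.0, 1.0]])
words = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
M = cosine_cost(topics, words)

doc_words = np.array([0.5, 0.5, 0.0, 0.0])   # document uses only words 0 and 1
near, _ = sinkhorn_distance(np.array([0.9, 0.1]), doc_words, M)
far, _ = sinkhorn_distance(np.array([0.1, 0.9]), doc_words, M)
print(near < far)  # topic 0 lies closer to the document's words
```

In a trained model the topic distribution would come from an encoder network and the distance would serve as the loss to minimise; here the two candidate topic distributions are fixed by hand only to show that the distance is smaller when topic mass sits near the document's words.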

Results

Task                 Dataset        Metric  Value  Model
Text Classification  AG News        C_v     0.37   NSTM
Text Classification  AG News        NPMI    -0.04  NSTM
Text Classification  20NewsGroups   C_v     0.38   NSTM
Topic Models         AG News        C_v     0.37   NSTM
Topic Models         AG News        NPMI    -0.04  NSTM
Topic Models         20NewsGroups   C_v     0.38   NSTM
Classification       AG News        C_v     0.37   NSTM
Classification       AG News        NPMI    -0.04  NSTM
Classification       20NewsGroups   C_v     0.38   NSTM

Related Papers

Narrative Shift Detection: A Hybrid Approach of Dynamic Topic Models and Large Language Models (2025-06-25)
Constrained Non-negative Matrix Factorization for Guided Topic Modeling of Minority Topics (2025-05-22)
topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation (2025-05-19)
HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling (2025-05-12)
Fully Bayesian Approaches to Topics over Time (2025-04-21)
Evaluating Negative Sampling Approaches for Neural Topic Models (2025-03-23)
Multivariate Gaussian Topic Modelling: A novel approach to discover topics with greater semantic coherence (2025-03-19)
Seeded Poisson Factorization: Leveraging domain knowledge to fit topic models (2025-03-04)