TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Revisiting Distributional Correspondence Indexing: A Pytho...

Revisiting Distributional Correspondence Indexing: A Python Reimplementation and New Experiments

Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani

2018-10-19Text ClassificationSentiment AnalysisTransfer Learningtext-classificationGeneral ClassificationDomain Adaptation
PaperPDFCode(official)

Abstract

This paper introduces PyDCI, a new implementation of Distributional Correspondence Indexing (DCI) written in Python. DCI is a transfer learning method for cross-domain and cross-lingual text classification for which we had provided an implementation (here called JaDCI) built on top of JaTeCS, a Java framework for text classification. PyDCI is a stand-alone version of DCI that exploits scikit-learn and the SciPy stack. We here report on new experiments that we have carried out in order to test PyDCI, and in which we use as baselines new high-performing methods that have appeared after DCI was originally proposed. These experiments show that, thanks to a few subtle ways in which we have improved DCI, PyDCI outperforms both JaDCI and the above-mentioned high-performing methods, and delivers the best known results on the two popular benchmarks on which we had tested DCI, i.e., MultiDomainSentiment (a.k.a. MDS -- for cross-domain adaptation) and Webis-CLS-10 (for cross-lingual adaptation). PyDCI, together with the code allowing to replicate our experiments, is available at https://github.com/AlexMoreo/pydci .

Results

TaskDatasetMetricValueModel
Sentiment AnalysisMulti-Domain Sentiment DatasetAverage83.3Distributional Correspondence Indexing
Sentiment AnalysisMulti-Domain Sentiment DatasetBooks81.4Distributional Correspondence Indexing
Sentiment AnalysisMulti-Domain Sentiment DatasetDVD81Distributional Correspondence Indexing
Sentiment AnalysisMulti-Domain Sentiment DatasetElectronics8506Distributional Correspondence Indexing
Sentiment AnalysisMulti-Domain Sentiment DatasetKitchen85.9Distributional Correspondence Indexing

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Making Language Model a Hierarchical Classifier and Generator2025-07-17AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles2025-07-15DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15