TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Converting Transformers into DGNNs Form

Converting Transformers into DGNNs Form

Jie Zhang, Kuan-Chieh Wang, Bo-Wei Chiu, Min-Te Sun

2025-02-01FormDocument Classification
PaperPDFCode(official)

Abstract

Recent advances in deep learning have established Transformer architectures as the predominant modeling paradigm. Central to the success of Transformers is the self-attention mechanism, which scores the similarity between query and key matrices to modulate a value matrix. This operation bears striking similarities to digraph convolution, prompting an investigation into whether digraph convolution could serve as an alternative to self-attention. In this study, we formalize this concept by introducing a synthetic unitary digraph convolution based on the digraph Fourier transform. The resulting model, which we term Converter, effectively converts a Transformer into a Directed Graph Neural Network (DGNN) form. We have tested Converter on Long-Range Arena benchmark, long document classification, and DNA sequence-based taxonomy classification. Our experimental results demonstrate that Converter achieves superior performance while maintaining computational efficiency and architectural simplicity, which establishes it as a lightweight yet powerful Transformer variant.

Results

TaskDatasetMetricValueModel
Language ModellingLRAAvg75.94Converter
Language ModellingLRAImage61.02Converter
Language ModellingLRAListOps60.38Converter
Language ModellingLRAPathfinder88.43Converter
Language ModellingLRARetrieval83.41Converter
Language ModellingLRAText86.44Converter

Related Papers

FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation2025-07-11Controlled Retrieval-augmented Context Evaluation for Long-form RAG2025-06-24FormGym: Doing Paperwork with Agents2025-06-17FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding2025-06-16Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks2025-06-16ARGUS: Hallucination and Omission Evaluation in Video-LLMs2025-06-09LLM Unlearning Should Be Form-Independent2025-06-09Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning2025-06-06