Converting Transformers into DGNNs Form

Jie Zhang, Kuan-Chieh Wang, Bo-Wei Chiu, Min-Te Sun

2025-02-01Form Document Classification

Abstract

Recent advances in deep learning have established Transformer architectures as the predominant modeling paradigm. Central to the success of Transformers is the self-attention mechanism, which scores the similarity between query and key matrices to modulate a value matrix. This operation bears striking similarities to digraph convolution, prompting an investigation into whether digraph convolution could serve as an alternative to self-attention. In this study, we formalize this concept by introducing a synthetic unitary digraph convolution based on the digraph Fourier transform. The resulting model, which we term Converter, effectively converts a Transformer into a Directed Graph Neural Network (DGNN) form. We have tested Converter on Long-Range Arena benchmark, long document classification, and DNA sequence-based taxonomy classification. Our experimental results demonstrate that Converter achieves superior performance while maintaining computational efficiency and architectural simplicity, which establishes it as a lightweight yet powerful Transformer variant.

Results

Task	Dataset	Metric	Value	Model
Language Modelling	LRA	Avg	75.94	Converter
Language Modelling	LRA	Image	61.02	Converter
Language Modelling	LRA	ListOps	60.38	Converter
Language Modelling	LRA	Pathfinder	88.43	Converter
Language Modelling	LRA	Retrieval	83.41	Converter
Language Modelling	LRA	Text	86.44	Converter

Related Papers

FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation2025-07-11 Controlled Retrieval-augmented Context Evaluation for Long-form RAG2025-06-24 FormGym: Doing Paperwork with Agents2025-06-17 FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding2025-06-16 Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks2025-06-16 ARGUS: Hallucination and Omission Evaluation in Video-LLMs2025-06-09 LLM Unlearning Should Be Form-Independent2025-06-09 Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning2025-06-06