Scientific Reports (Aug 2025)
A graph transformer with optimized attention scores for node classification
Abstract
The message-passing paradigm on graphs has significant advantages in modeling local structure, but it still struggles to capture global information and complex relationships. Although the Transformer architecture has become a mainstream approach to non-local modeling in many domains, Transformer-based architectures remain uncompetitive with mainstream graph neural network (GNN) variants on popular node-level prediction tasks. This can be attributed to the fact that existing research has largely focused on more efficient strategies for approximating the vanilla Transformer, thereby overlooking its potential for node embedding representation learning. This paper introduces a novel graph Transformer with optimized attention scores, named OGFormer, to address this gap. OGFormer employs a simplified single-head self-attention mechanism and incorporates several critical structural innovations in its self-attention layers to capture global dependencies within graphs more effectively. Specifically, we propose an end-to-end attention-score optimization loss designed to suppress noisy connections and strengthen the connection weights between similar nodes. Additionally, we introduce an effective structural encoding strategy integrated into the attention-score computation, enabling the model to focus on key local dependencies in the graph. Experimental results demonstrate that OGFormer achieves strongly competitive performance on node classification tasks across multiple benchmark datasets, on both homophilous and heterophilous graphs, surpassing current mainstream methods. Our implementation is available at https://github.com/LittleBlackBearLiXin/OGFormer.
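To make the described components concrete, the following is a minimal PyTorch sketch of a single-head self-attention layer over node features combined with an auxiliary attention-score regularizer; it is an illustrative assumption, not the OGFormer implementation, and the class name, loss form, and cosine-similarity target are hypothetical stand-ins for the loss described in the abstract.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SingleHeadGraphAttention(nn.Module):
    # Hypothetical single-head self-attention over node features with an
    # auxiliary loss that pushes attention mass toward similar node pairs.
    # This only illustrates the idea; it is not the paper's exact method.
    def __init__(self, in_dim: int, hid_dim: int):
        super().__init__()
        self.q = nn.Linear(in_dim, hid_dim)
        self.k = nn.Linear(in_dim, hid_dim)
        self.v = nn.Linear(in_dim, hid_dim)

    def forward(self, x: torch.Tensor):
        # x: [N, in_dim] node features; attention is computed over all node pairs.
        q, k, v = self.q(x), self.k(x), self.v(x)
        scores = q @ k.t() / (k.size(-1) ** 0.5)   # [N, N] raw attention scores
        attn = scores.softmax(dim=-1)              # row-normalized attention weights
        out = attn @ v                             # updated node embeddings

        # Assumed attention-score regularizer: encourage high attention between
        # nodes with similar features and penalize attention to dissimilar
        # ("noisy") pairs. The actual OGFormer loss may differ.
        sim = F.cosine_similarity(x.unsqueeze(1), x.unsqueeze(0), dim=-1)  # [N, N]
        attn_loss = ((attn - sim.clamp(min=0.0)) ** 2).mean()
        return out, attn_loss

# Usage: add attn_loss (suitably weighted) to the node-classification
# cross-entropy loss so the attention scores are optimized end to end.
x = torch.randn(16, 32)                  # 16 nodes with 32-dimensional features
layer = SingleHeadGraphAttention(32, 64)
emb, attn_loss = layer(x)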
Keywords