International Journal of Digital Earth (Aug 2025)

Intelligent segmentation of Chinese address elements combining textual and spatial semantic features

  • Xuefeng Yan,
  • An Luo,
  • Jiping Liu,
  • Yong Wang,
  • Ya Zhang

DOI
https://doi.org/10.1080/17538947.2025.2481142
Journal volume & issue
Vol. 18, no. 1

Abstract

Read online

Chinese address segmentation is a critical step in tasks such as address parsing, address standardization, address matching, and geocoding. However, existing research has largely overlooked the spatial characteristics of address elements, thereby limiting the potential for further performance enhancement. This paper integrates both textual and spatial semantic features and proposes a framework for intelligent segmentation of Chinese addresses via multi-semantic features (FISCA-MS). The framework leverages the pooled output of the pretrained language model MacBERT to represent the textual semantic features of address elements and introduces a novel full-scale clustering evaluation method (FSCE) to compute the spatial semantic features. By combining textual and spatial semantic features, the proposed approach uses a bidirectional gated recurrent unit (BiGRU) neural network to automatically segment Chinese address elements. The experimental results show that FISCA-MS outperforms FISCA, which uses only textual semantic features (FISCA-TS), and FISCA, which uses only spatial semantic features (FISCA-SS), with accuracy improvements of 0.725% and 38.05%, respectively.

Keywords