Volume 28, Issue 5 pp. 993-1020
RESEARCH ARTICLE

DePNR: A DeBERTa-based deep learning model with complete position embedding for place name recognition from geographical literature

Weirong Li

Weirong Li

College of Environment and Resources, Guangxi Normal University, Guilin, China

Guangxi Key Laboratory of Environmental Processes and Remediation in Ecologically Fragile Regions, Guangxi Normal University, Guilin, China

Search for more papers by this author
Kai Sun

Corresponding Author

Kai Sun

GeoAI Lab, Department of Geography, University at Buffalo, Buffalo, New York, USA

Correspondence

Kai Sun, GeoAI Lab, Department of Geography, University at Buffalo, Buffalo, NY, USA.

Email: [email protected]

Yunqiang Zhu, State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China.

Email: [email protected]

Search for more papers by this author
Shu Wang

Shu Wang

State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Search for more papers by this author
Yunqiang Zhu

Corresponding Author

Yunqiang Zhu

State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, China

Correspondence

Kai Sun, GeoAI Lab, Department of Geography, University at Buffalo, Buffalo, NY, USA.

Email: [email protected]

Yunqiang Zhu, State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China.

Email: [email protected]

Search for more papers by this author
Xiaoliang Dai

Xiaoliang Dai

State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author
Lei Hu

Lei Hu

State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, China

Search for more papers by this author
First published: 03 May 2024
Citations: 3

Abstract

Place names play an important role in linking physical places to human perception and are highly frequently used in the daily lives of people to refer to places in natural language. However, many place names may not be recorded in typical gazetteers due to their new establishment, colloquial nature, and different concerns. These unrecorded toponyms are often discussed in geographical literature; thus, it is necessary to automatically identify them from geographical literature and update existing gazetteers using computational approaches. Currently, the most advanced approaches are deep learning-based models. However, existing models used only partial position information rather than complete position information of words in a sentence, which limits their performance in recognizing toponyms. To this end, we develop DePNR, a DeBERTa-based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an F-score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available in DePNR at https://github.com/liwr-igsnrr/DePNR.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.