
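Similarity Calculation and Deduplication Method for Geographical Name and Address Data Based on Phonetic Codes (基于音形码的地名地址数据相似度计算与去重方法)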

Yan Haifeng (严海峰), Jian Zihong (简梓红), Jiang Xiuming (江秀明)

Beijing Surveying and Mapping (北京测绘), 2024, Vol. 38, Issue 9: 1271-1276, 6. DOI: 10.19580/j.cnki.1007-3000.2024.09.006

Similarity calculation and duplication method of geographical name and address data based on phonetic code

Yan Haifeng 1, Jian Zihong 1, Jiang Xiuming 2

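Author information

  • 1. Map Institute of Guangdong Province (广东省地图院), Guangzhou, Guangdong 510075
  • 2. 广东省测绘工程有限公司 (Guangdong Surveying and Mapping Engineering Co., Ltd.), Guangzhou, Guangdong 510663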

Abstract

Handling duplicate records is an important task in managing geographical name and address data. To address duplicate records in the geographical name and address database of Guangdong Province, this paper proposes a method for calculating Chinese character similarity based on phonetic codes and describes the principle, workflow, and method of phonetic-code-based deduplication of geographical names and addresses. Deduplication software for geographical name and address data was developed on these principles. Using the geographical name and address data of Liwan District as experimental data, the software calculated the similarity between records in the district's database and identified duplicates by applying a duplication rule together with the distance between records. This resolved the duplicate records in the database and helped ensure its accuracy. The experimental results show that the software matches duplicate data with high accuracy, and that the dual-driven approach combining phonetic codes and distance effectively resolves duplicate geographical name and address data, offering a reliable solution for managing geographical names and addresses in other regions.
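The abstract does not give the encoding, the duplication rule, or any thresholds, so the following is only a minimal sketch of how a combined phonetic-code and distance check could look. Everything in it is an assumption made for illustration: the toy code table `TOY_CODES`, the five code fields, the weighted edit distance used to compare names, the haversine distance, and the thresholds `sim_threshold` and `dist_threshold_m`. None of it is taken from the paper.

```python
import math

# Hypothetical per-character codes: (initial, final, tone, structure, strokes).
# Toy placeholder values for illustration only; NOT the paper's encoding.
TOY_CODES = {
    "荔": ("l", "i", "4", "top-bottom", "9"),
    "丽": ("l", "i", "4", "top-bottom", "7"),
    "湾": ("w", "an", "1", "left-right", "12"),
    "弯": ("w", "an", "1", "top-bottom", "9"),
    "路": ("l", "u", "4", "left-right", "13"),
}


def char_similarity(a, b):
    """Fraction of code fields two characters share (1.0 if identical)."""
    if a == b:
        return 1.0
    code_a, code_b = TOY_CODES.get(a), TOY_CODES.get(b)
    if code_a is None or code_b is None:
        return 0.0  # unknown characters are treated as unrelated
    matches = sum(x == y for x, y in zip(code_a, code_b))
    return matches / len(code_a)


def text_similarity(s, t):
    """1 - (weighted edit distance / longer length).

    Substituting one character for another costs 1 - char_similarity,
    so near-homophones or look-alikes are cheap to swap.
    """
    if not s and not t:
        return 1.0
    prev = [float(j) for j in range(len(t) + 1)]
    for i, a in enumerate(s, 1):
        cur = [float(i)]
        for j, b in enumerate(t, 1):
            cur.append(min(
                prev[j] + 1.0,                              # delete a
                cur[j - 1] + 1.0,                           # insert b
                prev[j - 1] + 1.0 - char_similarity(a, b),  # substitute
            ))
        prev = cur
    return 1.0 - prev[-1] / max(len(s), len(t))


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres on a sphere of radius 6,371 km."""
    r = 6371000.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp, dl = p2 - p1, math.radians(lon2 - lon1)
    h = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(h))


def is_duplicate(rec_a, rec_b, sim_threshold=0.75, dist_threshold_m=50.0):
    """Assumed rule: names are similar enough AND the points are close enough."""
    sim = text_similarity(rec_a["name"], rec_b["name"])
    dist = haversine_m(rec_a["lat"], rec_a["lon"], rec_b["lat"], rec_b["lon"])
    return sim >= sim_threshold and dist <= dist_threshold_m
```

With records shaped like `{"name": "荔湾路", "lat": 23.1200, "lon": 113.2400}`, calling `is_duplicate(a, b)` flags a pair only when both conditions hold. Requiring both is one way to read the "dual-driven" idea: similar names that lie far apart, or nearby points with unrelated names, are not merged.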

Key words

geographical name and address (地名地址) / phonetic code (音形码) / similarity (相似度) / distance (距离) / deduplication (去重)

Classification

Astronomy and Earth Sciences

Cite this article

Yan Haifeng, Jian Zihong, Jiang Xiuming. 基于音形码的地名地址数据相似度计算与去重方法 [J]. Beijing Surveying and Mapping (北京测绘), 2024, 38(9): 1271-1276, 6.

Funding

Science and Technology Planning Project of Guangdong Province (广东省科技计划) (2021B1111610001)

Beijing Surveying and Mapping (北京测绘)

ISSN 1007-3000
