农业工程学报2018,Vol.34Issue(8):177-183,7.DOI:10.11975/j.issn.1002-6819.2018.08.023
基于空间LDA模型的高分辨率遥感影像地物覆盖分类
Classification of land cover in high-resolution remote sensing images based on Space-LDA model
摘要
Abstract
Automatic classification of high-resolution remote sensing image has been a hot topic in remote sensing image analysis and related areas. Object-oriented image analysis method exploits segmentation object as the basic unit of analysis, avoiding the "salt and pepper" phenomenon in the traditional classification method. However, the major limitation is the uncertainty of segmenting semantically meaningful object as the basic unit of analysis. The traditional classifiers to discriminate objects in the underlying feature spaces cannot adapt to the complexity of different features with different imaging mechanisms, leading to large differences within classes and the imbalance between different classes. Probabilistic topic model shows a great success in the field of natural language processing with solid mathematical theoretical foundation. Its inherent characteristics are extremely consistent with the demand of remote sensing information extraction. In this paper, the classic probabilistic topic model, latent dirichlet allocation (LDA) model, is used as the main model. LDA model is introduced into high-resolution remote sensing image classification research. Bag-of-words model can produce a valid information expression of remote sensing images, but the studies regarding this model are still limited in low-level features which mean the word vector space. The proposed LDA model in this paper would be applicable to the further exploitation of remote sensing data. This paper was going to establish some sort of spatial model oriented to the classification of high-resolution remote sensing images, and meanwhile the improved model was based on the traditional LDA model. The expression of remote sensing images based on the bag-of-words model is a basic of probabilistic topic model. Considering the requires of patterns classification, we were going to create the "Document-Words" mapping of high-resolution remote sensing images by using multi-scale segmentation algorithm and study on the key technology of building the bag-of-words model. It created the visual dictionary with a fast clustering algorithm based on the density peak to get rid of the dependence of original clustering center and identify the noise. After building the bag-of-words model, we tried to introduce the popularity and content of spatial topic as the latent topic of data and the prior distribution of words in the topic. Moreover, the variational expectation maximization (EM) inference algorithm of this model was built and the tests would verify the advantage of improved model. QuickBird images of Wuxi were used in the experiment, whose spatial resolution was 0.6 m with 4 bands. LDA and Space-LDA model were compared in the classification of land use types. Space-LDA model had higher classification accuracy than traditional LDA, and reached the highest accuracy when the visual dictionary size was 480. At last, when topic size was fixed at 40, both overall classification accuracy and Kappa's coefficient showed that Space-LDA model achieved better results than LDA model. The spatial region information provides reasoning information from both the theme popularity and the thematic content at the same time, so the model has a more flexible structure.关键词
模型/土地利用/遥感/面向对象/空间关系/空间LDA模型Key words
models/land use/remote sensing/object-oriented/spatial relationship/Space-LDA model分类
信息技术与安全科学引用本文复制引用
李杨,邵华,江南,施歌,丁远..基于空间LDA模型的高分辨率遥感影像地物覆盖分类[J].农业工程学报,2018,34(8):177-183,7.基金项目
国家自然科学基金项目(41501431,41601449) (41501431,41601449)
江苏省高校自然科学研究面上项目(16KJD420002) (16KJD420002)
江苏高校优势学科建设工程资助项目(164320H116) (164320H116)