Study on dimension-reduction of spatial economic statistics: A case study of economic statistical data of Sichuan  Cited by: 5

Authors: Dong Chengwei (董承玮)[1,2], Rui Xiaoping (芮小平)[1,3], Deng Yu (邓羽)[4,5,6], Guan Xingliang (关兴良)[4,5]

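Affiliations: [1] College of Resources and Environment, Graduate University of Chinese Academy of Sciences, Beijing 100049; [2] Beijing Institute of Surveying and Mapping, Beijing 100038; [3] Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085; [4] Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101; [5] Graduate University of Chinese Academy of Sciences, Beijing 100039; [6] Harvard University, Cambridge 02138, USA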

Source: Geographical Research (《地理研究》), 2012, No. 8, pp. 1411-1421 (11 pages)

Funding: National Natural Science Foundation of China (40901191)

Abstract: Economic statistical data generally carry more than three attributes. To study their inherent structure, such as clustering and spatial distribution, researchers need to reduce the multi-dimensional information to three dimensions or fewer so that it can be visualized. Many dimension-reduction methods exist; because they rest on different mathematical theories and have different ranges of application, their results and the resulting visualizations differ, so evaluating them provides an important reference for choosing a method in a given field. In this paper, the authors evaluate four commonly used methods: the linear method PCA, the nonlinear methods NLM and SOFM, and the supervised classification method SVM. Taking the counties and districts of Sichuan Province in 2007 as the units of study, they cluster (classify) the units by socio-economic development status with each method and discuss the differences between the results in light of the actual state of economic development in Sichuan. The main conclusions are as follows. Although PCA reveals the overall trend of economic development, its result differs considerably from the real situation in Sichuan. NLM shows the regional pattern and core areas of economic development in Sichuan well and accurately reflects the current state of development. The SOFM classification agrees fairly well with the current state of development, but there are several classification errors in local areas, in the northeastern part of the region, and targets within a class cannot be compared. SVM is a supervised method that needs a known sample set to train the classifier, which makes sample selection rather subjective, and the search for optimal parameters is complicated. The comparison of these methods and their application to economic statistics can serve as a reference for future research on dimension reduction of spatial multi-dimensional information.

Keywords: dimension reduction; multi-dimensional visualization; economic statistical data; Sichuan

Classification No.: F127 [Economics and Management: World Economy] F222.1

 
