A data representation method using distance correlation  

作  者:Xinyan LIANG Yuhua QIAN Qian GUO Keyin ZHENG 

机构地区:[1]Institute of Big Data Science and Industry,Shanxi University,Taiyuan 030006,China [2]Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,China [3]School of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan 030024,China [4]Shanxi Key Laboratory of Big Data Analysis and Parallel Computing,Taiyuan University of Science and Technology,Taiyuan 030024,China

出  处:《Frontiers of Computer Science》2025年第1期1-14,共14页计算机科学前沿(英文版)

基  金:supported by the National Key R&D Program of China(No.2021ZD0112400);the National Natural Science Foundation of China(Grant Nos.62306171,62136005,61976129,62106132,61906114,61906115);the Science and Technology Major Project of Shanxi(No.202201020101006);the Young Scientists Fund of the Natural Science Foundation of Shanxi(Nos.202203021222183,20210302124549);the Open Project Foundation of Intelligent Information Processing Key Laboratory of Shanxi Province(Nos.CICIP2023005,CICIP202205);the Science and Technology Innovation Plan for Colleges and Universities of Shanxi Province(2022L296);and Taiyuan University of Science and Technology Doctoral Research Start-up Fund Project(20222106).

摘  要:Association in-between features has been demonstrated to improve the representation ability of data. However, the original association data reconstruction method may face two issues: the dimension of reconstructed data is undoubtedly higher than that of original data, and adopted association measure method does not well balance effectiveness and efficiency. To address above two issues, this paper proposes a novel association-based representation improvement method, named as AssoRep. AssoRep first obtains the association between features via distance correlation method that has some advantages than Pearson’s correlation coefficient. Then an improved matrix is formed via stacking the association value of any two features. Next, an improved feature representation is obtained by aggregating the original feature with the enhancement matrix. Finally, the improved feature representation is mapped to a low-dimensional space via principal component analysis. The effectiveness of AssoRep is validated on 120 datasets and the fruits further prefect our previous work on the association data reconstruction.

关 键 词:ASSOCIATION REPRESENTATION distance correlation CLASSIFICATION 

分 类 号:O17[理学—数学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象