检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]大连理工大学系统工程研究所,大连116024
出 处:《情报学报》2011年第8期812-818,共7页Journal of the China Society for Scientific and Technical Information
基 金:国家自然科学基金资助项目(70871015); 国家高技术研究发展计划(863计划)资助项目(2008AA04Z107)
摘 要:本文从信息论的角度考虑了聚类问题,将聚类看成是有损信息压缩的过程。首先运用率失真理论建立了模糊聚类的优化模型,与经典的模糊聚类模型相比,模型的目标函数中多了一个描述聚类过程复杂度的指标。同时为了估计聚类数目,还提出了一个新的聚类有效性指标。其次通过求解优化模型得到基于率失真理论的模糊聚类算法。最后将基于率失真理论的模糊聚类算法与经典模糊C均值算法进行了数值实验比较。数值实验结果表明基于率失真理论的模糊聚类算法能够自动确定聚类数目,在运行时间上比模糊C均值算法有一定减少,且最终的模糊划分矩阵与模糊C均值算法相比有较少的模糊性,因而聚类结果更加明确可靠。Clustering is considered as a process of lossy compression from an information theory perspective in this paper.Firstly an optimization model of fuzzy clustering is built by using the rate distortion theory.Comparing to the classic fuzzy clustering model,the new model introduces a new index in the objective function which describes the complexity of clustering process.In order to estimate the number of clusters,a new cluster validity index is also proposed.Then the fuzzy clustering algorithm based on rate distortion theory is obtained by solving the optimization model.Finally some numerical experiments are made to compare the fuzzy clustering algorithm based on rate distortion theory with fuzzy c-means.The experimental results indicate that the fuzzy clustering algorithm based on rate distortion theory can estimate the number of clusters automatically and it also has less running time than fuzzy c-means.Moreover,membership assignments of the proposed algorithm based on rate distortion theory are less confused than fuzzy c-means,which makes the result more definite and reliable.
分 类 号:O224[理学—运筹学与控制论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.145.177.173