检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]吉林大学计算机科学与技术学院,长春130012
出 处:《计算机研究与发展》2007年第11期1816-1824,共9页Journal of Computer Research and Development
基 金:国家自然科学基金项目(60373099)~~
摘 要:多维序列模式挖掘旨在将一个或多个背景维度信息中发现的关联模式与有序事务序列中发现的序列模式有机结合,从而为用户提供信息内容更加丰富、更具有直接应用价值的多维序列模式.目前虽有一些挖掘多维序列模式的工作,但其关联模式与序列模式的发现过程是基于不同的数据结构分开进行的.提出一种新的概念格结构——多维概念格,它是对概念格的延伸与泛化,其内涵更加丰富,不仅具有多个有序的任务内涵,而且具有多个无序的背景内涵.设计实现了基于该结构的增量式多维序列模式挖掘算法,该算法使用统一的数据模型实现关联模式与序列模式的高效同步挖掘.在合成数据集上的实验结果验证了算法的有效性.同时,算法在实际的银行数据集上的应用效果也说明了算法的实用性.Multi-dimensional sequential pattern mining is the process of mining association rules from one or more dimensions of background information in which the order of the dimension values is not relevant, alongside mining sequential patterns from one or more dimensions of information in which the order is important. Multi-dimensional sequential patterns are much more informative frequent patterns which are suitable for immediate use. Although some work has been conducted for mining multi-dimensional sequential patterns, association patterns and sequential patterns are mined separately based on different data structures. In this paper, a novel data model called multi-dimensional concept lattice is proposed, which is the extension or generalization toward concept lattice. The intension of multi-dimensional concept lattice is more informative, which is constituted of one or more ordered task-relevant dimensions and one or more unordered background dimensions. Moreover, an incremental multi-dimensional sequential pattern mining algorithm is developed. The proposed algorithm integrates sequential pattern mining and association pattern mining with a uniform data structure and makes the mining process more efficient. The performance study on synthetic datasets shows the scalability and effectiveness of the proposed algorithm. At the same time, the application on the real-life financial datasets demonstrates the practicability of the approach.
关 键 词:数据挖掘 序列模式 关联模式 增量式 多维概念格
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.177