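Authors: Wang Nana (王娜娜) [1], Chen Lichao (陈立潮) [1], Pan Lihu (潘理虎) [1], Zhang Yingjun (张英俊) [1]
Affiliation: [1] School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, Shanxi, China
Source: Computer Technology and Development (《计算机技术与发展》), 2011, No. 10, pp. 81-84 (4 pages)
Funding: Natural Science Foundation of Shanxi Province (2009011022-1)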
Abstract: The timeliness of customer behavior is an important factor in analyzing customers' online consumption behavior, but the PrefixSpan data mining algorithm does not take it into account during mining. Building on the concept of time-interval sequential patterns, the paper proposes an improved PrefixSpan algorithm based on time intervals and click counts. The algorithm introduces the concepts of frequency degree and time attribute and incorporates factors such as time interval and click count, so that the mined information reflects real-time behavior and the mining places more emphasis on the objects of interest. Experimental comparison with the original PrefixSpan algorithm shows that, on data sets with temporal characteristics, the improved algorithm yields more precise mining results and that mining efficiency is effectively improved.
Classification No.: TP301.6 [Automation and Computer Technology: Computer System Architecture]
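
The record gives only the abstract, so the paper's actual definitions of frequency degree and time attribute are not available here. As a rough illustration of the general idea, the following is a minimal sketch of prefix-projection sequence mining with a time-gap constraint and a click filter. The function name `mine_time_gap_patterns`, the parameters `max_gap` and `min_clicks`, the event layout `(item, timestamp, clicks)`, and the sample sessions are all assumptions made for this example, not the authors' method.

```python
from collections import defaultdict


def mine_time_gap_patterns(sequences, min_support, max_gap, min_clicks=1):
    """Mine frequent sequential patterns under a maximum time gap.

    sequences: list of sessions, each a list of (item, timestamp, clicks)
               tuples sorted by timestamp (hypothetical layout).
    min_support: minimum number of sessions that must contain a pattern.
    max_gap: largest allowed time difference between consecutive pattern items.
    min_clicks: events with fewer clicks are ignored (assumed click filter).
    Returns a dict mapping pattern tuples to their support counts.
    """
    # Apply the assumed click filter, keeping only (item, timestamp).
    seqs = [[(item, t) for item, t, clicks in s if clicks >= min_clicks]
            for s in sequences]
    results = {}

    def extend(prefix, projected):
        # projected maps session id -> set of positions where a match of
        # `prefix` ends. All end positions are kept because, with a gap
        # constraint, the earliest match is not always the extendable one.
        candidates = defaultdict(lambda: defaultdict(set))
        for sid, ends in projected.items():
            seq = seqs[sid]
            for p in ends:
                for j in range(p + 1, len(seq)):
                    # p == -1 marks the empty prefix: no gap to check yet.
                    if p >= 0 and seq[j][1] - seq[p][1] > max_gap:
                        break  # timestamps are sorted, later events are farther
                    candidates[seq[j][0]][sid].add(j)
        for item, proj in sorted(candidates.items()):
            if len(proj) >= min_support:
                pattern = prefix + (item,)
                results[pattern] = len(proj)
                extend(pattern, proj)

    extend((), {sid: {-1} for sid in range(len(seqs))})
    return results


# Hypothetical click-stream sessions: (page, time in seconds, clicks).
sessions = [
    [("home", 0, 3), ("cart", 5, 2), ("pay", 8, 1)],
    [("home", 0, 1), ("cart", 4, 4), ("pay", 30, 2)],
    [("home", 0, 2), ("pay", 6, 1)],
]
patterns = mine_time_gap_patterns(sessions, min_support=2, max_gap=10)
for pattern, support in sorted(patterns.items()):
    print(pattern, support)
```

In this sketch the pattern home, cart, pay is not reported, because only the first session reaches "pay" within 10 seconds of "cart". Raising `min_clicks` further narrows the result to heavily clicked pages, which is one plausible reading of the emphasis on mining objects mentioned in the abstract.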