Similarity Measurement of Web Sessions Based on Sequence Alignment  


Authors: LI Chaofeng, LU Yansheng

Affiliations: [1] College of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China; [2] College of Management, South-Central University for Nationalities, Wuhan 430074, Hubei, China

Source: Wuhan University Journal of Natural Sciences (武汉大学学报(自然科学英文版)), 2007, No. 5, pp. 814-818 (5 pages)

Funding: Supported by the Foundation of Hubei Key Technology Research and Development (2005AA101C18); the Natural Science Foundation of South-Central University for Nationalities (YZY06009)

Abstract: Clustering Web sessions means grouping them by similarity so that intra-group similarity is maximized and inter-group similarity is minimized. The first question to settle is how to measure the similarity between Web sessions, and traditional measurements have many shortcomings. This paper introduces a new method for measuring the similarity between Web pages that takes into account not only the URL but also the viewing time of the visited page. It then gives, in detail, a new method for measuring the similarity of Web sessions using sequence alignment and the similarity of Web page accesses. Experiments show that the method is valid and efficient.
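
Illustrative sketch (not the authors' formulas, which this record does not give): the abstract combines a page-level similarity built from URL and viewing time with a sequence alignment over sessions. The Python below shows one way such a measure could be assembled. Every function name, the prefix-based URL measure, the min/max time ratio, the `url_weight` parameter, and the zero gap penalty are assumptions made for illustration only.

```python
from urllib.parse import urlparse


def url_similarity(u1: str, u2: str) -> float:
    """Assumed measure: shared leading path segments over the longer path."""
    p1 = [s for s in urlparse(u1).path.split("/") if s]
    p2 = [s for s in urlparse(u2).path.split("/") if s]
    if not p1 and not p2:
        return 1.0  # both URLs point at the site root
    common = 0
    for a, b in zip(p1, p2):
        if a != b:
            break
        common += 1
    return common / max(len(p1), len(p2))


def time_similarity(t1: float, t2: float) -> float:
    """Assumed measure: ratio of the shorter to the longer viewing time."""
    longest = max(t1, t2)
    if longest <= 0:
        return 1.0  # neither page was viewed for a measurable time
    return max(min(t1, t2), 0.0) / longest


def page_similarity(a, b, url_weight: float = 0.5) -> float:
    """Combine URL and viewing-time similarity of two (url, seconds) accesses.

    Assumption: viewing time only modulates the URL similarity, so pages
    with unrelated URLs score 0 regardless of viewing time.
    """
    (url_a, time_a), (url_b, time_b) = a, b
    s_url = url_similarity(url_a, url_b)
    s_time = time_similarity(time_a, time_b)
    return s_url * (url_weight + (1.0 - url_weight) * s_time)


def session_similarity(s1, s2) -> float:
    """Global alignment of two sessions with zero gap penalty.

    Aligned pairs score their page similarity and gaps score 0, so the
    dynamic program finds the best order-preserving pairing. Dividing by
    the longer session length keeps the result in [0, 1].
    """
    if not s1 or not s2:
        return 0.0
    n, m = len(s1), len(s2)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + page_similarity(s1[i - 1], s2[j - 1]),
                score[i - 1][j],  # gap in s2
                score[i][j - 1],  # gap in s1
            )
    return score[n][m] / max(n, m)
```

A session here is a list of `(url, viewing_time_in_seconds)` pairs, for example `[("/a/b", 10), ("/c", 5)]`. A nonzero gap penalty or a local alignment would be equally plausible choices; the paper itself should be consulted for the actual definitions.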

Keywords: Web usage mining; clustering; Web session; sequence alignment

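Classification: TP391 [Automation and Computer Technology: Computer Application Technology]; TP393 [Automation and Computer Technology: Computer Science and Technology]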

 
