Web访问挖掘中的匿名用户识别算法研究  被引量:5

On Anonymous User Identification Algorithm in Web Usage Mining

在线阅读下载全文

作  者:李红波[1] 孟欣赏 吴渝[1] 李娜芬 

机构地区:[1]重庆邮电大学计算机科学与技术学院,重庆400065

出  处:《西南师范大学学报(自然科学版)》2015年第9期78-84,共7页Journal of Southwest China Normal University(Natural Science Edition)

基  金:重庆市自然科学基金项目(CSTC2012jjA40027);"核高基"重大专项(2009ZX01038-002-002-2)

摘  要:有效的用户识别与用户细分是网站用户行为分析的基础.针对现有用户识别算法将注册用户和匿名用户均按匿名用户处理,导致用户分类不细致的问题,提出了一种匿名用户识别算法.该算法通过识别用户访问行为状态,采取页面访问路径和浏览时长匹配方式,进一步识别IP地址变化后混入纯匿名用户中的注册匿名用户,从而把用户细分为注册用户、假匿名用户和纯匿名用户.实验结果表明,该算法能够提高匿名用户识别率,更加准确地识别假匿名用户.Valid user identification and user segment are the basis of user behavior analysis.For the user classification is oversimplified which leads by confusing anonymous users and registered users all as anonymous user in existing user identification algorithms,an anonymous user identification algorithm has been proposed.By identifying users' access behavior state and matching users' access path and browsing time length,the algorithm identifies the registered anonymous user mixed in the pure anonymous user due to the IP address change to classify user into registered login user,registered anonymous user,pure anonymous users.The experimental results indicate that the proposed algorithm can improve anonymous user identification rate,and identify the registered anonymous user much more accurately.

关 键 词:数据挖掘 匿名用户 用户识别 浏览路径 访问时长 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象