机构地区:[1]School of Information, Renmin University of China, Beijing 100872, China [2]College of Computer Science and Engineering, Northwest Normal University, Lanzhou 730070, China [3]School of Systems Science, Beijing Normal University, Beijing 100875, China [4]DNSLAB, China Internet Network Information Center, Beijing 100190, China
出 处:《Journal of Computer Science & Technology》2015年第6期1175-1187,共13页计算机科学技术学报(英文版)
摘 要:The availability of network big data, such as those from online users' surfing records, communication records, and e-commerce records, makes it possible for us to probe into and quantify the regular patterns of users' long-range and complex interactions between websites. If we see the Web as a virtual living organism, according to the metabolic theory, the websites must absorb "energy" to grow, reproduce, and develop. We are interested in the following two questions: 1) where does the "energy" come from? 2) will the websites generate macro influence on the whole Web based on the "energy"? Our data consist of more than 30 000 online users' surfing log data from China Internet Network Information Center. We would consider the influence as metabolism and users' attention flow as the energy for the websites. We study how collective attention distributes and flows among different websites by the empirical attention flow network. Different from traditional studies which focused on information flow, we study users' attention flow, which is not only a "reversed" way to study Web structure and transmission mode, but also the first step to understand the underlying dynamics of the World Wide Web. We find that the macro influence of websites scales sub-linearly against the collective attention flow dwelling time, which is not consistent with the heuristics that the more users' dwelling time is, the greater influence a website will have. Further analysis finds a supper-linear scaling relationship between the influence of websites and the attention flow intensity. This is a websites version of Kleiber's law. We further notice that the development cycle of the websites can be split into three phases: the uncertain growth phase, the partially accelerating growth phase, and the fully accelerating growth phase. We also find that compared with the widespread hyperlinks analysis models, the attention flow network is an effective theoretical tool to estimate and rank websites.The availability of network big data, such as those from online users' surfing records, communication records, and e-commerce records, makes it possible for us to probe into and quantify the regular patterns of users' long-range and complex interactions between websites. If we see the Web as a virtual living organism, according to the metabolic theory, the websites must absorb "energy" to grow, reproduce, and develop. We are interested in the following two questions: 1) where does the "energy" come from? 2) will the websites generate macro influence on the whole Web based on the "energy"? Our data consist of more than 30 000 online users' surfing log data from China Internet Network Information Center. We would consider the influence as metabolism and users' attention flow as the energy for the websites. We study how collective attention distributes and flows among different websites by the empirical attention flow network. Different from traditional studies which focused on information flow, we study users' attention flow, which is not only a "reversed" way to study Web structure and transmission mode, but also the first step to understand the underlying dynamics of the World Wide Web. We find that the macro influence of websites scales sub-linearly against the collective attention flow dwelling time, which is not consistent with the heuristics that the more users' dwelling time is, the greater influence a website will have. Further analysis finds a supper-linear scaling relationship between the influence of websites and the attention flow intensity. This is a websites version of Kleiber's law. We further notice that the development cycle of the websites can be split into three phases: the uncertain growth phase, the partially accelerating growth phase, and the fully accelerating growth phase. We also find that compared with the widespread hyperlinks analysis models, the attention flow network is an effective theoretical tool to estimate and rank websites.
关 键 词:attention flow network allometric scaling law influence of website online collective behavior
分 类 号:TP277[自动化与计算机技术—检测技术与自动化装置] TP393.092[自动化与计算机技术—控制科学与工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...