基于通信负载均衡的社交网络图分割方法  被引量:1

Graph partitioning method for social networks based on communication load balancing

在线阅读下载全文

作  者:刘康[1] 张雪英[1] 李凤莲[1] 田玉楚[1,2] 

机构地区:[1]太原理工大学信息工程学院,山西晋中030600 [2]昆士兰科技大学电机工程及计算机科学学院

出  处:《计算机工程与应用》2018年第4期66-71,共6页Computer Engineering and Applications

基  金:山西省国际合作项目(No.2015081007);山西省优秀人才科技创新项目(No.201605D211021);2016年太原理工大学教改项目(24)

摘  要:海量社交网络数据中蕴含着丰富的信息,图论是挖掘这些信息的重要方法之一。面对日益增多的图数据,分布式计算成为处理大规模图数据的有效手段。在分布式图计算中,通信所消耗的时间占有很大的比例,通过图分割算法的设计可以有效地降低通信量并实现负载均衡,从而提高分布式图计算的效率,典型的例子包括Metis图分割算法。但是,用现有的图分割算法处理非均衡图数据会造成各个子图之间通信量不均衡,从而影响了计算效率。为了解决这一问题,提出一种新的图分割方法:通信均衡标签交换方法。该方法在保持子图规模一致的基础上,既降低了全图计算所需的通信量,又使各个子图之间的通信量达到均衡。实验结果表明,与Metis等典型的图分割算法相比,提出的图分割方法在各种数据集和集群配置情况下,能降低6%~30%的图计算时间,充分显示了该方法的有效性。Massive data from social networks contains a wealth of information. Among various methods to mine such information, graph theory is an attractive tool. With the increase of the volume of the graph data, distributed computation of graphs becomes an effective means to deal with large-scale graph data. In distributed graph computation, the time consumed in communications contributes significantly to the overall computation time. A well-designed graph partitioning algorithm can effectively reduce the communications as well as achieve load balancing, thus improving the efficiency of distributed graph computation. Typical examples include the Metis graph partitioning algorithm. However, existing graph partitioning algorithms to deal with social network graphs which involve non-equilibrium graph data will result in imbalance between subgraph communications, thus affecting the computational efficiency. To solve this problem, a new graph partitioning method, namely communication balanced label switching method, is presented. It behaves with three unique features: consistent subgraph scale, reduction of the communications required for the whole graph computation, and balanced communications between subgraphs. Experimental results show that in comparison with existing partitioning algorithms such as Metis, the graph partitioning method presented in this paper improves the computational time performance by 6% ~30%for various data sets and cluster configurations. These results highlight the effectiveness of the presented method.

关 键 词:社交网络 图论 图分割 分布式计算 负载均衡 

分 类 号:TP39[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象