Primary Investigation into Biological Literature Mining on lncRNA based on R Language


Authors: Gao Zhihua (高志华) [1,2], Ma Lili (马莉丽) [1], Shi Xiaohui (石晓辉) [1], Li Guiqin (李桂琴)

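Affiliations: [1] College of Biological Science and Engineering, Hebei University of Economics and Business, Shijiazhuang, Hebei 050061; [2] College of Life Sciences, Hebei Normal University, Shijiazhuang, Hebei 050016; [3] United Front Work Department, Hebei University of Economics and Business, Shijiazhuang, Hebei 050061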

Source: Journal of Hebei University of Economics and Business (Comprehensive Edition), 2017, No. 4, pp. 88-93 (6 pages)

Funding: Intramural research fund project of Hebei University of Economics and Business (2013KYZ05)

Abstract: In recent years, long non-coding RNA (lncRNA) has drawn increasing attention because of its important role in cellular processes, and the number of published lncRNA papers has grown rapidly. For most biologists, manual literature mining currently means keyword searches in literature databases. Traditional literature retrieval, however, can hardly keep pace with the rate at which literature accumulates. The authors therefore used R programs to retrieve 604 lncRNA literature records online from the Wanfang database and carried out a preliminary exploration of literature mining on external features of these records (title, author, keywords, funding, and so on), using data mining tools including bibliometric analysis, association analysis, and social network analysis. The results indicate that this approach can reveal the focus of lncRNA research and can serve as a reference for further lncRNA research and for literature mining in other fields.

Keywords: R language; lncRNA; literature mining; biology

Classification number: Q752 [Biology - Molecular Biology]
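
Note: the abstract mentions association analysis and social network analysis on record keywords. Both typically start from counting how often two keywords appear in the same record. The following is a minimal sketch of that counting step in Python, not the authors' R code; the function name `keyword_cooccurrence` and the sample keyword lists are invented for illustration and are not taken from the 604 Wanfang records.

```python
from collections import Counter
from itertools import combinations


def keyword_cooccurrence(records):
    """Count how often each unordered keyword pair appears in the same record."""
    pair_counts = Counter()
    for keywords in records:
        # Deduplicate and sort so ("a", "b") and ("b", "a") map to one key.
        unique = sorted(set(keywords))
        pair_counts.update(combinations(unique, 2))
    return pair_counts


# Hypothetical keyword lists, invented for illustration only.
sample_records = [
    ["lncRNA", "gene expression", "cancer"],
    ["lncRNA", "cancer"],
    ["lncRNA", "gene expression"],
]

counts = keyword_cooccurrence(sample_records)
for pair, n in counts.most_common():
    print(pair, n)
```

The resulting pair counts could serve as edge weights in a keyword network or as support counts for two-item association rules, depending on which analysis is applied next.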

 
