A new information-theory-based measure for molecular sequences

A new measure based on Shannon information for biological sequences

Source: Computers and Applied Chemistry (《计算机与应用化学》), 2009, No. 11, pp. 1380-1384 (5 pages)

Funding: National Natural Science Foundation of China (grant 10571019: Applications of mathematical methods in molecular biology)

Abstract: Traditional methods for measuring the distance between sequences require sequence alignment, which lets subjective factors disturb the original state of the data, so that results vary from one analyst to another. This paper reviews several measures based on information theory, proposes a similar new measure, and establishes its mathematical basis. These measures compute distances between sequences without alignment and without subjective interference. The complete mitochondrial genome sequences of 20 placental mammals were selected, their pairwise distances were computed with each measure, and phylogenetic trees were then built with NEIGHBOR. The comparison shows that the tree built by the new method, in less time, is in no way inferior to those built by earlier methods. This provides a new way to study differences among molecular sequences.

Traditional sequence distances require an alignment and are therefore not directly applicable to the problem of whole-genome phylogeny, where events such as rearrangements make full-length alignments impossible. This paper introduces the information-theoretic concepts and the algorithm used to compute the information probability distribution of a sequence. It also introduces several information-theory-based measures of the discrepancy between such distributions, including Kullback-Leibler entropy, cross entropy, and the FDOD function. A sequence measure based on the information-theoretic concept of Shannon information is then presented, together with a program to estimate the distance. The new measure does not need to align sequences to measure their distance and is free of subjective interference. Some properties of the new measure are proved. Distance matrices for the complete mitochondrial genome sequences of 20 mammals are computed with each measure, and phylogenies are then constructed with NEIGHBOR. The experiments show that the new measure has lower time complexity and that the phylogeny it produces is the most credible. The measure is useful for studying discrepancies among biological sequences.
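The abstract does not give the formula of the new measure, so the following is only a minimal sketch of the general alignment-free idea it describes: estimate a probability distribution for each sequence, then compare distributions with an information-theoretic discrepancy. As assumptions made purely for illustration, the distribution here is a smoothed k-mer frequency table and the discrepancy is a symmetrized Kullback-Leibler divergence. The function names, the word length `k`, and the `pseudocount` are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import product
from math import log2


def kmer_distribution(seq, k=2, alphabet="ACGT", pseudocount=1.0):
    """Return a smoothed k-mer probability distribution for seq (assumed model)."""
    seq = seq.upper()
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    # Windows containing symbols outside the alphabet (e.g. N) are ignored.
    total = sum(counts[w] for w in kmers) + pseudocount * len(kmers)
    # The pseudocount keeps every probability positive so the logs are defined.
    return {w: (counts[w] + pseudocount) / total for w in kmers}


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in bits."""
    return sum(p[w] * log2(p[w] / q[w]) for w in p)


def symmetric_kl(p, q):
    """Symmetrized divergence, used here as an illustrative distance."""
    return kl_divergence(p, q) + kl_divergence(q, p)


def distance_matrix(seqs, k=2):
    """Pairwise distances between sequences; no alignment is performed."""
    dists = [kmer_distribution(s, k) for s in seqs]
    n = len(dists)
    return [[symmetric_kl(dists[i], dists[j]) for j in range(n)]
            for i in range(n)]


if __name__ == "__main__":
    toy = ["ACGTACGTACGT", "ACGTTCGTACGA", "GGGGCCCCGGGG"]
    for row in distance_matrix(toy):
        print(["%.3f" % d for d in row])
```

A matrix of this kind is the sort of input a distance-based tree builder, such as the NEIGHBOR program named in the abstract, would take. Each distribution is computed in a single pass over its sequence, which is consistent with the abstract's point that avoiding alignment saves time.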

Keywords: information entropy; measure; phylogenetic tree

Classification: O6 [Science: Chemistry]; TP393 [Automation and Computer Technology: Computer Application Technology]
