一种高速精确单模式串匹配算法  被引量:14

A Fast and Exact Single Pattern Matching Algorithm

在线阅读下载全文

作  者:范洪博[1,2] 姚念民[1] 

机构地区:[1]哈尔滨工程大学计算机科学与技术学院,哈尔滨150001 [2]绥化学院计算机科学与技术系,黑龙江绥化152000

出  处:《计算机研究与发展》2009年第8期1341-1348,共8页Journal of Computer Research and Development

基  金:国家自然科学基金项目(60503055);黑龙江省博士后启动基金项目(323630217)~~

摘  要:串匹配问题是计算机科学的基础问题之一,是网络安全、信息检索与过滤、计算生物学等众多领域的核心问题,其中,高速精确单模式匹配算法设计又是各种串匹配问题的基础.基于SBNDM2,通过修改位掩码有效位到无符号整数的高位,将BNDM算法核心循环化简至最简形式(5指令/字符),并引入越界保护机制,提出S2BNDM系列精确单模式匹配算法.实验结果显示,S2BNDM系列算法在任何情况下都快于SBNDM2,对于英文语料(m<32)和DNA序列(m<8),S2BNDM系列算法为现有已知最快算法.String matching problem is a fundamental problem in computer science. It is the key problem of many important scopes such as network security, information retrieval and filtration, computational biology, etc. And the design of exact single pattern string matching algorithm with high performance is the basis of all string matching problems. In this paper, the authors improve one of the fastest exact single pattern matching algorithms known on English text, which is SBNDM2. The simplest form of the BNDM core loop is obtained, in which there are only 5 instructions per character read by amending the relationship between position in the pattern and bit in the bitmask. And a cross-border protection method is added to the algorithm in order to reduce the cost of crossborder inspection. Two algorithms named S2BNDM and S2BNDM' are presented. The experimental results indicate that both S2BNDM and S2BNDM' are faster than SBNDM2 in any case. It can be considered that S2BNDM is the fastest algorithm on English text (2〈m≤12) /DNA sequence (2〈m≤8), and S2BNDM' is the fastest algorithm on English text (13≤m〈32).

关 键 词:串匹配 精确单模式 算法设计 位并行 文本搜索 

分 类 号:TP391.3[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象