检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中国科学技术信息研究所
出 处:《图书情报工作》2013年第20期123-126,共4页Library and Information Service
基 金:中国科学技术信息研究所重点工作课题"多语言科技信息语义关联网络构建及其应用"(项目编号:ZD2012-3-3)研究成果之一
摘 要:面对日益膨胀的多语种信息资源,跨语言信息检索已成为实现全球知识存取和共享的关键技术手段。构建一个实用型的跨语言检索查询翻译接口,可方便地嵌入任意的信息检索平台,扩展现有信息检索平台的多语言信息处理能力。该查询翻译接口采用基于最长短语、查询分类和概率词典等多种翻译消歧策略,并从查询翻译的准确性和接口的运行效率两个角度对构建的查询翻译接口进行评测,实验结果验证所采用方法具有可行性。With the huge muhilingual information resources, there's an obvious language barrier for the existing information retrieval platform to provide documents organization and management services. A practical query, translation interface for Cross-Language Information Retrieval (CLIR) has been established in the paper. It could be embedded expediently in any information retrieval platform, expanding the muhilingual information processing ability for the existing system. A hybrid eorpora-based method for translation disambiguation is proposed in the paper. Provided an English- Chinese bilingual dictionary, parallel corpus have been employed to append translation probability value to bilingual dictionary. Moreover, maximum forward matching of phrases and query classification method were also used. The test collection has been used in testing CLIR systems successfully, and the experimental results demonstrate the methods of query translation and translation disambiguation for CLIR in the paper is feasible.
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:52.14.145.78