检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]上海大学计算机工程与科学学院,上海200072
出 处:《计算机工程与设计》2008年第21期5555-5558,共4页Computer Engineering and Design
摘 要:目前广为应用的文本过滤技术是利用关键字检索,没有考虑概念之间的关联,因此其过滤性能在达到一定程度后,很难有突破。介绍了一种基于领域本体的文本过滤模型DOTFM,探讨了领域本体在文本过滤中的应用。DOTFM在文本向量的表示和用户模板建立中引入概念关联度,并提出局部型和全局型的文本向量和用户模板。实验结果表明,DOTFM的召回率比之传统的基于关键字的过滤模型有较大提高,而其准确率在合适的阈值时,也有较大提高。The keyword based index is widely used in text filtering, which fails to deal with the relationship between concepts. Consequently, when the filtering performance reaches certain degree, it is very difficult to make a breakthrough. This paper introduces a text filtering model called DOTFM, and studies the applications of domain ontology in text filtering. In DOTFM, the concept related degree is introduced as a factor in text vector presentation and user model construction, and the local/global text vector and local/global user model are also proposed. Comparedwiththerecall-rateandprecision-rate of traditional key word based text filtering model, experimental results show that the recall-rate of DOTFM is improved significantly, and its precision-rate is also improved obviously under some proper thresholds.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.202