检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:朱欣娟 牛婷婷 ZHU Xinjuan;NIU Tingting(School of Computer Science/The Shaanxi Key Laboratory of Clothing Intelligence,Xi’an Polytechnic University,Xi’an 710600,China)
机构地区:[1]西安工程大学计算机科学学院/陕西省服装设计智能化重点实验室,陕西西安710600
出 处:《西安工程大学学报》2024年第3期92-99,共8页Journal of Xi’an Polytechnic University
基 金:公共文化资源智能共建共享与管理平台构建与示范应用(2019YFC1521405)。
摘 要:在文旅领域智能问答中,用户问句文本表征稀疏、口语化表达、一词多义及特定领域词汇的识别困难使得常见的匹配模型难以将用户问句与标准问句进行精准匹配。针对此问题,本文构建了文旅客服问句匹配数据集和相应的领域词典,在此基础上提出一种融合领域词典的文旅问句匹配模型SBIDD(Improved SBERT Model for Integrating Domain Dictionaries)。模型利用Sentence-BERT对问句进行向量化表示,在孪生网络模型中融入领域词典,增强问句的领域词权重,使得模型对领域词汇的识别能力大幅提升。在自建数据集和公开数据集ATEC 2018 NLP上分别进行实验。结果表明,构建的模型与5种经典文本匹配模型DSSM、BiMPM、ESIM、IMAF、TSFR-RM及基线模型SBERT相比效果更优,F1值达到95.65%,比基线模型提升了2.75%,且模型在检索任务上表现出更高的适配性和鲁棒性。In culture and tourism intelligent question answering,the sparse representation,colloquial expression,polysemy of a word,and difficulty in recognizing specific domain vocabulary make it difficult for common matching models to accurately match user questions with standard questions.In response to this issue,firstly a dataset of customer service question matching for cultural and tourism and corresponding domain dictionaries were constructed.Then a cultural and tourism question matching model SBIDD(Improved SBERT Model for Integrating Domain Dictionaries)integrating domain dictionaries was proposed.The model utilizes SBERT to vectorize questions and incorporates a domain dictionary into the twin network model to enhance the domain word weight of the questions,greatly improving it′s ability to recognize domain vocabulary.Experiments were conducted on both self-built dataset and the public dataset ATEC 2018 NLP.The results show that compared with the classic text matching models such as DSSM,BiMPM,ESIM,IMAF,TSFR-RM,and baseline model SBERT,SBIDD has better performance,with F 1 value reaching 95.65%,an increase of 2.75%compared to the baseline model,and shows higher adaptability and robustness in retrieval tasks.
关 键 词:问句匹配 文旅客服 Sentence-BERT 领域词典 智能问答 检索式问答
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.189.188.228