检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]鲁东大学外国语学院汉语言文学院,山东烟台264025
出 处:《计算机工程与应用》2009年第23期137-139,148,共4页Computer Engineering and Applications
基 金:国家社会科学基金项目(No.08BYY046);教育部人文社会科学重点研究基地重大项目(No.06JJD740007);山东省社会科学规划项目(No.07CWXJ03)
摘 要:基于日语料库的粘着语文本语义接受度(SAS)研究分三步展开。首先提取『ゆきぐに』为分析文本,以等距离系统随机抽样方法取得6对比组。然后在屈折语SAS研究基础上提出适用于粘着语文本的词长定义,即百词所含5音拍及以上词数为超常用词量。最后得出结论:抽取间距由大变小引发抽取率(SR)由小变大的曲线变化;依次攀升的SR与围绕均值波动的SAS组图证明两者的非关联性,以实例验证了屈折语SAS评价公式对粘着语文本研究的可适用性。The study on agglutinative-language-involved Semantic Accessibility Scale(SAS) based on Japanese corpus comprises three steps.Firstly,「ゅきぐに」 is extracted from corpus and divided into six groups for comparison by the systematic random sampling skill in which different equidistant extraction is included.Secondly,the definition of word height in presently-verified SAS formula reflecting inflecting language domain is adapted for agglutinative language domain.The word beyond five music beats is called the unpopular one,and the number of this kind of word every 100 words is considered word height.Finally,a conclusion is drawn that decreasing extracted-space results in increasing Sampling Ratio(SR),and that the non-relevance between SR and SAS is verified by the schema in which the contrast between increasing SR and the mean-fluctuated SAS is involved.In short, the evaluation of SAS in inflecting language text can be applicable in other fields,including agglutinative language text.
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.135.64.200