翻译教学语料库规模化建设的大数据解决方案被引量：1

The Scalability Challenge in Translation Corpus Building and a Big Data Solution from an Interdisciplinary Perspective

作　　者：赵政廷[1] 柴明熲[1] ZHAO Zhengting;CHAI Mingjiong(Graduate Institute of Interpretation and Translation,Shanghai International Studies University,Shanghai 200083,China)

机构地区：[1]上海外国语大学高级翻译学院,上海200083

出　　处：《上海理工大学学报（社会科学版）》2024年第5期403-412,共10页Journal of University of Shanghai for Science and Technology：Social Sciences Edition

基　　金：上海外国语大学校级规划项目(2021114015);上海外国语大学专项课题(2022zxkt22)。

摘　　要：翻译教学中基于平行文本的双语平行语料库建设,本质上是译者根据自身需求进行方案设计、语料收集和语料管理的过程。目前受限于现有翻译技术手段和经济条件,翻译教学中难以规模化建设此类语料库。通过以大数据自动化采集的方式,引入人工智能领域的Python技术与PostgreSQL数据库系统,来解决目前建立大规模教学语料库的难题。为了完整呈现规模化建库过程,借用政府公文类翻译的真实案例,详细描述了建库方案设计、语料收集和语料管理等操作步骤,分析了各步骤中存在的自动化难题以及新技术介入的契机。部分技术细节与源代码也在文中公开,以期为大规模建设教学语料库提供一个切实可行的解决方案。In translation teaching,students regularly prepare“parallel texts”for a new translation project and build these linguistic materials into database,or a reference corpus.This corpus requires a relative high volume of data in order to be effective due to no ideal solution provided by available product/serviceoriented translation technology.This article aims at this scalability challenge and introduces an interdisciplinary approach by combining Python and PostgreSQL.The automation challenges in processes like corpus design,data collection and corpus management are addressed by applying these technologies.The entire large-scale corpus building process is presented in detail with relevant Python and PostgreSQL source code disclosed.

关键词：翻译教学语料库规模化建设 Python应用大数据

分类号：TP18[自动化与计算机技术—控制理论与控制工程] H059[自动化与计算机技术—控制科学与工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

翻译教学语料库规模化建设的大数据解决方案被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

翻译教学语料库规模化建设的大数据解决方案 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

翻译教学语料库规模化建设的大数据解决方案被引量：1