基于可配置的办公文档格式转换方法  

Research on a configurable conversion method for office document formats translation

在线阅读下载全文

作  者:田英爱[1,2] 王维[2] 

机构地区:[1]北京信息科技大学网络文化与数字传播北京市重点实验室,北京100101 [2]北京信息科技大学计算机学院,北京100101

出  处:《北京信息科技大学学报(自然科学版)》2012年第6期27-33,共7页Journal of Beijing Information Science and Technology University

基  金:核高基重大专项(2010ZX01044-001-001);北京市教委科技面上项目(SQKM201211232011);网络文化与数字传播北京市重点实验室开放课题资助

摘  要:针对传统"文档对文档"方式的格式转换器的开发、扩展和维护工作量大、成本高等问题,提出了一种基于文档功能点的可配置办公文档格式转换体系。利用某种基于XML的中间格式作为中介进行格式转换,体系中所有格式的文档,只需要按照功能点维护各自的读取、生成配置规则以及与中间格式的映射关系,就可以通过转换适配器完成任意格式间的相互转换。根据此转换方法,以XHTML为中间格式,利用XQuery对文档功能点实现封装,实现了办公文档格式UOF与OOXML主要功能点的相互转换。实验表明,基于文档功能点的可配置文档格式转换方案可行,转换正确,并以ODF格式验证了体系较好的可扩展和维护性,同时也验证了所研制的转换适配器具有较高程度的可复用性,具有实用意义。Conventional document format converters are developed in a "document to document" mode which leads to some problems such as heavy maintenance workload and high cost.A new document format converting architecture is proposed in which document features can be configured easily.In the architecture,every two types of document formats can be converted mutually via an intermediate format file or database based on XML,and main work for converting is to maintain three rule files,including reader configure rules file,writer configure rules file and mapping rules file.According to the presented architecture,a converting case between two office document formats,UOF and OOXML,is studied and in the procedure of converting XHTML is used as intermediate format.Document features are encapsulated as APIs with XQuery technology.Experiments show that,the solution to converting heterogeneous XML documents based on configurable document features is feasible and accurate.Finally,ODF is introduced into this architecture successfully which verifies the system good in scalability and maintainability,and also validates that the developed conversion wrapper has a higher degree of reusability.

关 键 词:文档格式转换器 文档功能点 格式映射 配置规则 转换适配器 XQuery封装 

分 类 号:TP317.2[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象