基于全球典型油气田数据库的数据挖掘预处理  被引量:9

PREPROCESSING OF THE DATA TAPPING BASED ON GLOBAL TYPICAL OIL AND GAS FIELD DATABASE

在线阅读下载全文

作  者:李大伟[1] 熊华平[2] 石广仁[1] 牛敏[1] 

机构地区:[1]中国石油勘探开发研究院,北京100083 [2]大庆油田有限责任公司勘探开发研究院,黑龙江大庆163712

出  处:《大庆石油地质与开发》2016年第1期66-70,共5页Petroleum Geology & Oilfield Development in Daqing

基  金:国家油气重大科技专项"全球剩余油气资源研究及油气资产快速评价技术"(2011ZX050)

摘  要:石油工业早已进入大数据时代,数据挖掘是充分利用数据资产价值的有效途径,而数据预处理是数据挖掘研究的热点之一。分析了数据挖掘以及数据预处理的意义及其现状,提出了在石油工业进行数据挖掘的基本思路;以某国际石油勘探开发技术服务与咨询公司研制的全球典型油气田数据库为例,以"采收率"为挖掘对象,详细解析了各种常用的数据挖掘预处理方法和具体做法,主要包括数据获取、属性选择、数据清理、数据集成、数据变换、数据规约和数据消密;提出了源数据的"5C"标准,即Correctness(正确性)、Currency(适时性)、Completeness(完整性)、Consistency(一致性)、Confidentiality(保密性)。研究成果可为石油行业开展数据预处理等工作提供参考。Oil industry has entered upon "big data" epoch for many years, the data tapping or mining is an effec- tive method to fully utilize the value of the data asset, and the data preprocessing is one of the study focuses of the data mining. The significance and situation of the data mining and preprocessing are analyzed, the basic thinking of the data mining in oil industry was presented. Taking Global Typical Oil and Gas Field database from an interna- tional petroleum exploration and development service and consultant company as the example, the detailed methods of the data mining preprocessing are dissected by taking "recovery factor" as the mining object. These methods in- clude: data acquisition, attribute selection, data cleaning, data integration, data conversion, data specification and data confidentiality treatment; finally "5C" criteria for the source data are proposed: correctness, currency, com- pleteness, consistency and confidentiality. These achievements can provide references for the researchers on the da- ta preprocessing and so on in oil industry.

关 键 词:数据挖掘 预处理 油气田 数据库 5C标准 

分 类 号:TE19[石油与天然气工程—油气勘探]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象