Standard NER Tagging Scheme for Big Data Healthcare Analytics built on Unified Medical Corpora  被引量:1

在线阅读下载全文

作  者:Sarah Shafqat Hammad Majeed Qaisar Javaid Hafiz Farooq Ahmad 

机构地区:[1]Department of basic and Applied Sciences,International Islamic University(IIU),Islamabad,Pakistan [2]Department of Computer Science,National University of Computer and Emerging Sciences,Islamabad,Pakistan [3]Computer Science Department,College of Computer Sciences and Information Technology(CCSIT),King Faisal University,Al-Ahsa 31982,Saudi Arabia

出  处:《Journal of Artificial Intelligence and Technology》2022年第4期152-157,共6页人工智能技术学报(英文)

基  金:This research is supported by Shifa International Hospital,Pakistan.Endocrine patients’data contributed for diagnosis of diabetes,and its comorbidities holds a lot of worth to come up with these observations from experimental study。

摘  要:The motivation for this research comes from the gap found in discovering the common ground for medical context learning through analytics for different purposes of diagnosing,recommending,prescribing,or treating patients for uniform phenotype features from patients’profile.The authors of this paper while searching for possible solutions for medical context learning found that unified corpora tagged with medical nomenclature was missing to train the analytics for medical context learning.Therefore,here we demonstrated a mechanism to come up with uniform NER(Named Entity Recognition)tagged medical corpora that is fed with 14407 endocrine patients’data set in Comma Separated Values(CSV)format diagnosed with diabetes mellitus and comorbidity diseases.The other corpus is of ICD-10-CM coding scheme in text format taken from www.icd10data.com.ICD-10-CM corpus is to be tagged for understanding the medical context with uniformity for which we are conducting different experiments using common natural language programming(NLP)techniques and frameworks like TensorFlow,Keras,Long Short-Term Memory(LSTM),and Bi-LSTM.In our preliminary experiments,albeit label sets in form of(instance,label)pair were tagged with Sequential()model formed on TensorFlow.Keras and Bi-LSTM NLP algorithms.The maximum accuracy achieved for model validation was 0.8846.

关 键 词:big data endocrine diseases international diabetes federation healthcare analytics ICD-10 medical Corpora NLP 

分 类 号:TP311.13[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象