A Novel Optimized Language-Independent Text Summarization Technique  

在线阅读下载全文

作  者:Hanan A.Hosni Mahmoud Alaaeldin M.Hafez 

机构地区:[1]Department of Computer Sciences,College of Computer and Information Sciences,Princess Nourah Bint Abdulrahman University,P.O.Box 84428,Riyadh,11671,Saudi Arabia [2]Department of Information Systems,College of Computer and Information Sciences,King Saud University,Riyadh,Saudi Arabia

出  处:《Computers, Materials & Continua》2022年第12期5121-5136,共16页计算机、材料和连续体(英文)

基  金:This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R113);Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.

摘  要:A substantial amount of textual data is present electronically in several languages.These texts directed the gear to information redundancy.It is essential to remove this redundancy and decrease the reading time of these data.Therefore,we need a computerized text summarization technique to extract relevant information from group of text documents with correlated subjects.This paper proposes a language-independent extractive summarization technique.The proposed technique presents a clustering-based optimization technique.The clustering technique determines the main subjects of the text,while the proposed optimization technique minimizes redundancy,and maximizes significance.Experiments are devised and evaluated using BillSum dataset for the English language,MLSUM for German and Russian and Mawdoo3 for the Arabic language.The experiments are evaluated using ROUGE metrics.The results showed the effectiveness of the proposed technique compared to other language-dependent and languageindependent summarization techniques.Our technique achieved better ROUGE metrics for all the utilized datasets.The technique accomplished an F-measure of 41.9%for Rouge-1,18.7%for Rouge-2,39.4%for Rouge-3,and 16.8%for Rouge-4 on average for all the dataset using all three objectives.Our system also exhibited an improvement of 26.6%,35.5%,34.65%,and 31.54%w.r.t.The recent model contributed in the summarization of BillSum in terms of ROUGE metric evaluation.Our model’s performance is higher than the comparedmodels,especially in themetric results ofROUGE_2which is bi-gram matching.

关 键 词:Text summarization:language-independent summarization ROUGE 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象