GPT在文本分析中的应用:一个基于Stata的集成命令用法介绍  被引量:7

Application of GPT in Textual Analysis: An Introduction to A Community Developed Command within Stata

在线阅读下载全文

作  者:李春涛 闫续文[3] 张学人 LI Chuntao;YAN Xuwen;ZHANG Xueren(International Business School,Henan University;School of Finance,Zhongnan University of Economics and Law;Henan YUNENG Holdings Company Limited;School of Economics and Management,Wuhan University)

机构地区:[1]河南大学国际商学院 [2]中南财经政法大学金融学院 [3]河南豫能控股股份有限公司 [4]武汉大学经济与管理学院

出  处:《数量经济技术经济研究》2024年第5期197-216,共20页Journal of Quantitative & Technological Economics

基  金:国家自然科学基金面上项目(72072051)的资助。

摘  要:数字智能时代,深度挖掘文本数据的价值日益重要,大语言模型的出现为此提供了新的契机。本文在人工智能技术赋能实证研究发展的框架下,首先分析了文献中词典法、文本相似度、监督式机器学习等传统文本分析方法的局限性;其次论述了大语言模型的优势及其对实证研究、特别是文本分析的赋能作用;在此基础上,借助由本文作者编制的Stata命令chatgpt,通过一系列案例展示了GPT在提高文本数据处理效率、优化文本指标刻画能力、增强文本指标衡量精度以及丰富文本指标信息含量上的核心优势。本文认为,大语言模型将极大地释放文本数据价值,在基于文本分析方法的相关实证研究中具有巨大的应用潜力。In the context of the“Credibility Revolution”over the past four decades,empirical research has emerged as the predominant research paradigm in modern economics. With the rapid advancement of information technology,textual data available for empirical analysis,such as listed company annual reports,analyst research reports,news commentaries,social media,and earnings conference calls,are becoming increasingly abundant. This provides new perspectives for classic research questions in the fields of finance and accounting and offers new solutions to previously challenging problems. However,constrained by the limitations of traditional text analysis techniques,existing text metrics have consistently struggled to achieve satisfactory performance in terms of result credibility,cost,and repeatability. The academic community continues to anticipate a more effective text processing tool to facilitate the integration of textual information into empirical research.As the latest significant technological achievement in the field of artificial intelligence,large language models have demonstrated powerful capabilities in addressing various tasks within natural language processing and have made a significant impact on the entire field of artificial intelligence. Unlike traditional natural language processing,large language models focus more on building a universal,intelligent,and smoothly interactive processing system. Recent applications such as ChatGPT and API interfaces, which are based on OpenAI's Large Language Model GPT(Generative Pre-trained Transformer),can generate responses based on patterns and statistical regularities observed during pretraining and can interact in context with conversations. This study summarizes four application characteristics of large language models,namely,“virtual agent,”“knowledge repository,”high indicator credibility,and low information processing costs,demonstrating their empowering role in empirical research,particularly in text analysis.Building on this foundation and utilizing

关 键 词:文本分析 GPT 大语言模型 自然语言处理 文本语调 

分 类 号:F830[经济管理—金融学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象