大语言模型驱动的交互式建筑设计新范式——基于Rhino7的概念验证  被引量:3

A new interaction paradigm for building design driven by large language model:proof of concept with Rhino7

在线阅读下载全文

作  者:蒋灿 郑哲 梁雄 林佳瑞[2,3] 马智亮 陆新征[2] JIANG Can;ZHENG Zhe;LIANG Xiong;LIN Jiarui;MA Zhiliang;LU Xinzheng(Glodon Company Limited,Beijing 100193,China;Department of Civil Engineering,Tsinghua University,Beijing 100084,China;Key Laboratory of Digital Construction and Digital Twin,Ministry of Housing and Urban-Rural Development,Beijing 100084,China)

机构地区:[1]广联达科技股份有限公司,北京100193 [2]清华大学土木工程系,北京100084 [3]住房城乡建设部数字建造与孪生重点实验室,北京100084

出  处:《图学学报》2024年第3期594-600,共7页Journal of Graphics

基  金:国家自然科学基金项目(52378306);北京市科委-中关村管委会项目(20220468132)。

摘  要:随着社会对建筑设计质量要求越来越高,建筑设计软件也变得越来越专业和复杂。现在的设计软件不仅学习成本高,而且交互模式复杂。大语言模型(LLM)的最新突破使计算机清晰地理解人类自然语言指令,并准确生成代码语言具有可行性,有望为人与软件的交互范式提供新思路。因此,本文提出了LLM驱动的交互式建筑设计新范式--将设计师通过多次键鼠操作与设计软件交互转变为LLM根据设计师自然语言指令生成并执行API调用脚本的方式;提出了技术路线并验证了其在建筑设计场景落地的可能性。该技术路线包括:①LLM根据用户指令从API库中搜索与任务相关的API;②LLM基于指令和候选API摘要信息编写程序脚本并运行;③LLM根据来自软件环境、用户等反馈改进优化所编写的程序脚本。通过Rhino7设计软件、GPT-4和CodeLlaMa完成多个设计任务,测试当前LLM是否具备执行该技术路线各关键环节的能力。测试结果不仅证明了LLM驱动的交互式设计范式在建筑设计场景已初具落地前景,也为技术落地提供经验和建议。该设计范式的落地可以降低软件的使用门槛和学习成本,提高设计师工作效率;有望在未来的建筑设计软件中发挥重要作用。As society places higher demands on the quality of building designs,design software has become more professional and complicated.Current design software not only incurs high learning costs but also features complex interaction modes.The recent breakthroughs in large language models(LLM)have enabled computers to clearly comprehend instructions based on human natural language and accurately generate code,which is expected to provide new ideas for the paradigm of human interaction with software.Therefore,this study designed a new paradigm of interactive building design driven by LLM,i.e.,shifting from the designers interacting with the design software through multiple keyboard and mouse operations to LLMs writing scripts to invoke APIs according to architects’instructions.The methodology was proposed and its implementation feasibility in building design was validated.The methodology included:①LLM retrieved task-related APIs from the API set according to user instructions;②LLM wrote a program script based on instructions and the abstract of candidate APIs and ran it;③LLM revised the script written based on the feedback from the environment,users,etc.To validate the capabilities of current LLMs in executing the key steps of the methodology,multiple design tasks were completed with Rhino7 design software,GPT-4,and CodeLlaMa.The results not only demonstrated that the LLM-driven interactive design paradigm held initial prospects for implementation in building design,but also provided experiences and suggestions for its implementation.The implementation of this design paradigm could reduce the threshold and learning costs,improving the efficiency in many scenarios,and was expected to play a key role in future building design software.

关 键 词:建筑设计软件 软件交互 大语言模型 应用程序接口 GPT-4 Rhino7 LADYBUG 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象