QUICK REVIEW

[論文レビュー] Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework

Changyu Du, Sebastian Esser|arXiv (Cornell University)|Aug 15, 2024

BIM and Construction Integration被引用数 6

ひとこと要約

Text2BIMはマルチエージェントLLMフレームワークを活用し、自然言語を実行可能コードへ変換してVectorworksで編集可能なBIMモデルを生成し、ルールベースの品質チェックと反復的な改善を行う。

ABSTRACT

The conventional BIM authoring process typically requires designers to master complex and tedious modeling commands in order to materialize their design intentions within BIM authoring tools. This additional cognitive burden complicates the design process and hinders the adoption of BIM and model-based design in the AEC (Architecture, Engineering, and Construction) industry. To facilitate the expression of design intentions more intuitively, we propose Text2BIM, an LLM-based multi-agent framework that can generate 3D building models from natural language instructions. This framework orchestrates multiple LLM agents to collaborate and reason, transforming textual user input into imperative code that invokes the BIM authoring tool's APIs, thereby generating editable BIM models with internal layouts, external envelopes, and semantic information directly in the software. Furthermore, a rule-based model checker is introduced into the agentic workflow, utilizing predefined domain knowledge to guide the LLM agents in resolving issues within the generated models and iteratively improving model quality. Extensive experiments were conducted to compare and analyze the performance of three different LLMs under the proposed framework. The evaluation results demonstrate that our approach can effectively generate high-quality, structurally rational building models that are aligned with the abstract concepts specified by user input. Finally, an interactive software prototype was developed to integrate the framework into the BIM authoring software Vectorworks, showcasing the potential of modeling by chatting. The code is available at: https://github.com/dcy0577/Text2BIM

研究の動機と目的

BIM著作の認知的負担を低減し、自然言語からBIMモデル生成を可能にする。
専門化したLLMエージェント間の協調的推論を有効にし、構造化された実行可能なBIMコードを生成する。
ルールベースのモデルチェッカーを通じてドメイン知識を統合し、反復的なモデル品質の改善を導く。
Text2BIMとVectorworksを統合したインタラクティブモデリングの動作プロトタイプを示す。

提案手法

4人の専門化LLMエージェント（Product Owner、Architect、Programmer、Reviewer）が自然言語をimperative BIM API呼び出しへ変換するために協働する。
Vectorworks API呼び出しとドメインルールをカプセル化した26の高レベルBIM機能のツールセットがコード生成を導く。
Architectのプロンプトは建築的ルールを用いて建物計画を作成する；Product Ownerが指示を補強し、関数呼び出しを通じてArchitectと調整する。
Programmerは定義されたツール関数と標準Pythonライブラリのみを呼び出すPythonコードを作成する；カスタムインタプリタがコードを実行・テストする。
三重ループのワークフローでコード生成、自己反省、モデル品質評価を組み合わせ、ルールベースのモデルチェッカーとBCFレポーティングを用いて問題を反復的に修正する。
メモリモジュール（ローカル/グローバル）は相互作用履歴を保存し、反復間でコンテキストを維持する。

実験結果

リサーチクエスチョン

RQ1LLMベースのマルチエージェントシステムは自然言語指示から一貫性のある実行可能なBIMコードを生成できるか？
RQ2建築ルールとドメイン知識をどのように統合してモデル品質を向上させ、BIM標準との一貫性を確保するか？
RQ3ルールベースのモデルチェッカーとBCF主導のフィードバックはBIMモデルの反復的改善を導くうえでどのような役割を果たすか？
RQ4Vectorworksベースのプロトタイプは初期設計のBIM生成におけるモデリング・バイ・チャットの実用性をどう示すか？

主な発見

フレームワークは自然言語入力から、外部エンベロープ、内部レイアウト、セマンティック情報を含む編集可能なBIMモデルを生成できる。
ルールベースのチェッカーを備えたマルチエージェント協調は、複数のフィードバックループを通じてモデル品質の反復的改善を導く。
Text2BIMフレームワーク内の3つのLLMを比較し、構造的に合理的で意味的に豊かなモデルを生成する際の性能を評価する。
Vectorworksとの統合を示すインタラクティブなプロトタイプが、実践的なモデリング・バイ・チャットのワークフローを示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。