QUICK REVIEW

[論文レビュー] The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies

Stephan Zheng, Alexander Trott|arXiv (Cornell University)|Apr 28, 2020

Energy, Environment, and Transportation Policies参考文献 83被引用数 44

ひとこと要約

本論文は、AI社会プランナーがシミュレーション経済で動的な税政策を学習し、平等と生産性のバランスを取る二階層の深層強化学習フレームワークを訓練する。ベースラインの税制度を顕著なマージンで上回り、AI設計の税政策が emergent に生み出す挙動と有効性を、ヒト参加者を用いた実証で示す。

ABSTRACT

Tackling real-world socio-economic challenges requires designing and testing economic policies. However, this is hard in practice, due to a lack of appropriate (micro-level) economic data and limited opportunity to experiment. In this work, we train social planners that discover tax policies in dynamic economies that can effectively trade-off economic equality and productivity. We propose a two-level deep reinforcement learning approach to learn dynamic tax policies, based on economic simulations in which both agents and a government learn and adapt. Our data-driven approach does not make use of economic modeling assumptions, and learns from observational data alone. We make four main contributions. First, we present an economic simulation environment that features competitive pressures and market dynamics. We validate the simulation by showing that baseline tax systems perform in a way that is consistent with economic theory, including in regard to learned agent behaviors and specializations. Second, we show that AI-driven tax policies improve the trade-off between equality and productivity by 16% over baseline policies, including the prominent Saez tax framework. Third, we showcase several emergent features: AI-driven tax policies are qualitatively different from baselines, setting a higher top tax rate and higher net subsidies for low incomes. Moreover, AI-driven tax policies perform strongly in the face of emergent tax-gaming strategies learned by AI agents. Lastly, AI-driven tax policies are also effective when used in experiments with human participants. In experiments conducted on MTurk, an AI tax policy provides an equality-productivity trade-off that is similar to that provided by the Saez framework along with higher inverse-income weighted social welfare.

研究の動機と目的

ダイナミックな経済における平等と生産性のバランスを取るためのデータ駆動型シミュレーションによる税政策設計の動機づけ。
プランナーが税政策を学習する一方でエージェントがその行動を適応させる二階層RLフレームワークの開発。
AI駆動の税政策がベースライン政策と比較して平等-生産性のトレードオフを改善することの示界。
実世界の適用性を評価するため、人間の参加者を用いた emergent な挙動と政策の検証。

提案手法

適応的なエージェントと政府プランナーを含む経済シミュレーション（Gather-and-Build game）を作成。
観測可能な世界データ上の深層ニューラルネットワークを用いて所得階層に適用される学習済みスケジュールとして税政策を定式化。
エージェントの効用をコインの凹凸関数 minus 線形労働コストとしてモデル化し、RL最適化を可能にする。
カリキュラムとエントロピーベースの正則化を含む二階層RL設定でエージェントとプランナーを訓練し、安定性を確保。
平等と生産性の指標で政策を評価し、Saez frameworkおよび自由市場ベースラインと比較する。

実験結果

リサーチクエスチョン

RQ1AI駆動の税政策は動的で学習する経済における平等と生産性のトレードオフを改善できるか。
RQ2AIによる emergent な税設計はベースライン政策と定性的に異なり、エージェントの戦略行動に対して頑健か。
RQ3AI駆動の税政策は人間の参加者とともに平等-生産性のトレードオフを評価する際に有効か。

主な発見

AI駆動の税政策はベースライン政策（Saez frameworkを含む）よりも平等-生産性のトレードオフを16%改善。
AI政策は異なる質的なスケジュールを設定し、より高い最高税率や低所得層への純 subsidies の増加を含む。
AI駆動の政策はエージェントが税回避戦略を学習しても有効性を維持。
学習された政策はヒト参加者を用いるMTurk実験で平等-生産性の好ましい結果を示す。
政策はAIエージェントの emergent な戦略行動に対して頑健である。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。