QUICK REVIEW

[論文レビュー] Quantum Structure in Cognition: Human Language as a Boson Gas of Entangled Words

Diederik Aerts, Lester Beltran|arXiv (Cornell University)|Sep 15, 2019

Quantum Mechanics and Applications参考文献 130被引用数 19

ひとこと要約

本稿では、語を気体状態のボソンとして扱い、語の頻度をエネルギー準位に対応させることで、人間の言語の量子モデルを提案する。ボーズ＝アインシュタイン統計を用いて、物語（例：ウィニーブーハン）における実測語頻度分布を高い精度で適合させ、ジープの法則との深い関連を明らかにし、認知における概念的同一性と不可識別性について新たな知見を提供する。

ABSTRACT

We model a piece of text of human language telling a story by means of the quantum structure describing a Bose gas in a state close to a Bose-Einstein condensate near absolute zero temperature. For this we introduce energy levels for the words (concepts) used in the story and we also introduce the new notion of 'cogniton' as the quantum of human thought. Words (concepts) are then cognitons in different energy states as it is the case for photons in different energy states, or states of different radiative frequency, when the considered boson gas is that of the quanta of the electromagnetic field. We show that Bose-Einstein statistics delivers a very good model for these pieces of texts telling stories, both for short stories and for long stories of the size of novels. We analyze an unexpected connection with Zipf's law in human language, the Zipf ranking relating to the energy levels of the words, and the Bose-Einstein graph coinciding with the Zipf graph. We investigate the issue of 'identity and indistinguishability' from this new perspective and conjecture that the way one can easily understand how two of 'the same concepts' are 'absolutely identical and indistinguishable' in human language is also the way in which quantum particles are absolutely identical and indistinguishable in physical reality, providing in this way new evidence for our conceptuality interpretation of quantum theory.

研究の動機と目的

語をボソンとして扱う統計的挙動に基づき、人間の言語の量子理論的モデルを構築すること。
ボーズ＝アインシュタイン統計を用いて、語の頻度分布に現れるジープの法則の出現を説明すること。
光場におけるフォトンに類似した、人間の思考の量子単位「コグニトン」の概念を導入すること。
特に物語文における言語的構造において、量子重ね合わせともつれの役割を調査すること。
実際のテキスト（短編小説や小説を含む）を用いて、モデルの実証的妥当性を示し、ボーズ＝アインシュタイン分布と強い一致を示すこと。

提案手法

語の出現頻度に基づきエネルギー準位を割り当て、最も頻出する語を最低エネルギー準位 E₀ に割り当てる。
テキストを熱平衡状態のボーズ気体としてモデル化し、ボーズ＝アインシュタイン分布を用いる：N(Ei) = 1 / (A·e^(Ei/B) - 1)。
語の総数と全エネルギー（語の頻度から導出）の2つの制約を用いて、パラメータ A と B をキャリブレーションする。
語の頻度をエネルギー準位にマッピングするため、線形エネルギースケールを用いる：Ei = i（i ∈ [0, 1, ..., n]）、ここで n は異なる語の数である。
実際のテキスト（例：「In Which Piglet Meets a Haffalump」）にモデルを適用し、予測された頻度と実測データを比較する。
量子調和振動子および井戸型中での粒子モデルをアナロジーとして用い、言語的概念の波動的および量子化された挙動を説明する。

実験結果

リサーチクエスチョン

RQ1人間の言語における語の頻度分布の統計的分布は、ボーズ＝アインシュタイン統計を用いて正確にモデル化可能か？
RQ2語の頻度に基づきエネルギー準位を割り当てるという行為の物理的および概念的意味は何か？
RQ3言語的データに現れるジープの法則は、言語の量子モデルによってどのように説明されるか？
RQ4重ね合わせやもつれといった量子現象が、物語文の構造にどの程度根ざしているか？
RQ5言語における同一概念の「不可識別性」は、量子粒子の性質をどのように明らかにするか？

主な発見

ボーズ＝アインシュタイン分布は、語数2655、全エネルギー E = 242891 を有する物語「In Which Piglet Meets a Haffalump」の語頻度データにほぼ完璧に適合する。
モデルはジープの法則をうまく再現する：語の頻度とエネルギー準位のボーズ＝アインシュタイン分布グラフは、実測のジープのグラフと密接に一致する。
最も頻出する語「And」は133回出現し、最低エネルギー準位 E₀ = 0 に割り当てられる。一方、まれな語である「Able」（1回出現）は高いエネルギー準位に割り当てられる。
エネルギー準位 E₅₄ = 54 は語「Thought」に対応し、10回出現しており、全エネルギーに540単位を寄与する。
モデルは、言語的構造が重ね合わせやもつれといった量子的特徴を示すことを示しており、マージナル法則を破る非古典的相関が深く関与している。
「コグニトン」——思考の量子単位——の概念が、フォトンに類似した基本的単位として出現し、異なるエネルギー状態が異なる言語的概念に対応する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。