Skip to main content
QUICK REVIEW

[论文解读] Transformer-based Planning for Symbolic Regression

Parshin Shojaee, Kazem Meidani|arXiv (Cornell University)|Mar 13, 2023
Evolutionary Algorithms and Applications被引用 16
一句话总结

TPSR 将蒙特卡洛树搜索(MCTS)与预训练的变换器符号回归模型相结合,以优化方程生成以提高拟合精度和复杂度之间的权衡,优于基线方法。

ABSTRACT

Symbolic regression (SR) is a challenging task in machine learning that involves finding a mathematical expression for a function based on its values. Recent advancements in SR have demonstrated the effectiveness of pre-trained transformer-based models in generating equations as sequences, leveraging large-scale pre-training on synthetic datasets and offering notable advantages in terms of inference time over classical Genetic Programming (GP) methods. However, these models primarily rely on supervised pre-training goals borrowed from text generation and overlook equation discovery objectives like accuracy and complexity. To address this, we propose TPSR, a Transformer-based Planning strategy for Symbolic Regression that incorporates Monte Carlo Tree Search into the transformer decoding process. Unlike conventional decoding strategies, TPSR enables the integration of non-differentiable feedback, such as fitting accuracy and complexity, as external sources of knowledge into the transformer-based equation generation process. Extensive experiments on various datasets show that our approach outperforms state-of-the-art methods, enhancing the model's fitting-complexity trade-off, extrapolation abilities, and robustness to noise.

研究动机与目标

  • 动机:将符号回归作为跨科学与工程的可解释控制方程的途径。
  • 通过引入不可微分的反馈(拟合精度与复杂度),解决符号回归中令牌级预训练损失的局限性。
  • 提出一种与模型无关的 TPSR 框架,通过将预训练的 SR Transformer 与 MCTS 规划结合,来优化方程发现。

提出的方法

  • 将方程表示为前缀表示法,以便与变换器解码对齐。
  • 使用预训练的 SR Transformer 主干来对数据进行编码并生成方程序列。
  • 在推理过程中应用蒙特卡洛树搜索(MCTS)前瞻规划,基于平衡拟合度与复杂性的自定义奖励来引导标记选择。
  • 结合前k个标记限制和基于束搜索的模拟来管理 MCTS 中的扩展与评估步骤。
  • 定义一个奖励:将归一化均方误差(NMSE)为拟合度,与复杂度的指数惩罚相结合:r = 1/(1+NMSE) + λ exp(-l/L)。
  • 引入缓存机制(前k缓存和序列缓存)以减少 MCTS 过程中的冗余计算。
Figure 1: Pareto plot comparing the rankings of all methods in terms of the $R^{2}$ performance and identified equation complexity for (a) SRBench Black-box datasets and (b) Feynman datasets . Our results with Transformer-based Planning (TPSR) applied on top of E2E transformer SR model improves its
Figure 1: Pareto plot comparing the rankings of all methods in terms of the $R^{2}$ performance and identified equation complexity for (a) SRBench Black-box datasets and (b) Feynman datasets . Our results with Transformer-based Planning (TPSR) applied on top of E2E transformer SR model improves its

实验结果

研究问题

  • RQ1在标准 SR 基准上,TPSR 是否优于束搜索和采样解码策略及其他基线?
  • RQ2TPSR 是否能在符号回归任务中提升外推能力和对噪声的鲁棒性?
  • RQ3TPSR 的缓存机制是否显著降低推理时间?
  • RQ4单独的 MCTS 组件对 TPSR 性能的影响是什么?

主要发现

  • 与基线相比,TPSR 在 SRBench 和领域内合成数据集上在拟合精度更高且方程复杂度相当或更低。
  • TPSR 在 Feynman 和 Black-box 数据集的拟合与复杂度权衡上达到第一帕累托前沿。
  • 缓存机制总体降低推理时间约28%,其中序列缓存带来最显著的提升。
  • 消融研究表明,增加 MCTS 的回合数、束宽和最大扩展通常会提升性能,其中回合数贡献最大。
  • 与端到端 transformer 基线相比,TPSR 在尺度变化下的外推能力和对噪声的鲁棒性更佳。
Figure 2: An overview of our proposed method with MCTS-guided decoding at inference compared to the concurrent works with beam search/sampling decoding strategy.
Figure 2: An overview of our proposed method with MCTS-guided decoding at inference compared to the concurrent works with beam search/sampling decoding strategy.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。