QUICK REVIEW

[论文解读] A New Paradigm for Minimax Search

Jonathan Schaeffer, Wim Pijls|arXiv (Cornell University)|Jan 1, 1994

Artificial Intelligence in Games参考文献 26被引用 23

一句话总结

本文提出 MTD(f)，一种新型的极小化极大搜索范式，通过利用迭代深化和内存增强技术，显著提升了搜索效率。通过重新定义 MT 过程的调用方式，MTD(f) 在传统 alpha-beta 和 SSS* 算法之上实现了更优性能，在国际跳棋、奥赛罗和国际象棋游戏中，叶节点扩展次数减少超过 20%，并在叶节点数、总节点数和执行时间方面优于最佳的 alpha-beta 实现方案。

ABSTRACT

textabstractThis paper introduces a new paradigm for minimax game-tree search algorithms. MT is a memory-enhanced version of Pearl's Test procedure. By changing the way MT is called, a number of best-first game-tree search algorithms can be simply and elegantly constructed (including SSS*). Most of the assessments of minimax search algorithms have been based on simulations. However, these simulations generally do not address two of the key ingredients of high performance game-playing programs: iterative deepening and memory usage. This paper presents experimental data from three game-playing programs (checkers, Othello and chess), covering the range from low to high branching factor. The improved move ordering due to iterative deepening and memory usage results in significantly different results from those portrayed in the literature. Whereas some simulations show alpha-beta expanding almost 100% more leaf nodes than other algorithms [Marsland, Reinefeld & Schaeffer, 1987], our results showed variations of less than 20%. One new instance of our framework MTD(f) out-performs our best alpha-beta searcher (aspiration NegaScout) on leaf nodes, total nodes and execution time. To our knowledge, these are the first reported results that compare both depth-first and best-first algorithms given the same amount of memory.

研究动机与目标

解决以往模拟研究中忽略迭代深化和内存使用等关键性能因素的问题，这些因素在游戏树搜索中至关重要。
构建一个统一框架，通过增强内存的 MT 过程，实现包括 SSS* 在内的最佳优先搜索算法的优雅构造。
实证评估迭代深化和内存使用对搜索效率的影响，突破以往仅依赖模拟结果的局限。
在相同内存约束下，对深度优先（alpha-beta）与最佳优先（SSS*）算法进行比较，这是此前未被报道的对比。
证明 MTD(f) 作为该框架的新实例，在实践中优于最佳的 alpha-beta 实现方案。

提出的方法

提出 MTD(f)，一种通过重新定义增强内存的 MT 过程调用方式而推导出的新极小化极大搜索算法。
采用迭代深化以改善走法排序，从而提升内存剪枝搜索空间的效果。
使用 Pearl 的测试过程（MT）的增强版本，以存储和重用游戏树值的边界信息。
通过改变 MT 的调用方式，构建如 SSS* 等最佳优先算法，实现统一框架。
采用深度优先搜索结构，并通过先前存储的边界信息指导选择性重新计算。
在三种游戏（国际跳棋、奥赛罗和国际象棋）中评估性能，涵盖从低到高的分支因子。

实验结果

研究问题

RQ1引入迭代深化和内存使用后，对极小化极大搜索算法的相对性能有何影响？
RQ2基于模拟的节点扩展比率结果在多大程度上能推广到真实的对弈程序中？
RQ3基于 MT 调用的统一框架能否生成高效的最佳优先搜索算法，包括 SSS*？
RQ4MTD(f) 是否在叶节点数、总节点数和执行时间方面优于最佳的 alpha-beta 实现？
RQ5当深度优先与最佳优先搜索受到相同内存约束时，其实际性能差异是什么？

主要发现

MTD(f) 在所有三项指标（叶节点数、总节点数和执行时间）上均优于最佳的 alpha-beta 搜索器（即引述式 NegaScout）。
在真实程序中，不同算法之间的叶节点数量差异小于 20%，这与以往模拟结果中报告的高达 100% 的差异形成矛盾。
通过迭代深化带来的走法排序改进，显著减少了在内存固定情况下的探索节点数。
本文首次在相同内存约束下对深度优先与最佳优先算法进行了实证比较。
结果表明，内存使用和迭代深化是模拟评估中常被忽视的关键因素。
SSS* 及其他最佳优先算法可通过该框架，借助不同的 MT 调用策略优雅地构建。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。