QUICK REVIEW

[論文レビュー] Detecting High Log-Densities -- an O(n^1/4) Approximation for Densest k-Subgraph

Aditya Bhaskara, Moses Charikar|arXiv (Cornell University)|Jan 17, 2010

Complexity and Algorithms in Graphs参考文献 10被引用数 115

ひとこと要約

本稿では、Densest k-Subgraph (DkS) 問題に対して、O(n^{1/4+ε})-近似アルゴリズムを提示する。時間計算量O(n^{O(log n)})でO(n^{1/4})近似が達成される。この手法は対数密度解析と定数サイズの木の数え上げを巧みに組み合わせ、密度の高い部分グラフを同定する。性能は、ランダムグラフにおけるプラントド密度部分グラフ問題の識別比と一致する。

ABSTRACT

In the Densest k-Subgraph problem, given a graph G and a parameter k, one needs to find a subgraph of G induced on k vertices that contains the largest number of edges. There is a significant gap between the best known upper and lower bounds for this problem. It is NP-hard, and does not have a PTAS unless NP has subexponential time algorithms. On the other hand, the current best known algorithm of Feige, Kortsarz and Peleg, gives an approximation ratio of n^(1/3-epsilon) for some specific epsilon > 0 (estimated at around 1/60). We present an algorithm that for every epsilon > 0 approximates the Densest k-Subgraph problem within a ratio of n^(1/4+epsilon) in time n^O(1/epsilon). In particular, our algorithm achieves an approximation ratio of O(n^1/4) in time n^O(log n). Our algorithm is inspired by studying an average-case version of the problem where the goal is to distinguish random graphs from graphs with planted dense subgraphs. The approximation ratio we achieve for the general case matches the distinguishing ratio we obtain for this planted problem. At a high level, our algorithms involve cleverly counting appropriately defined trees of constant size in G, and using these counts to identify the vertices of the dense subgraph. Our algorithm is based on the following principle. We say that a graph G(V,E) has log-density alpha if its average degree is Theta(|V|^alpha). The algorithmic core of our result is a family of algorithms that output k-subgraphs of nontrivial density whenever the log-density of the densest k-subgraph is larger than the log-density of the host graph.

研究の動機と目的

Densest k-Subgraph (DkS) 問題における、既知の上界と下界の間の長年のギャップを解消すること。
従来のO(n^{1/3−ε})の境界を超える、多項式時間近似アルゴリズムを設計すること。
平均ケースの変種、すなわちランダムグラフとプラントド密度部分グラフを有するグラフの識別を検討し、一般アルゴリズムにインサイトを提供すること。
プラントド部分グラフモデルにおける識別閾値と一致する近似比を達成すること、特に密度の高いインスタンスにおいて。
特定のパrameter領域において、スペクトル法と半定値計画法(SDP)の緩和を用いたアルゴリズム的改善の探求。

提案手法

グラフGの対数密度αを、平均次数がΘ(|V|^α)となるように定義し、これにより密度の高い部分グラフを特徴付ける。
密度の高いk-頂点部分グラフが、ホストグラフよりも高い対数密度を持つ場合に、非自明な密度を持つk-頂点部分グラフを同定するアルゴリズム族を設計する。
定数サイズの木の数え上げを用いて、局所的な密度を推定し、密度の高い部分グラフの頂点選択をガイドする。
k > √nの場合に、隣接行列の2番目の固有値を用いることで、Dense vs Random問題における識別性能を向上させる。
DkSのためのSDP緩和を定式化し、平均次数Dのランダムグラフにおいて、SDP値の上界がw.h.p. k(√D + k²D/n)であることを証明する。
木の数え上げとスペクトル的・SDP的手法を組み合わせ、異なるパrameter領域で改善された近似比を達成する。

実験結果

リサーチクエスチョン

RQ1Densest k-Subgraph 問題に対して、多項式時間でO(n^{1/4+ε})-近似アルゴリズムを設計することは可能か？
RQ2ランダムグラフ内にプラントド密度k-部分グラフが埋め込まれた場合、その検出に可能な最良の識別比は何か？そして、一般DkS設定でもこれを達成できるか？
RQ3k > √nの場合に、スペクトル法（固有値解析）とSDP緩和は、Dense vs Random問題における近似保証をどのように改善するか？
RQ4対数密度フレームワークを、ランダムまたは半ランダムインスタンスなどの制限付き設定に拡張し、より良い近似比を達成できるか？
RQ5部分指数時間でO(n^{1/4})未満の近似比を達成することは可能か？また、現在の技術的限界は何か？

主な発見

本稿では、任意のε > 0に対して、時間計算量n^{O(1/ε)}でO(n^{1/4+ε})-近似を達成する。
O(n^{O(log n)})時間で実行した場合、アルゴリズムはO(n^{1/4})の近似比を達成する。
プラントド密度部分グラフ問題の識別比が、一般アルゴリズムの近似比と一致しており、これがタイトである可能性を示唆する。
k > √nの場合、2番目の固有値法は対数密度解析よりも優れた識別閾値を提供し、プラントド部分グラフの密度が√D + kD/nを超えると検出可能となる。
平均次数Dのランダムグラフにおいて、DkSのSDP緩和の上界はk(√D + k²D/n)であり、k > √nの領域では対数密度の境界を上回る。
アルゴリズムの性能はタイトであり、現在のアルゴリズム的技術では、多項式時間でo(n^{1/4})の識別比を達成することは不可能である。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。