QUICK REVIEW

[論文レビュー] PF-Net: Point Fractal Network for 3D Point Cloud Completion

Zitian Huang, Yikuan Yu|arXiv (Cornell University)|Mar 1, 2020

3D Shape Modeling and Analysis参考文献 33被引用数 38

ひとこと要約

PF-Netは入力の部分的な点群を保持し、マルチスケールの fractal様デコーダと対立的損失を用いて欠落領域を階層的に予測し、高忠実度の3D点群完成を実現する。

ABSTRACT

In this paper, we propose a Point Fractal Network (PF-Net), a novel learning-based approach for precise and high-fidelity point cloud completion. Unlike existing point cloud completion networks, which generate the overall shape of the point cloud from the incomplete point cloud and always change existing points and encounter noise and geometrical loss, PF-Net preserves the spatial arrangements of the incomplete point cloud and can figure out the detailed geometrical structure of the missing region(s) in the prediction. To succeed at this task, PF-Net estimates the missing point cloud hierarchically by utilizing a feature-points-based multi-scale generating network. Further, we add up multi-stage completion loss and adversarial loss to generate more realistic missing region(s). The adversarial loss can better tackle multiple modes in the prediction. Our experiments demonstrate the effectiveness of our method for several challenging point cloud completion tasks.

研究の動機と目的

Incompleteな3D点群を既存の点を変更せず robustly 修復する動機づけ。
空間配置を保持しつつ欠落ジオメトリを推定する階層的・マルチスケールの生成器を開発する。
novel multi-resolution encoder を用いて partial inputs から rich なマルチスケール特徴を抽出する。
欠落領域を Point Pyramid Decoder で生成し、 genus-wise の歪みを低減して詳細を保持する。
現実味を高め、多様な出力モードを扱うために multi-stage completion loss と adversarial loss を組み込む。

提案手法

Partial Point Cloud からのマルチスケール特徴を抽出するための Combined Multi-Layer Perception（CMLP）を備えた Multi-Resolution Encoder（MRE）を導入する。
エンコーダ用に Iterative Farthest Point Sampling（IFPS）を用いて複数の解像度で特徴点を取得する。
feature points に guided された three-scale の欠落領域点を出力する primary, secondary, そして detailed point layers を持つ階層的な Point Pyramid Decoder（PPD）を設計する。
欠落領域の予測を欠落領域の ground-truth subsamples と比較する複数解像度の completion loss を採用する。
Discriminator が PF-Net により現実味のある欠落領域点群を生成するよう導く adversarial loss で訓練する。
completion loss と adversarial loss を結合した joint objective で、幾何学的忠実度と現実性のバランスを取る。

実験結果

リサーチクエスチョン

RQ1部分的な点群を修復する際、既存の構造を保持しつつ欠落領域のみを予測できるか。
RQ2マルチレゾリューションの特徴点主導型エンコーダ-デコーダが局所・グローバルな幾何を完全に活用して、予測領域のディテールを向上させるか。
RQ3階層的で fractal-like なデコーダが genus-wise の歪みを低減し、欠落領域のディテール保持を改善するか。
RQ4 adversarial training がリアリズムを向上させ、点群完成における多モード予測の問題を減らすか。

主な発見

カテゴリ	LGAN-AE	PCN	3D-Capsule	PF-Net(vanilla)	PF-Net
Airplane	0.856 / 0.722	0.800 / 0.800	0.826 / 0.881	0.284 / 0.231	0.263 / 0.238
Bag	3.102 / 2.994	2.954 / 3.063	3.228 / 2.722	0.927 / 0.934	0.926 / 0.772
Cap	3.530 / 2.823	3.466 / 2.674	3.439 / 2.844	1.308 / 1.027	1.226 / 1.169
Car	2.232 / 1.687	2.324 / 1.738	2.503 / 1.913	0.616 / 0.431	0.599 / 0.424
Chair	1.541 / 1.473	1.592 / 1.538	1.678 / 1.563	0.472 / 0.420	0.487 / 0.427
Guitar	0.394 / 0.354	0.367 / 0.406	0.298 / 0.461	0.097 / 0.094	0.108 / 0.091
Lamp	3.181 / 1.918	2.757 / 2.003	3.271 / 1.912	1.041 / 0.616	1.037 / 0.640
Laptop	1.206 / 1.030	1.191 / 1.155	1.276 / 1.254	0.309 / 0.244	0.301 / 0.245
Motorbike	1.828 / 1.455	1.699 / 1.459	1.591 / 1.664	0.524 / 0.414	0.522 / 0.389
Mug	2.732 / 2.946	2.893 / 2.821	3.086 / 2.961	0.793 / 0.776	0.745 / 0.739
Pistol	1.113 / 0.967	0.968 / 0.958	1.089 / 1.086	0.270 / 0.237	0.252 / 0.244
Skateboard	0.887 / 1.020	0.816 / 1.206	0.897 / 1.262	0.289 / 0.288	0.225 / 0.172
Table	1.694 / 1.601	1.604 / 1.790	1.870 / 1.749	0.505 / 0.417	0.525 / 0.404
Mean	1.869 / 1.615	1.802 / 1.662	1.927 / 1.713	0.572 / 0.471	0.555 / 0.458
Category	LGAN-AE	PCN	3D-Capsule	PF-Net(vanilla)	PF-Net
Airplane	3.357 / 1.130	5.060 / 1.243	2.676 / 1.401	1.197 / 1.006	1.091 / 1.070
Bag	5.707 / 5.303	3.251 / 4.314	5.228 / 4.202	3.946 / 4.054	3.929 / 3.768
Cap	8.968 / 4.608	7.015 / 4.240	11.04 / 4.739	5.519 / 4.470	5.290 / 4.800
Car	4.531 / 2.518	2.741 / 2.123	5.944 / 3.508	2.537 / 1.848	2.489 / 1.839
Chair	7.359 / 2.339	3.952 / 2.301	3.049 / 2.207	1.998 / 1.828	2.074 / 1.824
Guitar	0.838 / 0.536	1.419 / 0.689	0.625 / 0.662	0.435 / 0.435	0.456 / 0.429
Lamp	8.464 / 3.627	11.61 / 7.139	9.912 / 5.847	5.252 / 3.059	5.122 / 3.460
Laptop	7.649 / 1.413	3.070 / 1.422	2.129 / 1.733	1.291 / 1.013	1.247 / 0.997
Motorbike	4.914 / 2.036	4.962 / 1.922	8.617 / 2.708	2.229 / 1.876	2.206 / 1.775
Mug	6.139 / 4.735	3.590 / 3.591	5.155 / 5.168	3.228 / 3.332	3.138 / 3.238
Pistol	3.944 / 1.424	4.484 / 1.414	5.980 / 1.782	1.267 / 1.012	1.122 / 1.055
Skateboard	5.613 / 1.683	3.025 / 1.740	11.49 / 2.044	1.198 / 1.257	1.136 / 1.337
Table	2.658 / 2.484	2.503 / 2.452	3.929 / 3.098	2.184 / 1.928	2.235 / 1.934
Mean	5.395 / 2.603	4.360 / 2.661	5.829 / 3.008	2.483 / 2.086	2.426 / 2.117

PF-Net は全体的な完成品質と欠落領域の品質の両方で、ほとんどのカテゴリにおいてベースライン手法を上回る。
Discriminator の組み込みにより、多くのカテゴリで予測品質が向上する。
CMLP と MR-CMLP は特徴抽出性能を高め、PF-Net は PPD デコーダを通じてディテール保持をさらに改善。
モデルは欠落入力の異なる程度（25%、50%、75%）に対して堅牢であり、複数の欠損部にも対応可能。
定量的結果は PF-Net および PF-Net (vanilla) が多くのカテゴリで Pred→GT および GT→Pred エラーを低くし、13カテゴリの平均でも優位を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。