QUICK REVIEW

[论文解读] Efficient feature learning and multi-size image steganalysis based on CNN

Ru Zhang, Feng Zhu|arXiv (Cornell University)|Jul 30, 2018

Advanced Steganography and Watermarking Techniques参考文献 34被引用 29

一句话总结

本文提出Zhu-Net，一种基于CNN的隐写分析框架，通过优化特征学习并支持多尺寸图像检测，提升了性能。通过使用更小的3×3卷积核、可分离卷积以提高信噪比，并采用空间金字塔池化（SPP）实现任意图像尺寸的兼容性，该模型在多个数据集上对WOW和S-UNIWARD隐写术实现了最先进（SOTA）的检测准确率。

ABSTRACT

For steganalysis, many studies showed that convolutional neural network has better performances than the two-part structure of traditional machine learning methods. However, there are still two problems to be resolved: cutting down signal to noise ratio of the steganalysis feature map and steganalyzing images of arbitrary size. Some algorithms required fixed size images as the input and had low accuracy due to the underutilization of the noise residuals obtained by various types of filters. In this paper, we focus on designing an improved network structure based on CNN to resolve the above problems. First, we use 3x3 kernels instead of the traditional 5x5 kernels and optimize convolution kernels in the preprocessing layer. The smaller convolution kernels are used to reduce the number of parameters and model the features in a small local region. Next, we use separable convolutions to utilize channel correlation of the residuals, compress the image content and increase the signal-to-noise ratio (between the stego signal and the image signal). Then, we use spatial pyramid pooling (SPP) to aggregate the local features, enhance the representation ability of features, and steganalyze arbitrary size image. Finally, data augmentation is adopted to further improve network performance. The experimental results show that the proposed CNN structure is significantly better than other four methods such as SRM, Ye-Net, Xu-Net, and Yedroudj-Net, when it is used to detect two spatial algorithms such as WOW and S-UNIWARAD with a wide variety of datasets and payloads.

研究动机与目标

通过优化卷积核设计，解决隐写分析特征图中信噪比低的问题。
通过利用通道与空间相关性，借助可分离卷积提升特征表示能力。
通过空间金字塔池化（SPP）实现对任意尺寸图像的精确隐写分析。
通过大规模数据增强提升模型泛化能力并减少过拟合。
在空域隐写术上，实现优于现有基于CNN的隐写分析器的检测性能。

提出的方法

在预处理层中，将传统的5×5卷积核替换为更小的3×3卷积核，以减少参数量并提升局部特征建模能力。
在首个卷积层中初始化30个SRM滤波器，并在训练过程中进行微调，以增强残差特征提取能力。
引入两个可分离卷积模块，用于建模残差中的空间与通道相关性，从而提高信噪比。
用空间金字塔池化（SPP）模块替代全局平均池化，以提取多层级特征并支持任意输入图像尺寸。
通过保持标签不变的翻转与旋转进行数据增强，以提升训练数据多样性并减少过拟合。
在逐步扩大的数据集（BOSS、BOSS+BOWS2、BOSS+BOWS2+DA）上训练网络，以评估可扩展性与性能增益。

实验结果

研究问题

RQ1更小的卷积核与优化的初始化是否能提升隐写分析中特征学习效率与检测准确率？
RQ2可分离卷积在建模隐写分析特征中残差相关性方面，能在多大程度上提升信噪比？
RQ3空间金字塔池化是否能有效实现对任意尺寸图像的隐写分析，且不损失准确率？
RQ4数据增强在基于CNN的隐写分析中，对模型泛化能力与过拟合的影响如何？
RQ5所提出的网络架构是否在多种隐写算法与载荷下，全面超越现有基于CNN的隐写分析器？

主要发现

在经过增强的BOSS+BOWS2+DA数据集上训练后，Zhu-Net在0.2 bpp载荷下对WOW的检测错误率降低了10.2%，对S-UNIWARD降低了11.4%，优于Ye-Net与Yedroudj-Net。
仅使用BOSS训练集时，Zhu-Net对WOW的错误率较基线模型降低5.5%，对S-UNIWARD降低3.6%。
可分离卷积的使用显著提升了信噪比，有效捕捉了残差中的空间与通道相关性。
空间金字塔池化通过将可变尺寸的特征图映射为固定长度表示，实现了对任意尺寸图像的精确隐写分析。
通过翻转与旋转进行的数据增强有效减少了过拟合并提升了泛化能力，所有测试的隐写算法均表现出性能增益。
Zhu-Net实现了最先进（SOTA）的检测准确率，在多个数据集与载荷下甚至超越了高级人工设计特征集（如SRM）的表现。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。