Skip to main content
QUICK REVIEW

[Paper Review] Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground

Deng-Ping Fan, Ming‐Ming Cheng|arXiv (Cornell University)|Mar 16, 2018
Visual Attention and Saliency Detection39 references43 citations
TL;DR

The paper introduces the SOC dataset (Salient Objects in Clutter) to address data bias in salient object detection, provides a comprehensive benchmark of CNN-based SOD models on this dataset, and analyzes performance across multiple real-world attributes.

ABSTRACT

We provide a comprehensive evaluation of salient object detection (SOD) models. Our analysis identifies a serious design bias of existing SOD datasets which assumes that each image contains at least one clearly outstanding salient object in low clutter. The design bias has led to a saturated high performance for state-of-the-art SOD models when evaluated on existing datasets. The models, however, still perform far from being satisfactory when applied to real-world daily scenes. Based on our analyses, we first identify 7 crucial aspects that a comprehensive and balanced dataset should fulfill. Then, we propose a new high quality dataset and update the previous saliency benchmark. Specifically, our SOC (Salient Objects in Clutter) dataset, includes images with salient and non-salient objects from daily object categories. Beyond object category annotations, each salient image is accompanied by attributes that reflect common challenges in real-world scenes. Finally, we report attribute-based performance assessment on our dataset.

Motivation & Objective

  • Identify biases in existing SOD datasets that overestimate performance in idealized, low-clutter scenes.
  • Create a realistic, large-scale SOD dataset (SOC) including salient and non-salient images with instance-level annotations and attributes.
  • Benchmark major CNN-based SOD models on SOC to reveal generalization gaps and guide future research.
  • Provide attribute-based performance analysis to understand model strengths/weaknesses under real-world challenges.

Proposed method

  • Define seven criteria for a realistic and balanced SOD dataset.
  • Assemble SOC with 6,000 images (3,000 salient, 3,000 non-salient) across 80+ categories, including non-salient images and instance-level annotations.
  • Annotate salient objects with high-quality pixel-level masks and provide per-image attributes (e.g., motion blur, occlusion, clutter).
  • Evaluate representative single-task and multi-task CNN-based SOD models on SOC using pixel-wise accuracy, region similarity (F-measure), and structure similarity (S-measure).
  • Conduct attribute-based performance evaluation to analyze model performance under specific scene challenges.
  • Release the dataset and benchmarking tools publicly.

Experimental results

Research questions

  • RQ1How do current SOD models perform on realistic scenes with clutter and non-salient images, compared to existing benchmarks?
  • RQ2What dataset design biases exist in prior SOD datasets, and how does SOC address them?
  • RQ3How do salient-object attributes (e.g., motion blur, occlusion, clutter) affect model performance across different architectures?
  • RQ4Can attribute-based benchmarking reveal weaknesses of state-of-the-art SOD models and guide future research directions?

Key findings

  • SOC is the largest instance-level SOD dataset at the time of publication, with 6,000 images (3,000 salient, 3,000 non-salient) across 80+ categories.
  • SOC includes high-quality instance-level saliency masks and object attributes reflecting real-world challenges, enabling richer analysis than prior datasets.
  • Benchmark results show that top-performing models on existing datasets do not achieve satisfactory performance on SOC, highlighting a realism gap.
  • Attribute-based evaluation demonstrates how performance degrades for challenges like large objects, clutter, occlusion, and other specified attributes, guiding future model improvements.
  • Multi-task and weakly supervised models show promise but still lag behind fully supervised single-task models on SOC, suggesting directions for robust, real-world SOD.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.