Skip to main content
QUICK REVIEW

[논문 리뷰] Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study

Javiera Castillo-Navarro, Bertrand Le Saux|arXiv (Cornell University)|2020. 10. 15.
Remote-Sensing Image Classification참고 문헌 61인용 수 71
한 줄 요약

이 논문은 Earth Observation에서 반지도 학습 시맨틱 세그멘테이션을 위한 대규모 데이터셋인 MiniFrance를 도입하고, 데이터셋 대표성 및 외관을 분석하며, baselines로 다중 태스크 반지도 학습 네트워크를 연구한다.

ABSTRACT

The development of semi-supervised learning techniques is essential to enhance the generalization capacities of machine learning algorithms. Indeed, raw image data are abundant while labels are scarce, therefore it is crucial to leverage unlabeled inputs to build better models. The availability of large databases have been key for the development of learning algorithms with high level performance. Despite the major role of machine learning in Earth Observation to derive products such as land cover maps, datasets in the field are still limited, either because of modest surface coverage, lack of variety of scenes or restricted classes to identify. We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation, the MiniFrance suite. MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images, accounting for more than 200 billions samples (pixels); it is varied, covering 16 conurbations in France, with various climates, different landscapes, and urban as well as countryside scenes; and it is challenging, considering land use classes with high-level semantics. Nevertheless, the most distinctive quality of MiniFrance is being the only dataset in the field especially designed for semi-supervised learning: it contains labeled and unlabeled images in its training partition, which reproduces a life-like scenario. Along with this dataset, we present tools for data representativeness analysis in terms of appearance similarity and a thorough study of MiniFrance data, demonstrating that it is suitable for learning and generalizes well in a semi-supervised setting. Finally, we present semi-supervised deep architectures based on multi-task learning and the first experiments on MiniFrance.

연구 동기 및 목표

  • Introduce MiniFrance, a large-scale dataset for semi-supervised semantic segmentation in Earth Observation (EO).
  • Analyze data representativeness and appearance similarity to define suitable training/test splits.
  • Propose semi-supervised multi-task neural architectures and losses as baselines for EO segmentation.
  • Evaluate how unlabeled data improves generalization and provide public baselines for future work.

제안 방법

  • Describe MiniFrance properties: large-scale, 2,121 aerial images, 50 cm/px, 16 conurbations, 53,000 km2, 14 land-use classes.
  • Develop data representativeness tools to assess appearance similarity and domain coverage across cities (CNN feature extraction, t-SNE, one-class SVM, IoU/IoT metrics).
  • Propose semi-supervised network architectures that jointly optimize supervised segmentation and unsupervised tasks (multi-task learning) with shared parameters.
  • Formulate semi-supervised loss as L(x) = Ls(phi_s(x), y) + lambda Lu(phi_u(x), x).
  • Introduce BerundaNet (early/late) and W-Net architectures as multi-task baselines for semi-supervised learning in EO.

실험 결과

연구 질문

  • RQ1How representative are the labeled/unlabeled splits across cities for learning robust EO segmentation models?
  • RQ2Can unlabeled data improve segmentation generalization in EO when combined with labeled data in a multi-task framework?
  • RQ3What are effective network architectures and loss formulations for semi-supervised semantic segmentation in aerial imagery?
  • RQ4How do appearance and class distribution across locations affect learning and generalization in MiniFrance?

주요 결과

  • MiniFrance is a large-scale, diverse EO dataset with 2,121 RGB tiles (10,000 x 10,000 px) at 50 cm/px and 53,000 km2, covering 16 conurbations with 14 land-use classes.
  • The dataset includes labeled and unlabeled training partitions to mimic realistic semi-supervised learning scenarios and eight-city test split.
  • Appearance and class representativeness analyses show that training cities cover test appearances to a meaningful extent, though no single city mirrors another exactly.
  • Two analysis tools (IoU and IoT) quantify appearance similarity and coverage between cities, guiding partition design for semi-supervised learning.
  • TinyMiniFrance is introduced as a computationally affordable subset (3,500 images) preserving class proportions for rapid prototyping.
  • Proposed BerundaNet and W-Net multi-task architectures with unsupervised losses can serve as baselines for semi-supervised EO segmentation on MiniFrance.
  • The study provides baselines and methodological tools to promote future semi-supervised work on Earth observation data.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.