QUICK REVIEW

[논문 리뷰] An introduction to domain adaptation and transfer learning

Wouter M. Kouw, Loog, Marco|arXiv (Cornell University)|2018. 12. 31.

Domain Adaptation and Few-Shot Learning참고 문헌 213인용 수 244

한 줄 요약

이 기술 보고서는 소스 도메인에서 훈련된 분류기가 언제 어떻게 타깃 도메인으로 일반화될 수 있는지에 대해 조사하며, 간단한 데이터 시프트와 적응 전략을 상세히 다룬다.

ABSTRACT

In machine learning, if the training data is an unbiased sample of an underlying distribution, then the learned classification function will make accurate predictions for new samples. However, if the training data is not an unbiased sample, then there will be differences between how the training data is distributed and how the test data is distributed. Standard classifiers cannot cope with changes in data distributions between training and test phases, and will not perform well. Domain adaptation and transfer learning are sub-fields within machine learning that are concerned with accounting for these types of changes. Here, we present an introduction to these fields, guided by the question: when and how can a classifier generalize from a source to a target domain? We will start with a brief introduction into risk minimization, and how transfer learning and domain adaptation expand upon this framework. Following that, we discuss three special cases of data set shift, namely prior, covariate and concept shift. For more complex domain shifts, there are a wide variety of approaches. These are categorized into: importance-weighting, subspace mapping, domain-invariant spaces, feature augmentation, minimax estimators and robust algorithms. A number of points will arise, which we will discuss in the last section. We conclude with the remark that many open questions will have to be addressed before transfer learners and domain-adaptive classifiers become practical.

연구 동기 및 목표

도메인 적응과 전이 학습의 일반적 문제와 왜 표준 분류기가 분포 변화하에서 실패하는지 설명한다.
도메인을 정의하고 교차 도메인 일반화 위험 한계를 형식화한다.
간단한 데이터 시프트(사전/ prior, 공변량 covariate, 개념 concept)를 특징화하고 대응하는 적응 전략을 논의한다.
더 복잡한 도메인 시프트에 대한 접근 방법의 분류 체계(예: 중요도 가중치, 부분공간 매핑, 도메인 불변 공간, 특징 증강, 미니맥스 추정기, 강건한 알고리즘)를 제시한다.
이전 학습자와 도메인 적응 분류기를 실현 가능하게 만드는 데 있어 남아 있는 미해결 질문과 실무상의 도전을 강조한다.

제안 방법

위험 최소화 및 실험적 위험 프레임워크에 토대를 둔다.
HΔH-발산(HΔH-divergence)과 결합 오차 e*_{S,T} 같은 척도를 사용하여 교차 도메인 설정에 대한 일반화 한계를 도입한다.
사전 시프트를 포함한 이동에서도 중요도 가중치와 결합 분포의 비율 조정(R_T(h) 및 관련 형태)을 사용하여 타깃 위험 추정치를 도출한다.
데이터 시프트를 사전( prior ), 공변량( covariate ), 개념( concept ) 시프트로 분류하고 대응하는 적응 기법을 제시한다.
더 복잡한 도메인 시프트에 걸친 적응 방법의 구조화된 개요를 제공한다(예: 도메인 불변 공간, 특징 증강, 미니맥스 추정기, 강건한 방법들).
실무에서 전이 학습을 적용하는 데 따른 한계와 남아 있는 미해결 질문을 논의한다.

실험 결과

연구 질문

RQ1어떤 조건에서 소스 도메인에서 학습된 분류기가 타깃 도메인으로 일반화될 수 있는가?
RQ2다양한 유형의 데이터 시프트(사전, 공변량, 개념)가 교차 도메인 일반화 위험에 어떤 영향을 미치는가?
RQ3도메인 시프트를 다루기 위한 주요 적응 전략은 무엇이며, 위험 최소화 원칙과 어떻게 관련되는가?
RQ4도메인 적응에서 소스에서 학습된 성능과 타깃 도메인 성능을 연결하는 이론적 한계는 무엇인가?
RQ5전이 학습자를 실효성 있게 만들기 위한 핵심 미해결 질문과 실무상의 장애물은 무엇인가?

주요 결과

소스와 타깃 도메인 간의 관계가 명시될 때 교차 도메인 일반화 한계는 확립될 수 있으며, 예를 들어 HΔH-발산과 이상적인 결합 오차 e*_{S,T}를 통해 가능하다.
소스에서 학습된 분류기의 타깃 오차와 최적 타깃 분류기의 타깃 오차의 차이는 결합 오차의 합, 도메인 발산, 그리고 복잡도 항의 합으로 한정된다.
사전 시프트, 공변량 시프트, 개념 시프트 각각은 레이블이 달린 타깃 데이터가 없는 경우에 타깃 위험을 추정하기 위한 고유한 재가중 또는 상쇄 기술을 허용한다(적용 가능할 때).
공변량 시프트와 사전 시프트는 타깃 확률을 반영하도록 샘플링 가중치나 결합 분포 비를 조정함으로써 다룰 수 있다.
개념 시프트는 레이블이 달린 타깃 데이터가 없으면 여전히 도전적이며, 타깃 관찰에 의존하는 조건부 분포의 추정이 필요하다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.