QUICK REVIEW

[논문 리뷰] Pareto Multi-Task Learning

Xi Lin, Hui‐Ling Zhen|arXiv (Cornell University)|2019. 12. 30.

Advanced Multi-Objective Optimization Algorithms인용 수 52

한 줄 요약

Pareto MTL은 서로 다른 트레이드오프 선호를 가진 다수의 하위 문제로 다중 작업 학습 문제를 분해하여 서로 다른 작업 트레이드오프를 대표하는 다양한 Pareto 최적 해 집합을 생성합니다.

ABSTRACT

Multi-task learning is a powerful method for solving multiple correlated tasks simultaneously. However, it is often impossible to find one single solution to optimize all the tasks, since different tasks might conflict with each other. Recently, a novel method is proposed to find one single Pareto optimal solution with good trade-off among different tasks by casting multi-task learning as multiobjective optimization. In this paper, we generalize this idea and propose a novel Pareto multi-task learning algorithm (Pareto MTL) to find a set of well-distributed Pareto solutions which can represent different trade-offs among different tasks. The proposed algorithm first formulates a multi-task learning problem as a multiobjective optimization problem, and then decomposes the multiobjective optimization problem into a set of constrained subproblems with different trade-off preferences. By solving these subproblems in parallel, Pareto MTL can find a set of well-representative Pareto optimal solutions with different trade-off among all tasks. Practitioners can easily select their preferred solution from these Pareto solutions, or use different trade-off solutions for different situations. Experimental results confirm that the proposed algorithm can generate well-representative solutions and outperform some state-of-the-art algorithms on many multi-task learning applications.

연구 동기 및 목표

작업 간 충돌로 인한 트레이드오프를 고려한 다중 작업 학습의 필요성을 동기 부여한다.
다양한 Pareto 집합을 생성하여 단일 최적 해의 한계를 해결한다.
다양한 트레이드오프에 걸쳐 잘 대표되는 Pareto 해를 얻기 위해 Pareto MTL를 제안한다.
다양한 다중 작업 학습 문제에서 적용 가능성과 성능 향상을 보여준다.

제안 방법

MTL을 다목적 최적화로 수립하고 일련의 unit 선호 벡터를 사용하여 K개의 하위 문제로 분해한다.
목표 공간에 부분 영역을 정의하여 각 하위 문제가 서로 다른 트레이드오프(Omega_k)를 목표로 하도록 한다.
확장 가능한 이중(dual) 형식으로 제약 다목적 하위 문제를 해결하기 위해 기울기 기반 방법을 적용한다.
전체 매개변수 공간이 아니라 Lagrange 승수 차원에서 축소된 이중 문제를 해결하여 확장 가능한 최적화 접근법을 도출한다.
다양한 Pareto 최적 해를 얻기 위해 모든 하위 문제를 병렬로 해결한다.
Pareto MTL은 동적 가중치를 가지는 적응적 선형 스칼라화에 해당함을 보인다.

실험 결과

연구 질문

RQ1MTL 문제를 하위 문제로 분해하여 작업 간 다양한 트레이드오프를 포괄할 수 있는가?
RQ2가중치를 전수 조사하지 않고도 서로 다른 선호를 나타내는 잘 분포된 Pareto 해 집합을 얻을 수 있는가?
RQ3대형 모델과 데이터 세트에 적용할 때 Pareto MTL의 확장성은 어느 정도인가?
RQ4해결책의 다양성과 성능 측면에서 Pareto MTL이 최첨단 MTL 방법들과 어떻게 비교되는가?

주요 결과

Pareto MTL은 서로 다른 작업 트레이드오프를 나타내는 잘 분포된 Pareto 최적 해 집합을 성공적으로 생성한다.
이 방법은 여러 응용 분야에서 일부 최첨단 MTL 알고리즘보다 우수한 성능을 보인다.
극단적 선호 하위 문제는 해당 작업에 대해 최상의 성능을 제공합니다.
MultiMNIST, MultiFashionMNIST, 및 자율주행 로컬라이제이션에 대한 실험은 작업 간 견고한 이득을 보여준다.
하위 문제는 병렬로 해결될 수 있어 심층 모델에 대한 확장 가능한 최적화를 가능하게 한다.
이 방법은 이중 제약 문제를 해결하여 결정된 가중치와 함께 적응적 선형 스칼라화로 볼 수 있다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.