QUICK REVIEW

[논문 리뷰] A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment Analysis

Bing Wang, Liang Ding|arXiv (Cornell University)|2022. 04. 16.

Sentiment Analysis and Opinion Mining인용 수 28

한 줄 요약

C3da는 교차 채널 데이터 증강과 대조 학습 프레임워크를 사용하여 T5 기반 생성기로 도메인 내 다면적 ABSA 샘플을 생성하고, 베이스라인 대비 정확도와 Macro-F1을 향상시킵니다.

ABSTRACT

Aspect-based sentiment analysis (ABSA) is a fine-grained sentiment analysis task, which focuses on detecting the sentiment polarity towards the aspect in a sentence. However, it is always sensitive to the multi-aspect challenge, where features of multiple aspects in a sentence will affect each other. To mitigate this issue, we design a novel training framework, called Contrastive Cross-Channel Data Augmentation (C3 DA), which leverages an in-domain generator to construct more multi-aspect samples and then boosts the robustness of ABSA models via contrastive learning on these generated data. In practice, given a generative pretrained language model and some limited ABSA labeled data, we first employ some parameter-efficient approaches to perform the in-domain fine-tuning. Then, the obtained in-domain generator is used to generate the synthetic sentences from two channels, i.e., Aspect Augmentation Channel and Polarity Augmentation Channel, which generate the sentence condition on a given aspect and polarity respectively. Specifically, our C3 DA performs the sentence generation in a cross-channel manner to obtain more sentences, and proposes an Entropy-Minimization Filter to filter low-quality generated samples. Extensive experiments show that our C3 DA can outperform those baselines without any augmentations by about 1% on accuracy and Macro- F1. Code and data are released in https://github.com/wangbing1416/C3DA.

연구 동기 및 목표

ABSA의 다면적 도전을 데이터 증강을 통해 도메인 내 다면적 샘플을 생성하는 방향으로 해결하는 것.
도메인 내 문장 생성을 위한 생성기를 적은 매개변수로 미세 조정하는 방법을 활용하는 것.
교차 채널 생성과 엔트로피 기반 필터링을 통합하여 고품질의 증가 데이터를 생성하는 것.
문장 내 서로 다른 면의 임베딩을 보다 잘 구분하도록 대조 학습 목표를 적용하는 것.
다중 백본에 걸친 표준 ABSA 데이터셋에서 강건성 개선을 입증하는 것

제안 방법

증강 생성기로서 매개변수 효율적 튜닝(prompt/prefix/LoRA)를 가진 사전 학습된 T5 인코더-디코더를 사용한다.
생성기가 (i) 주어진 문장과 면, (ii) 주어진 문장과 극성에 조건화된 문장을 생성하도록 학습한다.
생성된 출력을 대안 채널에 다시 입력하여 추가적으로 다양한 샘플을 만드는 교차 채널 생성을 수행한다.
Prediction entropy를 기반으로 하는 Entropy-Minimization Filter(EMF)로 저품질 합성 문장을 필터링한다.
원본 데이터와 증가된 데이터에 대한 감독적 교차 엔트로피와 문장 내 서로 다른 면의 임베딩을 구분하도록 하는 대조 삼중 항을 결합한 목표로 학습한다.
주요 식은 다음과 같다: L_SCT = (1/N) sum CE(h_i W_s + b_s, p_i) + α CE(h_i^p W_s + b_s, p_i); R_CT = (1/N) sum max{ d(h_i, h_i^p) − d(h_i, h_i^n) + ξ, 0 }; L = L_SCT + β R_CT.]
research_questions':['데이터 부족 상황에서 도메인 내 교차 채널 데이터 증강이 ABSA 성능을 향상시킬 수 있는가?','면-조건부 및 극성-조건부인 교차 채널 생성이 다면적 표현 학습에 어떤 영향을 미치는가?','엔트로피 기반 샘플 필터를 도입하면 증가 데이터의 품질과 활용도가 향상되는가?','대조 학습 목표가 면-분리 임베딩에 미치는 영향은 무엇인가?','C3da의 이득은 ABSA 데이터셋과 백본 모델에 따라 어떻게 달라지는가?']
key_findings':['C3da는 Restaurant, Laptop, Twitter 데이터셋에서 BERT 기반 및 RoBERTa 기반 베이스라인을 지속적으로 개선합니다.','평균적으로 RoBERTa 기반 모델은 Augmentation를 포함한 베이스라인 대비 정확도 또는 Macro-F1에서 약 0.87%에서 2.22%의 절대 이득을 봅니다.','C3da는 Macro-F1 이득이 더 커 일반화 및 다면적 상호작용에 대한 강건성이 향상되었음을 시사합니다.','데이터 증강 + 대조 학습 및 EMF가 성능에 기여하며, AAC와 PAC 모두 유익하며 AAC가 약간 더 큰 영향을 미칩니다.','사례 연구에서 모호한 문장의 극성 분포 안정성이 C3da를 사용할 때 개선되었음을 보여줍니다.']
table_headers':['Model','Restaurant acc','Restaurant F1','Laptop acc','Laptop F1','Twitter acc','Twitter F1'],
table_rows":[[

실험 결과

주요 결과

모델	Restaurant acc	Restaurant F1	Laptop acc	Laptop F1	Twitter acc	Twitter F1
BERT-base	86.31	80.22	79.66	76.11	76.50	75.23
+ EDA	86.42	79.63	79.59	75.79	76.26	75.16
+ BT	86.47	80.29	79.67	76.35	76.63	75.47
+ CBERT	86.27	80.00	79.83	76.12	76.44	75.36
+ SCon	86.51	80.55	80.23	76.48	-	-
+ C3da (Ours)	86.93	81.23	80.61	77.11	77.55	76.53
RoBERTa-base	86.38	80.29	80.10	76.24	76.63	75.37
+ EDA	86.43	80.21	80.38	76.59	76.47	75.36
+ BT	86.50	80.59	80.22	76.73	76.59	75.47
+ CBERT	86.77	80.51	80.54	76.57	76.73	75.37
+ C3da	87.11	81.63	81.83	78.46	78.31	76.67

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.