QUICK REVIEW

[논문 리뷰] Semi-supervised Federated Learning for Activity Recognition

Yuchen Zhao, Hanyang Liu|arXiv (Cornell University)|2020. 11. 02.

Privacy-Preserving Technologies in Data참고 문헌 47인용 수 35

한 줄 요약

이 논문은 IoT 엣지 디바이스에서 라벨이 없는 데이터를 활용해 표현을 학습하는 로컬 오토인코더와 라벨링된 표현으로 학습된 클라우드 기반 감독 분류기를 사용하는 반-감독 연합학습(Federated Learning) 프레임워크를 제안한다.

ABSTRACT

Training deep learning models on in-home IoT sensory data is commonly used to recognise human activities. Recently, federated learning systems that use edge devices as clients to support local human activity recognition have emerged as a new paradigm to combine local (individual-level) and global (group-level) models. This approach provides better scalability and generalisability and also offers better privacy compared with the traditional centralised analysis and learning models. The assumption behind federated learning, however, relies on supervised learning on clients. This requires a large volume of labelled data, which is difficult to collect in uncontrolled IoT environments such as remote in-home monitoring. In this paper, we propose an activity recognition system that uses semi-supervised federated learning, wherein clients conduct unsupervised learning on autoencoders with unlabelled local data to learn general representations, and a cloud server conducts supervised learning on an activity classifier with labelled data. Our experimental results show that using a long short-term memory autoencoder and a Softmax classifier, the accuracy of our proposed system is higher than that of both centralised systems and semi-supervised federated learning using data augmentation. The accuracy is also comparable to that of supervised federated learning systems. Meanwhile, we demonstrate that our system can reduce the number of needed labels and the size of local models, and has faster local activity recognition speed than supervised federated learning does.

연구 동기 및 목표

개인정보 보호 학습을 통해 엣지 IoT에서 HAR을 추진한다.
연합 학습 설정에서 엣지 디바이스의 라벨링 데이터 부족 문제를 다룬다.
라벨이 없는 데이터에서 표현을 학습하기 위해 오토인코더를 활용하는 반-감독 FL 파이프라인을 개발한다.
서버의 표현 크기와 라벨링 데이터가 성능에 미치는 영향을 평가한다.
Raspberry Pi와 같은 저가형 엣지 하드웨어에서 실현 가능성과 효율성을 시연한다.

제안 방법

클라이언트는 라벨이 없는 시계열 데이터에서 지역적으로 오토인코더를 학습시켜 표현을 얻는다.
서버는 FedAvg로 지역 오토인코더를 집계하여 글로벌 오토인코더를 구성한다.
서버는 글로벌 오토인코더의 인코더를 사용해 라벨링된 데이터를 표현으로 변환하여 분류기의 감독 학습에 활용한다.
세 가지 오토인코더 스킴을 평가한다: 단순 오토인코더 (FC), 컨볼루션 오토인코더 (CNN), LSTM-오토인코더 (LSTM-FC).
기준선과 비교: 중앙집중식 감독 학습 (CS) 및 데이터 증강 기반 반-감독 FL (DA).
성능에 미치는 라벨 비율과 표현 압축 비율의 영향을 평가한다.
Raspberry Pi 4 Model B에서 시스템의 로컬 추론을 배포하고 테스트한다.

실험 결과

연구 질문

RQ1Q1: 오토인코더를 사용한 반-감독 FL이 중앙집중식 감독 학습과 비교하여 어떤가요?
RQ2Q2: 오토인코더를 활용한 반-감독 FL이 데이터 증강을 활용한 반-감독 FL과 비교하여 어떤가요?
RQ3Q3: 오토인코더를 이용한 반-감독 FL이 감독형 FL과 비교하여 어떤가요?
RQ4Q4: 서버의 라벨 크기와 표현 크기가 HAR 성능에 어떻게 영향을 미치나요?
RQ5Q5: 저가형 엣지 디바이스에서 반-감독 FL의 효율성은 어느 정도인가요?

주요 결과

LSTM 오토인코더와 Softmax 분류기로 구성된 반-감독 FL은 중앙 집중식 감독 시스템보다 더 높은 정확도를 달성한다.
제안된 시스템은 데이터 증강 기반 반-감독 FL (DA)보다 성능이 우수하다.
이 방법은 더 적은 라벨과 더 작은 로컬 모델을 사용하면서 감독 FL과 비교 가능한 정확도를 달성한다.
오토인코더를 사용하면 로컬 모델 크기가 감소하고 서버로의 업로드 트래픽을 줄일 수 있다.
Raspberry Pi 4 Model B에서의 추론은 감독 FL과 비교하여 실시간 처리 시간이 허용 가능한 수준으로 가능하다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.