QUICK REVIEW

[논문 리뷰] XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification

Elahe Rahimian, Soheil Zabihi|arXiv (Cornell University)|2019. 11. 09.

Muscle activation and electromyography studies참고 문헌 23인용 수 38

한 줄 요약

희소 다채널 sEMG에서 제스처 분류를 위해 depthwise separable convolution, adaptive average pooling, mu-law normalization을 사용하는 심층 모델 XceptionTime을 제안하고 NinaPro DB1에서 93.91%를 달성했으며 파라미터를 감소시킵니다.

ABSTRACT

Capitalizing on the need for addressing the existing challenges associated with gesture recognition via sparse multichannel surface Electromyography (sEMG) signals, the paper proposes a novel deep learning model, referred to as the XceptionTime architecture. The proposed innovative XceptionTime is designed by integration of depthwise separable convolutions, adaptive average pooling, and a novel non-linear normalization technique. At the heart of the proposed architecture is several XceptionTime modules concatenated in series fashion designed to capture both temporal and spatial information-bearing contents of the sparse multichannel sEMG signals without the need for data augmentation and/or manual design of feature extraction. In addition, through integration of adaptive average pooling, Conv1D, and the non-linear normalization approach, XceptionTime is less prone to overfitting, more robust to temporal translation of the input, and more importantly is independent from the input window size. Finally, by utilizing the depthwise separable convolutions, the XceptionTime network has far fewer parameters resulting in a less complex network. The performance of XceptionTime is tested on a sub Ninapro dataset, DB1, and the results showed a superior performance in comparison to any existing counterparts. In this regard, 5:71% accuracy improvement, on a window size 200ms, is reported in this paper, for the first time.

연구 동기 및 목표

희소 다채널 sEMG 신호에서 제스처 인식의 도전과제를 다룬다.
데이터 증강이나 수작업 특징 추출 없이 시간적 및 공간적 특징을 포착하는 새로운 엔드투엔드 아키텍처를 개발한다.
높은 정확도와 윈도우 크기 독립성을 유지하면서 모델 복잡성을 감소시킨다.

제안 방법

깊이별 분리 합성 곱과 차원을 축소하는 병목을 사용한 병렬 경로의 XceptionTime 모듈을 도입한다.
잔차 연결이 있는 다수의 XceptionTime 모듈을 쌓아 아키텍처를 형성한다.
분류기에 adaptive average pooling을 사용하여 입력 윈도우 크기 독립성을 가능하게 한다.
강건성과 학습 속도를 향상시키기 위해 sEMG 신호에 비선형 mu-law 정규화를 적용한다.
Conv1x1 투영과 배치 정규화를 사용하여 특징을 52 제스처 클래스에 매핑하도록 학습한다.

실험 결과

연구 질문

RQ1깊이별 분리 합성 곱이 희소 sEMG 제스처 데이터에서 정확도를 유지하거나 향상시키면서 모델 파라미터 수를 줄일 수 있는가?
RQ2mu-law 비선형 정규화가 선형 정규화에 비해 분류 성능을 향상시키는가?
RQ3재구성 없이도 서로 다른 입력 윈도우 크기에 대해 XceptionTime 아키텍처가 견고한가?
RQ4NinaPro DB1 데이터셋에서 정확도와 복잡도 측면에서 XceptionTime이 최첨단 모델과 어떻게 비교되는가?

주요 결과

XceptionTime은 NinaPro DB1에서 10 채널 및 200 ms 윈도우로 93.91% 정확도를 달성한다.
깊이별 분리 합성 곱을 표준 합성 곱으로 교체(XceptionTime-V2)하면 매개변수가 413,516에서 1,918,476으로 증가하고 200 ms 윈도우에서 정확도가 하락한다(95.43% vs 94.59%).
Mu-law 정규화가 Minmax 정규화보다 더 높은 정확도를 가져온다(예: 50 ms 윈도우: mu-law 81.71% vs Minmax 71.49%).
적응적 풀링으로 인해 윈도우 길이가 다르더라도 아키텍처를 재구성하지 않고도 모델은 효과적이다.
학습 중 윈도우 크기를 혼합하여 훈련하면 50 ms를 제외한 대부분의 윈도우 길이에서 성능이 향상되어 시간적 시점 이동 및 윈도우 변동에 대한 내성이 있음을 시사한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.