Skip to main content
QUICK REVIEW

[論文レビュー] Deep Temporal Clustering : Fully Unsupervised Learning of Time-Domain Features

Naveen Sai Madiraju, Seid M. Sadat|arXiv (Cornell University)|Feb 4, 2018
Time Series Analysis and Forecasting参考文献 12被引用数 107
ひとこと要約

DTC は時系列の時間次元縮約とクラスタリングをエンドツーエンドの深層フレームワークで同時学習し、時系列自己符号化器と新規クラスタリング層を用い、様々な時系列類似度指標を活用;ベースラインを上回る。

ABSTRACT

Unsupervised learning of time series data, also known as temporal clustering, is a challenging problem in machine learning. Here we propose a novel algorithm, Deep Temporal Clustering (DTC), to naturally integrate dimensionality reduction and temporal clustering into a single end-to-end learning framework, fully unsupervised. The algorithm utilizes an autoencoder for temporal dimensionality reduction and a novel temporal clustering layer for cluster assignment. Then it jointly optimizes the clustering objective and the dimensionality reduction objec tive. Based on requirement and application, the temporal clustering layer can be customized with any temporal similarity metric. Several similarity metrics and state-of-the-art algorithms are considered and compared. To gain insight into temporal features that the network has learned for its clustering, we apply a visualization method that generates a region of interest heatmap for the time series. The viability of the algorithm is demonstrated using time series data from diverse domains, ranging from earthquakes to spacecraft sensor data. In each case, we show that the proposed algorithm outperforms traditional methods. The superior performance is attributed to the fully integrated temporal dimensionality reduction and clustering criterion.

研究の動機と目的

  • Motivate unsupervised learning for complex time-series where labels are unavailable or unreliable.
  • Develop an end-to-end architecture that jointly optimizes reconstruction and clustering losses.
  • Enable flexible temporal similarity metrics within a learnable clustering layer.
  • Provide visualization to localize and explain temporal events detected by clusters.
  • Demonstrate superior performance over traditional temporal clustering methods across diverse datasets.

提案手法

  • Encode time-series into a latent space using a temporal autoencoder (CNN for short-term features + BI-LSTM for multi-scale temporal dynamics).
  • Introduce a temporal clustering layer that assigns latent representations to k clusters via a similarity metric.
  • Compute cluster assignments using a KL-divergence loss between a soft assignment q and a target distribution p.
  • Train end-to-end by jointly minimizing reconstruction loss (MSE) and clustering loss (KL divergence).
  • Initialize cluster centroids via hierarchical clustering on latent features and refine them during training.
  • Provide heatmap-based visualization to localize informative time regions contributing to cluster assignments.

実験結果

リサーチクエスチョン

  • RQ1Can end-to-end joint optimization of reconstruction and clustering yields superior unsupervised temporal clustering compared to separate steps?
  • RQ2How do different temporal similarity metrics affect clustering performance in a deep temporal clustering framework?
  • RQ3Is the proposed DTC robust across diverse real-world time-series domains and dataset sizes?
  • RQ4Can the model localize temporal events within unlabeled sequences to aid interpretation?

主な発見

Dataset(N, L, r, P)ACFDTC ACFCIDDTC CIDCORDTC COREUCLDTC EUCLkshape
NASA MMS (104,1140,1.21,10)0.510.590.850.930.650.670.560.690.61
BeetleFly (40,512,1.00,8)0.550.690.800.8920.550.5840.550.6060.65
BirdChicken (40,512,1.00,8)0.700.7920.600.7320.550.7120.550.7720.52
Computers (500,720,1.00,10)0.580.640.510.680.550.5550.550.580.58
Earthquakes (461,512,0.25,8)0.5080.5690.5080.5880.5460.5490.5460.5400.59
MoteStrain (1272,84,0.86,4)0.680.890.570.810.600.930.600.930.88
Phalanges OutlinesCorrect (2658,80,1.77,4)0.5860.5220.5010.5290.5010.5250.5010.5560.56
ProximalPhalanx OutlineCorrect (891,80,2.12,4)0.520.6780.50.660.520.620.520.630.65
ShapeletSim (200,500,1.00,10)0.540.740.830.910.520.550.520.530.56
SonyAIBO RobotSurfaceII (980,65,1.61,5)0.560.720.8270.850.570.820.570.830.65
SonyAIBO RobotSurface (621,70,0.78,5)0.740.940.510.810.580.780.580.660.74
ItalyPower Demand (1096,24,1,4)0.590.630.600.660.540.570.540.610.52
WormsTwoClass (258,900,1.37,10)0.530.620.590.610.550.560.550.510.55
  • Joint end-to-end training significantly improves clustering performance compared to disjoint training (AUC 0.93 vs 0.88 on MMS dataset).
  • DTC outperforms baseline temporal clustering methods (k-Shape and hierarchical clustering) across 13 datasets and multiple similarity metrics.
  • DTC shows robust improvements across domains with varying sequence lengths and class imbalances.
  • Heatmap visualizations localize event locations in time, providing interpretable explanations for cluster assignments.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。