Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection

Dec 10, 2019 2 min read Anomaly Detection


1. What is it?

$$ \hat{x} = f_d(\hat{z}; \theta_d) $$

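The variables are defined as follows:
- $M \in \mathbb{R}^{N \times C}$: the memory matrix
- $m_i$: the $i$-th row vector of $M$
- $N$: the number of memory items
- $C$: the dimension of $\hat{z}$ (in the paper, equal to the dimension of $z$)
- $w \in \mathbb{R}^{1 \times N}$: the attention weight vector

The similarity between the encoder output $z$ and each memory item $m_i$ is computed (cosine similarity, i.e. a normalized inner product), and a softmax over these similarities gives $w$: $$ w_i = \frac{\exp(d(z, m_i))}{\sum_{j=1}^{N}\exp(d(z, m_j))} $$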

$$ d(z, m_i) = \frac{z m_i^T}{\|z\| \, \|m_i\|} $$
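The addressing step can be sketched in a few lines of plain Python. This is only an illustration of the two formulas for $w_i$ and $d(z, m_i)$, not the authors' implementation; the function names `cosine_similarity` and `attention_weights` and the toy values of $M$ and $z$ are assumptions made for this example.

```python
import math

def cosine_similarity(z, m):
    """d(z, m) = z m^T / (||z|| ||m||) for two equal-length vectors."""
    dot = sum(a * b for a, b in zip(z, m))
    norm_z = math.sqrt(sum(a * a for a in z))
    norm_m = math.sqrt(sum(b * b for b in m))
    return dot / (norm_z * norm_m)

def attention_weights(z, memory):
    """Softmax over the cosine similarities between z and every row of M."""
    sims = [cosine_similarity(z, m) for m in memory]
    shift = max(sims)  # subtracting the max keeps exp() numerically stable
    exps = [math.exp(s - shift) for s in sims]
    total = sum(exps)
    return [e / total for e in exps]

# Toy example (made-up values): N = 3 memory items, C = 2 dimensions.
M = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
z = [2.0, 0.0]
w = attention_weights(z, M)
print(w)  # approximately [0.665, 0.245, 0.090]
```

Because cosine similarity is bounded in $[-1, 1]$, the softmax can never become fully one-hot, which is one reason an extra sparsification step on $w$ is useful.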

Even with the memory structure above, some anomalous samples can still be reconstructed well, so $w$ is made sparse to restrict reconstruction further.

\[ \hat{w}_i = \begin{cases} w_i & \text{ if } w_i > \lambda \\ 0 & \text{ otherwise } \end{cases} \]
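A minimal sketch of this thresholding in plain Python follows. The helper names `hard_shrink` and `renormalize` and the example values are assumptions for illustration. The renormalization step is not stated in this note; it is included here on the assumption that the shrunk weights should sum to 1 again before they are used.

```python
def hard_shrink(w, lam):
    """Keep w_i only if it exceeds the threshold lam; otherwise set it to 0."""
    return [wi if wi > lam else 0.0 for wi in w]

def renormalize(w_hat):
    """Rescale the shrunk weights to sum to 1 (assumption, not from this note)."""
    total = sum(w_hat)
    if total == 0.0:
        return list(w_hat)  # nothing survived the threshold; leave as-is
    return [wi / total for wi in w_hat]

# Toy example (made-up values).
w = [0.6, 0.3, 0.1]
w_hat = hard_shrink(w, lam=0.2)
print(w_hat)               # [0.6, 0.3, 0.0]
print(renormalize(w_hat))  # approximately [0.667, 0.333, 0.0]
```

The hard threshold is not differentiable at $w_i = \lambda$, so a trainable implementation would likely need a smooth or piecewise-linear surrogate; the sketch only shows the forward behaviour.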

The loss is a weighted sum of the reconstruction error and a term that encourages $\hat{w}$ to be sparse: $$ L(\theta_e, \theta_d, M) = \frac{1}{T} \sum_{t=1}^{T}\left[R(x^t, \hat{x}^t) + \alpha E(\hat{w}^t)\right] $$

$$ R(x^t, \hat{x}^t) = \|x^t - \hat{x}^t\|_2^2 $$

$$ E(\hat{w}^t) = \sum_{i=1}^{N} -\hat{w}_i^t \log(\hat{w}_i^t) $$

In the paper, $\alpha = 0.0002$.
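To make the objective concrete, here is a small plain-Python sketch that evaluates it on a toy batch. The function names (`reconstruction_error`, `entropy`, `memae_loss`) and the sample values are hypothetical. The entropy is summed over the $N$ entries of $\hat{w}$, and entries equal to zero are skipped, following the convention $0 \log 0 = 0$.

```python
import math

def reconstruction_error(x, x_hat):
    """R(x, x_hat): squared L2 distance between input and reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat))

def entropy(w_hat):
    """E(w_hat): sum of -w_i * log(w_i), skipping zero entries."""
    return sum(-wi * math.log(wi) for wi in w_hat if wi > 0.0)

def memae_loss(batch, alpha=0.0002):
    """Average of R + alpha * E over a batch of (x, x_hat, w_hat) triples."""
    total = 0.0
    for x, x_hat, w_hat in batch:
        total += reconstruction_error(x, x_hat) + alpha * entropy(w_hat)
    return total / len(batch)

# Toy batch with T = 2 samples (made-up values).
batch = [
    ([1.0, 2.0], [1.0, 1.0], [0.5, 0.5]),  # R = 1, E = log 2
    ([0.0, 0.0], [0.0, 0.0], [1.0, 0.0]),  # R = 0, E = 0 (already one-hot)
]
loss = memae_loss(batch)
print(loss)  # approximately 0.50007
```

With $\alpha$ this small, the entropy term acts as a light regularizer; the reconstruction error dominates the loss.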

For images, experiments use MNIST and CIFAR-10.

For video, experiments use UCSD-Ped2, CUHK, and ShanghaiTech.
