FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

Jan 28, 2020 3 min read Semi-supervised Learning

Go to Project Site

1. どんなもの？

loss関数は確率的なaugmentation $\alpha$と$p_m$を用いて， $$ \sum_{b=1}^{\mu B}\left|p_{\mathrm{m}}\left(y | \alpha\left(u_{b}\right)\right)-p_{\mathrm{m}}\left(y | \alpha\left(u_{b}\right)\right)\right|_{2}^{2} $$
確率的なので，↑は0にはならないことに注意

$u_b$に対して，擬似的ラベル$q_b$を付与する $$ q_b = p_m(y|u_b) $$
unlabeld dataに対するloss関数は，指示関数 $\mathbb{I}$，しきい値 $\tau$を用いて $$ \frac{1}{\mu B} \sum_{b=1}^{\mu B} \mathbb{I}\left(\max \left(q_{b}\right) \geq \tau\right) \mathrm{H}\left(\hat{q}_{b}, q_{b}\right) $$ $$ \hat{q}_{b} = argmax(q_b) $$

labeled exmapleに対してはConsistency regularization (図の上側)
$\alpha$は弱いaugmentation (e.g. filpとか軽微なshift) $$ \ell_{s}=\frac{1}{B} \sum_{b=1}^{B} \mathrm{H}\left(p_{b}, p_{\mathrm{m}}\left(y | \alpha\left(x_{b}\right)\right)\right) $$
unlabeld exampleに対してはPseudo label(図の下側)
$\mathcal{A}$は強いaugmentation (AutoAugmentベースの手法：RandAugment，CTAugment) $$ \ell_{u}=\frac{1}{\mu B} \sum_{b=1}^{\mu B} \mathbb{I}\left(\max \left(q_{b}\right) \geq \tau\right) \mathrm{H}\left(\hat{q}_{b}, p_{\mathrm{m}}\left(y | \mathcal{A}\left(u_{b}\right)\right)\right) $$ $$ q_b = p_m(y | \alpha(u_b)) $$

weight decayが有効
AdamよりSGDが良い
learning scheduleにはcosine learning rate decayを使う $$ \eta \cos (\frac{7 \pi K}{16K} ) $$

Section 5は気が向いたら
単純な方法 + 少ラベルでこれほどの精度が出るのは驚き
Goodfellow曰く革命
The quiet semisupervised revolution continues https://t.co/FAY4v9aHbe
— Ian Goodfellow (@goodfellow_ian) January 22, 2020

David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Ku- rakin, Kihyuk Sohn, Han Zhang, and Colin Raffel. Remix- match: Semi-supervised learning with distribution matching and augmentation anchoring. In Eighth International Conference on Learning Representations, 2020.
Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, and Quoc V. Le. Randaugment: Practical automated data augmen- tation with a reduced search space. arXiv preprint arXiv:1909.13719, 2019.
David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, and Colin A Raffel. Mixmatch: A holistic approach to semi-supervised learning. In Advances in Neural Information Processing Systems 32. 2019.

My research interests are in computer vision, especially in anomaly detection and XAI.