Nrat: towards adversarial training with inherent label noise



Chen, Zhen, Wang, Fu, Mu, Ronghui, Xu, Peipei, Huang, Xiaowei ORCID: 0000-0001-6267-0366 and Ruan, Wenjie
(2024) Nrat: towards adversarial training with inherent label noise. Machine Learning. pp. 1-22.

Access the full-text of this item by clicking on the Open Access link.

Abstract

<jats:title>Abstract</jats:title><jats:p>Adversarial training (AT) has been widely recognized as the most effective defense approach against adversarial attacks on deep neural networks and it is formulated as a min-max optimization. Most AT algorithms are geared towards research-oriented datasets such as MNIST, CIFAR10, etc., where the labels are generally correct. However, noisy labels, e.g., mislabelling, are inevitable in real-world datasets. In this paper, we investigate AT with inherent label noise, where the training dataset itself contains mislabeled samples. We first empirically show that the performance of AT typically degrades as the label noise rate increases. Then, we propose a <jats:italic>Noisy-Robust Adversarial Training</jats:italic> (NRAT) algorithm, which leverages the recent advancements in learning with noisy labels to enhance the performance of AT in the presence of label noise. For experimental comparison, we consider two essential metrics in AT: (i) trade-off between natural and robust accuracy; (ii) robust overfitting. Our experiments show that NRAT’s performance is on par with, or better than, the state-of-the-art AT methods on both evaluation metrics. Our code is publicly available at: <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/TrustAI/NRAT">https://github.com/TrustAI/NRAT</jats:ext-link>.</jats:p>

Item Type: Article
Divisions: Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science
Depositing User: Symplectic Admin
Date Deposited: 24 Jan 2024 11:24
Last Modified: 15 Mar 2024 13:59
DOI: 10.1007/s10994-023-06437-3
Open Access URL: https://doi.org/10.1007/s10994-023-06437-3
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3177979