Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


An Orthogonal Classifier for Improving the Adversarial Robustness of Neural Networks

Cong Xu, Xiang Li, Min Yang

2021-05-19 · Adversarial Robustness · Adversarial Attack

Paper · PDF · Code (official)

Abstract

Neural networks are susceptible to artificially designed adversarial perturbations. Recent efforts have shown that imposing certain modifications on the classification layer can improve the robustness of neural networks. In this paper, we explicitly construct a dense orthogonal weight matrix whose entries all have the same magnitude, leading to a novel robust classifier. The proposed classifier avoids the undesired structural redundancy issue in previous work. Applying this classifier in standard training on clean data is sufficient to ensure high accuracy and good robustness of the model. Moreover, when extra adversarial samples are used, better robustness can be obtained with the help of a special worst-case loss. Experimental results show that our method is efficient and competitive with many state-of-the-art defensive approaches. Our code is available at \url{https://github.com/MTandHJ/roboc}.
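The key object in the abstract is a dense orthogonal weight matrix whose entries all share the same magnitude. A minimal sketch of one way to build such a matrix is Sylvester's Hadamard construction, shown below; this is an illustrative assumption for intuition, not necessarily the exact construction the paper uses, and the power-of-two restriction on the dimension is specific to this sketch.

```python
import numpy as np

def dense_orthogonal(n: int) -> np.ndarray:
    """Return an n x n orthogonal matrix whose entries all have
    magnitude 1/sqrt(n), built via Sylvester's Hadamard recursion.
    Illustrative only: requires n to be a power of two."""
    assert n > 0 and (n & (n - 1)) == 0, "n must be a power of two"
    H = np.array([[1.0]])
    while H.shape[0] < n:
        # Sylvester step: doubles the size, keeps entries in {+1, -1}.
        H = np.block([[H, H], [H, -H]])
    # Scale so that rows are orthonormal: H @ H.T == n * I before scaling.
    return H / np.sqrt(n)

W = dense_orthogonal(8)
# Rows are orthonormal, so W @ W.T is the identity.
print(np.allclose(W @ W.T, np.eye(8)))        # True
# Every entry has the same magnitude, 1/sqrt(8).
print(np.allclose(np.abs(W), 1 / np.sqrt(8)))  # True
```

Used as a fixed classification layer, such a matrix spreads every class prototype evenly across all features, which is the "no structural redundancy" property the abstract alludes to.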

Results

Task | Dataset | Metric | Value | Model
Adversarial Attack | CIFAR-10 | Attack: AutoAttack | 44.15 | Xu et al.
Adversarial Attack | CIFAR-10 | Attack: DeepFool | 51.31 | Xu et al.
Adversarial Attack | CIFAR-10 | Attack: PGD-20 | 78.68 | Xu et al.

Related Papers

Bridging Robustness and Generalization Against Word Substitution Attacks in NLP via the Growth Bound Matrix Approach (2025-07-14)
3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving (2025-07-14)
VIP: Visual Information Protection through Adversarial Attacks on Vision-Language Models (2025-07-11)
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible (2025-07-10)
ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models (2025-07-08)
Tail-aware Adversarial Attacks: A Distributional Approach to Efficient LLM Jailbreaking (2025-07-06)
Evaluating the Evaluators: Trust in Adversarial Robustness Tests (2025-07-04)
Rectifying Adversarial Sample with Low Entropy Prior for Test-Time Defense (2025-07-04)