X-Pruner: eXplainable Pruning for Vision Transformers
Yu, Lu, and Xiang, Wei (2023) X-Pruner: eXplainable Pruning for Vision Transformers. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. pp. 24355-24363. From: CVPR 2023: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 17-24 June 2023, Vancouver, Canada.
PDF (Published Version)
- Published Version
Restricted to Repository staff only |
Abstract
Recently vision transformer models have become prominent models for a range of tasks. These models, however, usually suffer from intensive computational costs and heavy memory requirements, making them impractical for deployment on edge platforms. Recent studies have proposed to prune transformers in an unexplainable manner, which overlook the relationship between internal units of the model and the target class, thereby leading to inferior performance. To alleviate this problem, we propose a novel explainable pruning framework dubbed X-Pruner, which is designed by considering the explainability of the pruning criterion. Specifically, to measure each prunable unit's contribution to predicting each target class, a novel explainability-aware mask is proposed and learned in an end-to-end manner. Then, to preserve the most informative units and learn the layer-wise pruning rate, we adaptively search the layer-wise threshold that differentiates between unpruned and pruned units based on their explainability-aware mask values. To verify and evaluate our method, we apply the X-Pruner on representative transformer models including the DeiT and Swin Transformer. Comprehensive simulation results demonstrate that the proposed X-Pruner outperforms the state-of-the-art black-box methods with significantly reduced computational costs and slight performance degradation. Code is available at https://github.com/vickyyu90/XPruner.
Item ID: | 80913 |
---|---|
Item Type: | Conference Item (Research - E1) |
ISBN: | 9798350301298 |
Keywords: | Explainable computer vision |
Copyright Information: | © 2023 IEEE |
Date Deposited: | 30 Jan 2024 00:40 |
FoR Codes: | 46 INFORMATION AND COMPUTING SCIENCES > 4603 Computer vision and multimedia computation > 460304 Computer vision @ 100% |
SEO Codes: | 22 INFORMATION AND COMMUNICATION SERVICES > 2204 Information systems, technologies and services > 220499 Information systems, technologies and services not elsewhere classified @ 100% |
More Statistics |