Paper
20 April 2023 How to make use of pretrained models in few-shot classification
Mingyu Fu, Peng Wang
Author Affiliations +
Proceedings Volume 12602, International Conference on Electronic Information Engineering and Computer Science (EIECS 2022); 1260217 (2023) https://doi.org/10.1117/12.2668153
Event: International Conference on Electronic Information Engineering and Computer Science (EIECS 2022), 2022, Changchun, China
Abstract
Few-shot learning(FSL) aims to generalize model to novel categoeries by few labelled samples, which is challenging for machine. Large-scaled pretrained models, especially vision transformers achieve excellent performances benefiting from numerous and diverse data. Researchers have exploited pretrained models in few-shot classification by simply updating the whole parameters and finetuning on few samples. In this paper, we explore two methods: vision prompt tuning and a reparameterization method called ‘scaling&&shift’ to leverage pretrained models in few-shot classification. Vision prompt tuning is for vision transformer only and we first evaluate the method in few-shot setting. ‘Scaling&&shift’ is originally applied in convolution neural networks(CNN). We extend it to vision transformer. The two methods are evaluated on standard benchmarks such as miniImageNet, CUB, CIFAR-FS, clipart and sketch. The results show that ‘scaling&&shift’ reaches the same level compared to updating the whole parameters. Vision prompt tuning is 0%~5% lower than updating the whole parameters over five datasets while it has quite smaller amount of parameters updated.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Mingyu Fu and Peng Wang "How to make use of pretrained models in few-shot classification", Proc. SPIE 12602, International Conference on Electronic Information Engineering and Computer Science (EIECS 2022), 1260217 (20 April 2023); https://doi.org/10.1117/12.2668153
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visual process modeling

Transformers

Data modeling

Performance modeling

Convolution

Lawrencium

Neurons

Back to Top