Image calibration using ensemble of transformer and CNN-based frameworks
7 June 2023
Anuj Kumar, Sankar Behera, Yamuna Prasad
Proceedings Volume 12701, Fifteenth International Conference on Machine Vision (ICMV 2022); 1270115 (2023) https://doi.org/10.1117/12.2680090
Event: Fifteenth International Conference on Machine Vision (ICMV 2022), 2022, Rome, Italy
Abstract
Wide field-of-view cameras introduce image distortion, and camera calibration is a fundamental step for correcting it in applications such as image undistortion, 3D reconstruction, and camera motion estimation. Image calibration estimates intrinsic camera parameters such as focal length and distortion, and the quality of the undistorted (enhanced) image depends on how accurately these parameters are estimated. Existing methods fall into two groups: checkerboard-based calibration, which requires manual interaction, and deep learning approaches. Most deep learning approaches are based on the convolutional neural network (CNN) framework, which fails to capture long-range dependencies in a distorted image. To overcome this problem, this paper proposes EnsembleNet, a fully automated method for inferring the focal length and distortion parameters. From a single input image, the proposed model extracts various contexts (local patches) using a Vision Transformer (ViT) and spatial features using various CNN-based models. The ensemble weights are learned with differential evolution (DE). Experiments show that the proposed EnsembleNet outperforms state-of-the-art deep learning-based models in terms of mean squared error.
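The abstract states that ensemble weights are learned with differential evolution but gives no details, so the following is only a minimal sketch of the general idea and not the authors' implementation. It assumes a classic DE/rand/1/bin scheme, a validation-set mean squared error as the objective, non-negative weights normalized to sum to one, and toy predictions from three hypothetical models; all function names, hyperparameters, and data here are illustrative assumptions.

```python
import random

def ensemble_mse(weights, preds, targets):
    """MSE of the weighted average of per-model predictions.

    Weights are normalized to sum to 1 (an assumption of this sketch).
    """
    total = sum(weights)
    k = len(weights)
    norm = [w / total for w in weights] if total > 0 else [1.0 / k] * k
    err = 0.0
    for i, y in enumerate(targets):
        y_hat = sum(w * p[i] for w, p in zip(norm, preds))
        err += (y_hat - y) ** 2
    return err / len(targets)

def differential_evolution(loss, dim, pop_size=20, f=0.8, cr=0.9,
                           generations=200, seed=0):
    """Minimal DE/rand/1/bin over the box [0, 1]^dim."""
    rng = random.Random(seed)
    # Seed the population with each single model and the uniform average,
    # so the greedy selection below can never end up worse than these.
    pop = [[1.0 if j == i else 0.0 for j in range(dim)] for i in range(dim)]
    pop.append([1.0 / dim] * dim)
    while len(pop) < max(pop_size, 4):
        pop.append([rng.random() for _ in range(dim)])
    n = len(pop)
    scores = [loss(ind) for ind in pop]
    for _ in range(generations):
        for i in range(n):
            # Pick three distinct individuals other than the current one.
            a, b, c = rng.sample([j for j in range(n) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j in range(dim):
                if rng.random() < cr or j == j_rand:
                    v = pop[a][j] + f * (pop[b][j] - pop[c][j])
                    v = min(1.0, max(0.0, v))  # clip back into [0, 1]
                else:
                    v = pop[i][j]
                trial.append(v)
            s = loss(trial)
            if s <= scores[i]:  # greedy one-to-one replacement
                pop[i], scores[i] = trial, s
    best = min(range(n), key=lambda i: scores[i])
    return pop[best], scores[best]

# Toy validation data (hypothetical): one scalar parameter per image,
# e.g. a focal length, predicted by three models with different biases.
targets = [1.0, 2.0, 3.0, 4.0, 5.0]
preds = [
    [t + 0.2 for t in targets],  # model A: overestimates slightly
    [t - 0.2 for t in targets],  # model B: underestimates slightly
    [t + 1.0 for t in targets],  # model C: large bias
]

def loss(w):
    return ensemble_mse(w, preds, targets)

best_weights, best_mse = differential_evolution(loss, dim=len(preds))
single_mses = [loss([1.0 if j == i else 0.0 for j in range(len(preds))])
               for i in range(len(preds))]
print("learned weights:", best_weights)
print("ensemble MSE:", best_mse, "single-model MSEs:", single_mses)
```

Because the initial population contains every single-model weighting and DE only replaces an individual with a trial that scores at least as well, the returned ensemble is never worse than the best single model on the data used for the search. Whether that advantage carries over to unseen images depends on the validation set, which this sketch does not address.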
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Anuj Kumar, Sankar Behera, and Yamuna Prasad "Image calibration using ensemble of transformer and CNN-based frameworks", Proc. SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), 1270115 (7 June 2023); https://doi.org/10.1117/12.2680090
KEYWORDS
Distortion
Visual process modeling
Data modeling
Cameras
Calibration
Transformers
Deep learning