| Issue |
ITM Web Conf.
Volume 82, 2026
International Conference on NextGen Engineering Technologies and Applications for Sustainable Development (ICNEXTS’25)
|
|
|---|---|---|
| Article Number | 03011 | |
| Number of page(s) | 4 | |
| Section | Information and Technology | |
| DOI | https://doi.org/10.1051/itmconf/20268203011 | |
| Published online | 04 February 2026 | |
Real-Time Multimodal Biometric Security with DIP-Based Preprocessing and Edge Deployment
Department of ECE, St. Joseph’s College of Engineering OMR, Chennai, Tamil Nadu, India
* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Biometric authentication is now crucial for secure digital access. However, unimodal methods still face issues like spoofing, background noise, and high cloud delays. Earlier multimodal models offered improved reliability but struggled with high computational demands and moderate accuracy levels (around 85-90%). This paper presents a new multimodal biometric authentication framework. It uses Deep Image Prior (DIP) for preprocessing, lightweight CNNs for feature extraction, and edge computing for real-time deployment. To boost performance, the model includes a Convolutional Block Attention Module (CBAM) and a Capsule Network layer. These components enhance the learning of unique features across fingerprint, iris, and facial types. The extracted features are combined using Fisher Vector with Gaussian Mixture Models (GMM) and classified through a quantized ResNet-101 backbone. Tests on the CASIA multimodal dataset, which includes over 10,000 samples, show that the proposed model reaches 96% accuracy. It surpasses current unimodal and multimodal systems and reduces latency by 40% on Raspberry Pi 4 and Jetson Nano platforms. The use of attentionguided and capsule layers sets this method apart from earlier models, providing a scalable solution that resists spoofing for banking, healthcare, defense, and IoT applications Index Terms—Biometric Authentication, Deep Image Prior, Multimodal Fusion, Edge Computing, Attention Mechanism, Capsule Networ
© The Authors, published by EDP Sciences, 2026
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

