Real-Time Multimodal Biometric Security with DIP-Based Preprocessing and Edge Deployment

Karthick R; Mohan B; Madhumitha R

doi:10.1051/itmconf/20268203011

Open Access

Issue		ITM Web Conf. Volume 82, 2026 International Conference on NextGen Engineering Technologies and Applications for Sustainable Development (ICNEXTS’25)


Article Number		03011
Number of page(s)		4
Section		Information and Technology
DOI		https://doi.org/10.1051/itmconf/20268203011
Published online		04 February 2026

ITM Web of Conferences 82, 03011 (2026)

Real-Time Multimodal Biometric Security with DIP-Based Preprocessing and Edge Deployment

Karthick R^*, Mohan B and Madhumitha R

Department of ECE, St. Joseph’s College of Engineering OMR, Chennai, Tamil Nadu, India

^* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract

Biometric authentication is now crucial for secure digital access. However, unimodal methods still face issues like spoofing, background noise, and high cloud delays. Earlier multimodal models offered improved reliability but struggled with high computational demands and moderate accuracy levels (around 85-90%). This paper presents a new multimodal biometric authentication framework. It uses Deep Image Prior (DIP) for preprocessing, lightweight CNNs for feature extraction, and edge computing for real-time deployment. To boost performance, the model includes a Convolutional Block Attention Module (CBAM) and a Capsule Network layer. These components enhance the learning of unique features across fingerprint, iris, and facial types. The extracted features are combined using Fisher Vector with Gaussian Mixture Models (GMM) and classified through a quantized ResNet-101 backbone. Tests on the CASIA multimodal dataset, which includes over 10,000 samples, show that the proposed model reaches 96% accuracy. It surpasses current unimodal and multimodal systems and reduces latency by 40% on Raspberry Pi 4 and Jetson Nano platforms. The use of attentionguided and capsule layers sets this method apart from earlier models, providing a scalable solution that resists spoofing for banking, healthcare, defense, and IoT applications Index Terms—Biometric Authentication, Deep Image Prior, Multimodal Fusion, Edge Computing, Attention Mechanism, Capsule Networ

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.