ForensicFlow: A Tri-Modal Adaptive Network for Robust Deepfake Detection

2511.14554v1 cs.CV, cs.CR, cs.LG 2025-11-20
Авторы:

Mohammad Romani

Abstract

Deepfakes generated by advanced GANs and autoencoders severely threaten information integrity and societal stability. Single-stream CNNs fail to capture multi-scale forgery artifacts across spatial, texture, and frequency domains, limiting robustness and generalization. We introduce the ForensicFlow, a tri-modal forensic framework that synergistically fuses RGB, texture, and frequency evidence for video Deepfake detection. The RGB branch (ConvNeXt-tiny) extracts global visual inconsistencies; the texture branch (Swin Transformer-tiny) detects fine-grained blending artifacts; the frequency branch (CNN + SE) identifies periodic spectral noise. Attention-based temporal pooling dynamically prioritizes high-evidence frames, while adaptive attention fusion balances branch contributions.Trained on Celeb-DF (v2) with Focal Loss, ForensicFlow achieves AUC 0.9752, F1-Score 0.9408, and accuracy 0.9208, outperforming single-stream baselines. Ablation validates branch synergy; Grad-CAM confirms forensic focus. This comprehensive feature fusion provides superior resilience against subtle forgeries.

Ссылки и действия

Связанные статьи

Innovative Deep Learning Architecture for Enhanced Altered Fingerprint Recogniti...

## Контекст Одной из наиболее сложных задач в сфере биометрической идентификации является распознавание измененных отпе...

2025-09-26

PRNU-Bench: A Novel Benchmark and Model for PRNU-Based Camera Identification

## Контекст В современной криминологии и цифровой безопасности наблюдается рост необходимости в эффективных методах иден...

2025-09-24