ForensicFlow: A Tri-Modal Adaptive Network for Robust Deepfake Detection

2511.14554v1 cs.CV, cs.CR, cs.LG 2025-11-20

Авторы:

Mohammad Romani

Abstract

Deepfakes generated by advanced GANs and autoencoders severely threaten information integrity and societal stability. Single-stream CNNs fail to capture multi-scale forgery artifacts across spatial, texture, and frequency domains, limiting robustness and generalization. We introduce the ForensicFlow, a tri-modal forensic framework that synergistically fuses RGB, texture, and frequency evidence for video Deepfake detection. The RGB branch (ConvNeXt-tiny) extracts global visual inconsistencies; the texture branch (Swin Transformer-tiny) detects fine-grained blending artifacts; the frequency branch (CNN + SE) identifies periodic spectral noise. Attention-based temporal pooling dynamically prioritizes high-evidence frames, while adaptive attention fusion balances branch contributions.Trained on Celeb-DF (v2) with Focal Loss, ForensicFlow achieves AUC 0.9752, F1-Score 0.9408, and accuracy 0.9208, outperforming single-stream baselines. Ablation validates branch synergy; Grad-CAM confirms forensic focus. This comprehensive feature fusion provides superior resilience against subtle forgeries.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

ForensicFlow: A Tri-Modal Adaptive Network for Robust Deepfake Detection

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Breaking the Stealth-Potency Trade-off in Clean-Image Backdoors with Generative ...

NoisePrints: Distortion-Free Watermarks for Authorship in Private Diffusion Mode...

MOLM: Mixture of LoRA Markers

Innovative Deep Learning Architecture for Enhanced Altered Fingerprint Recogniti...

PRNU-Bench: A Novel Benchmark and Model for PRNU-Based Camera Identification

Навигация