📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio

2025-11-17

Авторы:

Guangke Chen, Yuhui Wang, Shouling Ji, Xiapu Luo, Ting Wang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Modern text-to-speech (TTS) systems, particularly those built on Large Audio-Language Models (LALMs), generate high-fidelity speech that faithfully reproduces input text and mimics specified speaker identities. While prior misuse studies have focused on speaker impersonation, this work explores a distinct content-centric threat: exploiting TTS systems to produce speech containing harmful content. Realizing such threats poses two core challenges: (1) LALM safety alignment frequently rejects harmf...

ID: 2511.10913v1 cs.SD, cs.AI, cs.CR, cs.MM, eess.AS

arXiv PDF

📄 GraphToxin: Reconstructing Full Unlearned Graphs from Graph Unlearning

2025-11-17

Авторы:

Ying Song, Balaji Palanisamy

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Graph unlearning has emerged as a promising solution for complying with "the right to be forgotten" regulations by enabling the removal of sensitive information upon request. However, this solution is not foolproof. The involvement of multiple parties creates new attack surfaces, and residual traces of deleted data can still remain in the unlearned graph neural networks. These vulnerabilities can be exploited by attackers to recover the supposedly erased samples, thereby undermining the inherent...

ID: 2511.10936v1 cs.LG, cs.AI, cs.CR

arXiv PDF

📄 Automata-Based Steering of Large Language Models for Diverse Structured Generation

2025-11-17

Авторы:

Xiaokun Luan, Zeming Wei, Yihao Zhang, Meng Sun

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large language models (LLMs) are increasingly tasked with generating structured outputs. While structured generation methods ensure validity, they often lack output diversity, a critical limitation that we confirm in our preliminary study. We propose a novel method to enhance diversity in automaton-based structured generation. Our approach utilizes automata traversal history to steer LLMs towards novel structural patterns. Evaluations show our method significantly improves structural and content...

ID: 2511.11018v1 cs.CL, cs.AI, cs.CR, cs.LG, cs.SE

arXiv PDF

📄 Private-RAG: Answering Multiple Queries with LLMs while Keeping Your Data Private

2025-11-15

Авторы:

Ruihan Wu, Erchi Wang, Zhiyuan Zhang, Yu-Xiang Wang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by retrieving documents from an external corpus at inference time. When this corpus contains sensitive information, however, unprotected RAG systems are at risk of leaking private information. Prior work has introduced differential privacy (DP) guarantees for RAG, but only in single-query settings, which fall short of realistic usage. In this paper, we study the more practical multi-query setting and propose two DP-RAG al...

ID: 2511.07637v1 cs.LG, cs.AI, cs.CR

arXiv PDF

📄 A Self-Improving Architecture for Dynamic Safety in Large Language Models

2025-11-15

Авторы:

Tyler Slater

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Context: The integration of Large Language Models (LLMs) into core software systems is accelerating. However, existing software architecture patterns are static, while current safety assurance methods are not scalable, leaving systems vulnerable to novel adversarial threats. Objective: To design, implement, and evaluate a novel software architecture that enables an AI-driven system to autonomously and continuously adapt its own safety protocols at runtime. Method: We propose the Self-Improvi...

ID: 2511.07645v1 cs.SE, cs.AI, cs.CR

arXiv PDF

📄 FAIRPLAI: A Human-in-the-Loop Approach to Fair and Private Machine Learning

2025-11-15

Авторы:

David Sanchez, Holly Lopez, Michelle Buraczyk, Anantaa Kotal

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

As machine learning systems move from theory to practice, they are increasingly tasked with decisions that affect healthcare access, financial opportunities, hiring, and public services. In these contexts, accuracy is only one piece of the puzzle - models must also be fair to different groups, protect individual privacy, and remain accountable to stakeholders. Achieving all three is difficult: differential privacy can unintentionally worsen disparities, fairness interventions often rely on sensi...

ID: 2511.08702v1 cs.LG, cs.AI, cs.CR, cs.CY

arXiv PDF

📄 Enhancing DPSGD via Per-Sample Momentum and Low-Pass Filtering

2025-11-15

Авторы:

Xincheng Xu, Thilina Ranbaduge, Qing Wang, Thierry Rakotoarivelo, David Smith

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Differentially Private Stochastic Gradient Descent (DPSGD) is widely used to train deep neural networks with formal privacy guarantees. However, the addition of differential privacy (DP) often degrades model accuracy by introducing both noise and bias. Existing techniques typically address only one of these issues, as reducing DP noise can exacerbate clipping bias and vice-versa. In this paper, we propose a novel method, \emph{DP-PMLF}, which integrates per-sample momentum with a low-pass filter...

ID: 2511.08841v1 cs.LG, cs.AI, cs.CR

arXiv PDF

📄 3D Guard-Layer: An Integrated Agentic AI Safety System for Edge Artificial Intelligence

2025-11-15

Авторы:

Eren Kurshan, Yuan Xie, Paul Franzon

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

AI systems have found a wide range of real-world applications in recent years. The adoption of edge artificial intelligence, embedding AI directly into edge devices, is rapidly growing. Despite the implementation of guardrails and safety mechanisms, security vulnerabilities and challenges have become increasingly prevalent in this domain, posing a significant barrier to the practical deployment and safety of AI systems. This paper proposes an agentic AI safety architecture that leverages 3D to i...

ID: 2511.08842v1 cs.AR, cs.AI, cs.CR

arXiv PDF

📄 Differentially Private In-Context Learning with Nearest Neighbor Search

2025-11-08

Авторы:

Antti Koskela, Tejas Kulkarni, Laith Zumot

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Differentially private in-context learning (DP-ICL) has recently become an active research topic due to the inherent privacy risks of in-context learning. However, existing approaches overlook a critical component of modern large language model (LLM) pipelines: the similarity search used to retrieve relevant context data. In this work, we introduce a DP framework for in-context learning that integrates nearest neighbor search of relevant examples in a privacy-aware manner. Our method outperforms...

ID: 2511.04332v1 cs.LG, cs.AI, cs.CR

arXiv PDF

📄 Quantum-Enhanced Generative Models for Rare Event Prediction

2025-11-06

Авторы:

M. Z. Haider, M. U. Ghouri, Tayyaba Noreen, M. Salman

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Rare events such as financial crashes, climate extremes, and biological anomalies are notoriously difficult to model due to their scarcity and heavy-tailed distributions. Classical deep generative models often struggle to capture these rare occurrences, either collapsing low-probability modes or producing poorly calibrated uncertainty estimates. In this work, we propose the Quantum-Enhanced Generative Model (QEGM), a hybrid classical-quantum framework that integrates deep latent-variable models ...

ID: 2511.02042v1 cs.LG, cs.AI, cs.CR, cs.DC

arXiv PDF

Показано 31 - 40 из 162 записей