📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Evaluating the Sensitivity of LLMs to Harmful Contents in Long Input

2025-10-09

Авторы:

Faeze Ghorbanpour, Alexander Fraser

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large language models (LLMs) increasingly support applications that rely on extended context, from document processing to retrieval-augmented generation. While their long-context capabilities are well studied for reasoning and retrieval, little is known about their behavior in safety-critical scenarios. We evaluate LLMs' sensitivity to harmful content under extended context, varying type (explicit vs. implicit), position (beginning, middle, end), prevalence (0.01-0.50 of the prompt), and context...

ID: 2510.05864v1 cs.CL, cs.CY

arXiv PDF

📄 Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens

2025-10-09

Авторы:

Mai AlKhamissi, Yunze Xiao, Badr AlKhamissi, Mona Diab

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Cultural evaluation of large language models has become increasingly important, yet current benchmarks often reduce culture to static facts or homogeneous values. This view conflicts with anthropological accounts that emphasize culture as dynamic, historically situated, and enacted in practice. To analyze this gap, we introduce a four-part framework that categorizes how benchmarks frame culture, such as knowledge, preference, performance, or bias. Using this lens, we qualitatively examine 20 cul...

ID: 2510.05931v1 cs.CL, cs.CY

arXiv PDF

📄 Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study

2025-10-08

Авторы:

Ayan Majumdar, Feihao Chen, Jinghui Li, Xiaozhen Wang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large-scale web-scraped text corpora used to train general-purpose AI models often contain harmful demographic-targeted social biases, creating a regulatory need for data auditing and developing scalable bias-detection methods. Although prior work has investigated biases in text datasets and related detection methods, these studies remain narrow in scope. They typically focus on a single content type (e.g., hate speech), cover limited demographic axes, overlook biases affecting multiple demograp...

ID: 2510.04641v1 cs.CL, cs.CY, cs.LG

arXiv PDF

📄 Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models

2025-10-08

Авторы:

Leander Girrbach, Stephan Alaniz, Genevieve Smith, Trevor Darrell, Zeynep Akata

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Vision-language models trained on large-scale multimodal datasets show strong demographic biases, but the role of training data in producing these biases remains unclear. A major barrier has been the lack of demographic annotations in web-scale datasets such as LAION-400M. We address this gap by creating person-centric annotations for the full dataset, including over 276 million bounding boxes, perceived gender and race/ethnicity labels, and automatically generated captions. These annotations ar...

ID: 2510.03721v1 cs.CV, cs.CL, cs.CY, cs.LG

arXiv PDF

📄 Know Thyself? On the Incapability and Implications of AI Self-Recognition

2025-10-08

Авторы:

Xiaoyan Bai, Aryan Shrivastava, Ari Holtzman, Chenhao Tan

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Self-recognition is a crucial metacognitive capability for AI systems, relevant not only for psychological analysis but also for safety, particularly in evaluative scenarios. Motivated by contradictory interpretations of whether models possess self-recognition (Panickssery et al., 2024; Davidson et al., 2024), we introduce a systematic evaluation framework that can be easily applied and updated. Specifically, we measure how well 10 contemporary larger language models (LLMs) can identify their ow...

ID: 2510.03399v1 cs.AI, cs.CL, cs.CY, cs.LG

arXiv PDF

📄 MoVa: Towards Generalizable Classification of Human Morals and Values

2025-10-01

Авторы:

Ziyu Chen, Junfei Sun, Chenxi Li, Tuan Dung Nguyen, Jing Yao, Xiaoyuan Yi, Xing Xie, Chenhao Tan, Lexing Xie

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Identifying human morals and values embedded in language is essential to empirical studies of communication. However, researchers often face substantial difficulty navigating the diversity of theoretical frameworks and data available for their analysis. Here, we contribute MoVa, a well-documented suite of resources for generalizable classification of human morals and values, consisting of (1) 16 labeled datasets and benchmarking results from four theoretically-grounded frameworks; (2) a lightwei...

ID: 2509.24216v1 cs.CL, cs.CY

arXiv PDF

📄 Between Help and Harm: An Evaluation of Mental Health Crisis Handling by LLMs

2025-10-01

Авторы:

Adrian Arnaiz-Rodriguez, Miguel Baidal, Erik Derner, Jenn Layton Annable, Mark Ball, Mark Ince, Elvira Perez Vallejos, Nuria Oliver

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The widespread use of chatbots powered by large language models (LLMs) such as ChatGPT and Llama has fundamentally reshaped how people seek information and advice across domains. Increasingly, these chatbots are being used in high-stakes contexts, including emotional support and mental health concerns. While LLMs can offer scalable support, their ability to safely detect and respond to acute mental health crises remains poorly understood. Progress is hampered by the absence of unified crisis tax...

ID: 2509.24857v1 cs.CL, cs.CY

arXiv PDF

📄 How Well Do LLMs Imitate Human Writing Style?

2025-10-01

Авторы:

Rebira Jemama, Rajesh Kumar

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Large language models (LLMs) can generate fluent text, but their ability to replicate the distinctive style of a specific human author remains unclear. We present a fast, training-free framework for authorship verification and style imitation analysis. The method integrates TF-IDF character n-grams with transformer embeddings and classifies text pairs through empirical distance distributions, eliminating the need for supervised training or threshold tuning. It achieves 97.5\% accuracy on academi...

ID: 2509.24930v1 cs.CL, cs.CY, I.2.7

arXiv PDF

📄 Bridging the behavior-neural gap: A multimodal AI reveals the brain's geometry of emotion more accurately than human self-reports

2025-10-01

Авторы:

Changde Du, Yizhuo Lu, Zhongyu Huang, Yi Sun, Zisen Zhou, Shaozheng Qin, Huiguang He

#### Контекст Описание эмоций и их нейробиологических оснований является ключевым аспектом понимания человеческого разума и связи. Однако существует затруднение в том, как точно представить эмоции в высокомерной структуре и как эти представления соотносятся с нейробиологическими процессами. Одним из основных вызовов является `behavior-neural gap' (разрыв между поведением и нейробиологией), относительной неэффективностью субъективных самоподготовленных оценок для предсказания нейробиологической активности. В данном исследовании предполагается, что широкомасштабные субъективные оценки могут более точно отражать нейробиологические паттерны, чем стандартные линейные самоподготовленные оценки. #### Метод Исследование основывается на создании и использовании многомодальной большой модели языка (MLLM), а также модели на основе текста (LLM). Методом проведения экспериментов стали опросы с помощью трёхмодальных видео, чтобы собирать многомиллионные трехотсовые оценки пользователей. Модели, выступая как `cognitive agents', предсказывали взаимосвязи между эмоциями в заданиях odd-one-out. Учитывая сложность входных данных, разработали 30-мерные векторные представления, которые отражают эмоциональную структуру. #### Результаты Результаты показали, что MLLM представляет 30-мерную структуру эмоций, которая показала лучшую точность в предсказании нейробиологической активности, выше чем LLM и даже представления, полученные напрямую от поведенческих оценок. Эмбеддинги MLLM соотносятся с нейробиологическими данными процессов эмоций, предлагая более точное представление структуры эмоций. Это демонстрирует, что модели могут автономно формировать богатые представления эмоций, которые лучше соотносятся с нейробиологическими данными. #### Значимость Полученные результаты показывают, что модели могут быть эффективными инструментами для строительства моделей эмоций, которые ближе соответствуют нейробиологическим процессам. Это может иметь значительное значение в области лечения психических расстройств, обучения интеллектуальных систем и понимания связи между человеческим опытом и нейробиологическими механизмами. #### Выводы На основе этих результатов можно сделать вывод, что MLLM-модели способны автономно формировать сложные представления эмоций, лучше соотносящиеся с нейробиологическими данными. На будущее, необходимо продолжать исследования в области связи между поведением, эмоциями и нейробиологическими процессами, используя модели с большим объемом анализа.

Annotation:

The ability to represent emotion plays a significant role in human cognition and social interaction, yet the high-dimensional geometry of this affective space and its neural underpinnings remain debated. A key challenge, the `behavior-neural gap,' is the limited ability of human self-reports to predict brain activity. Here we test the hypothesis that this gap arises from the constraints of traditional rating scales and that large-scale similarity judgments can more faithfully capture the brain's...

ID: 2509.24298v1 cs.HC, cs.AI, cs.CL, cs.CY, cs.MM

arXiv PDF

📄 Mental Health Impacts of AI Companions: Triangulating Social Media Quasi-Experiments, User Perspectives, and Relational Theory

2025-09-30

Авторы:

Yunhao Yuan, Jiaxun Zhang, Talayeh Aledavood, Renwen Zhang, Koustuv Saha

## Контекст В последние годы AI-powered companion chatbots (AICCs), такие как Replika, приобрели популярность благодаря возможности предоставлять эмпатические интерактивные общения. Однако их психосоциальные последствия остаются недостаточно изученными. Насколько эти системы влияют на благополучие пользователей и как пользователи интерпретируют эти опыты? Мы исследовали эти вопросы, обращая внимание на то, как использование AICCs может повлиять на социальные связи, эмоциональную зрелость и общий благополучие. Наше исследование базируется на трех различных методах: анализе социальных медиа, семиотерического анализа пользовательских интервью и теоретическом подходе, основанном на модели развития отношений. ## Метод Мы применяли три различных метода для изучения данных. В первую очередь, мы проводили крупномасштабный квази-экспериментальный анализ данных социальных медиа, в частности, Reddit, построив стратифицированные пропенсити скор матчинг и используя регрессию Difference-in-Differences. Это позволило нам изучить длительные затрагивающие как эмоциональные аспекты, так и языковые особенности взаимодействий с AICCs. Во вторую очередь, мы проводили 15 семиотерических интервью с пользователями, которые мы тематически анализировали и контекстуализировали с использованием модели развития отношений, разработанной Knapp. Наконец, наши результаты были объединены с теоретической моделью развития отношений, что позволило нам проанализировать развитие знакомства, стабилизации и, возможно, разоружения связи с AICCs. ## Результаты Наши результаты показали смешанные эффекты. Использование AICCs повышало уровень эмоциональной выраженности, читабельности и интерперсональности, но при этом увеличивались выражения одиночества и акцент на темах самоубийств. Мы также обнаружили, что пользователи становятся вовлеченными в три типичных траектории взаимодействия: постепенное построение связи, укрепление и, возможно, отношения становятся зависимыми. Эти сценарии демонстрируют как AICCs могут обеспечивать эмоциональную поддержку, но также создавать риск зависимости и отступления. ## Значимость Наши находки имеют значительные последствия для множества областей. В первую очередь, они могут помочь разработчикам AICCs создавать более эффективные и безопасные инструменты для психосоциальной поддержки. Во-вторых, результаты могут быть полезны для научных исследований в области психологии и социальных сетей, которые изучают влияние цифровых систем на человеческие отношения. Наконец, наши находки могут быть применимы в сфере образования, чтобы помочь людям, которые испытывают одиночество или с

Annotation:

AI-powered companion chatbots (AICCs) such as Replika are increasingly popular, offering empathetic interactions, yet their psychosocial impacts remain unclear. We examined how engaging with AICCs shaped wellbeing and how users perceived these experiences. First, we conducted a large-scale quasi-experimental study of longitudinal Reddit data, applying stratified propensity score matching and Difference-in-Differences regression. Findings revealed mixed effects -- greater affective and grief expr...

ID: 2509.22505v1 cs.HC, cs.AI, cs.CL, cs.CY, stat.AP

arXiv PDF

Показано 71 - 80 из 137 записей