From keywords to semantics: Perceptions of large language models in data discovery

2510.01473v1 cs.HC, cs.AI 2025-10-04

Авторы:

Maura E Halstead, Mark A. Green, Caroline Jay, Richard Kingston, David Topping, Alexander Singleton

Abstract

Current approaches to data discovery match keywords between metadata and queries. This matching requires researchers to know the exact wording that other researchers previously used, creating a challenging process that could lead to missing relevant data. Large Language Models (LLMs) could enhance data discovery by removing this requirement and allowing researchers to ask questions with natural language. However, we do not currently know if researchers would accept LLMs for data discovery. Using a human-centered artificial intelligence (HCAI) focus, we ran focus groups (N = 27) to understand researchers' perspectives towards LLMs for data discovery. Our conceptual model shows that the potential benefits are not enough for researchers to use LLMs instead of current technology. Barriers prevent researchers from fully accepting LLMs, but features around transparency could overcome them. Using our model will allow developers to incorporate features that result in an increased acceptance of LLMs for data discovery.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

From keywords to semantics: Perceptions of large language models in data discovery

Авторы:

Abstract

Ссылки и действия

Связанные статьи

From Symptoms to Systems: An Expert-Guided Approach to Understanding Risks of Ge...

Proactive Agentic Whiteboards: Enhancing Diagrammatic Learning

Young children's anthropomorphism of an AI chatbot: Brain activation and the rol...

In Silico Development of Psychometric Scales: Feasibility of Representative Popu...

Significant Other AI: Identity, Memory, and Emotional Regulation as Long-Term Re...

Навигация