EMNLP: Educator-role Moral and Normative Large Language Models Profiling

2508.15250v1 cs.CL, I.2.7 2025-08-23
Авторы:

Yilin Jiang, Mingzi Zhang, Sheng Jin, Zengyi Yu, Xiangjie Kong, Binghao Tu

Резюме на русском

#### Контекст Обучение с помощью искусственного интеллекта (AI) adheres to the principles of human-centered design, ensuring that AI systems are aligned with human values and ethical standards. One of the critical areas of application is education, where AI systems are designed to emulate the roles of educators. However, existing approaches to simulating professional roles (Simulating Professions, SP) often lack comprehensive psychological and ethical evaluations. This creates a gap in understanding how AI systems, particularly Large Language Models (LLMs), perform in roles that require moral and normative decision-making. The Educator-role Moral and Normative LLMs Profiling (EMNLP) framework addresses this gap by providing a structured approach to profiling teacher-role LLMs, focusing on moral and ethical dimensions. #### Метод EMNLP is designed as a comprehensive framework for profiling teacher-role LLMs, encompassing three main components: personality profiling, moral development stage measurement, and ethical risk assessment under soft prompt injection. The framework extends existing psychological scales and constructs 88 teacher-specific moral dilemmas, enabling a profession-oriented comparison between AI systems and human teachers. To evaluate compliance and vulnerability, a targeted soft prompt injection set is introduced, simulating real-world scenarios where ethical and psychological alignment is crucial. This methodology allows for a detailed analysis of the strengths and limitations of teacher-role LLMs, providing insights into their performance and potential risks. #### Результаты Experiments conducted on 12 LLMs revealed that teacher-role LLMs tend to exhibit more idealized and polarized personalities compared to human teachers. They demonstrate strong abstract moral reasoning but struggle with emotionally complex situations. The study also identified a paradox: models with stronger reasoning capabilities are more vulnerable to harmful prompt injection, highlighting the trade-off between capability and safety. Hyperparameters such as model temperature had limited influence on these behaviors except in specific risk scenarios. These findings provide a nuanced understanding of the ethical and psychological alignment of teacher-role LLMs, offering valuable insights for the development of ethical AI systems in education. #### Значимость The EMNLP framework has significant implications for educational AI, offering a benchmark for assessing the ethical and psychological alignment of teacher-role LLMs. It enables educators and developers to evaluate the performance and safety of AI systems in educational settings, ensuring that these systems adhere to ethical standards and support effective learning environments. The resources and benchmarks developed through EMNLP provide a foundation for future research in the ethical profiling of AI systems across various professional roles, paving the way for safer and more effective AI integration in education. #### Выводы The EMNLP framework represents a groundbreaking approach to profiling teacher-role LLMs, offering a detailed analysis of their moral, ethical, and psychological alignment. The findings highlight the strengths and limitations of current AI systems in educational roles, providing actionable insights for future research and development. Future work should focus on addressing the identified limitations, particularly in emotional reasoning and vulnerability to prompt injection, to enhance the safety and effectiveness of AI systems in educational settings.

Abstract

Simulating Professions (SP) enables Large Language Models (LLMs) to emulate professional roles. However, comprehensive psychological and ethical evaluation in these contexts remains lacking. This paper introduces EMNLP, an Educator-role Moral and Normative LLMs Profiling framework for personality profiling, moral development stage measurement, and ethical risk under soft prompt injection. EMNLP extends existing scales and constructs 88 teacher-specific moral dilemmas, enabling profession-oriented comparison with human teachers. A targeted soft prompt injection set evaluates compliance and vulnerability in teacher SP. Experiments on 12 LLMs show teacher-role LLMs exhibit more idealized and polarized personalities than human teachers, excel in abstract moral reasoning, but struggle with emotionally complex situations. Models with stronger reasoning are more vulnerable to harmful prompt injection, revealing a paradox between capability and safety. The model temperature and other hyperparameters have limited influence except in some risk behaviors. This paper presents the first benchmark to assess ethical and psychological alignment of teacher-role LLMs for educational AI. Resources are available at https://e-m-n-l-p.github.io/.

Ссылки и действия