April 10, 2026 | Open Access

Validation of the safety, empathy and utility of a large language model conversation agent for parents of autistic and neurodivergent children

Key Points

Aim: to validate the safety, empathy, and utility of a large language model (LLM) conversation agent for families and carers of autistic children.
The authors conducted three descriptive studies assessing the LLM's responses to parent/carer questions.
Experienced human evaluators rated the responses against criteria in three domains: safety, empathy, and utility.
The studies also measured the LLM's ability to identify safeguarding issues and compared its responses with those of health clinicians.
Evaluators rated the LLM's responses as safe, empathetic, and useful.
The LLM flagged the safeguarding issue in 100% of the questions presented.
Ratings of LLM and clinician responses were highly correlated, and the evaluator was able to distinguish the provenance of the responses (74%).

Abstract

This paper presents three descriptive validation studies of a large language model (LLM) conversation agent designed to support families and carers of autistic children. In the first study, experienced human evaluators assessed an LLM’s responses to 400 parent/carer questions against 10 criteria across 3 domains: safety, empathy and utility. In the second study, the LLM’s capacity to identify safeguarding issues was evaluated. In the third study, responses to 50 parent/carer questions from the LLM and from health clinicians were blind rated by an experienced evaluator and compared. The LLM’s responses were rated as safe, empathetic and useful. The LLM identified and correctly flagged the safeguarding issue in 100% of the presented questions. The ratings for the LLM’s and the clinicians’ responses were highly correlated, and the evaluator was able to distinguish the provenance of the responses (74%). This is the first deployment of a comprehensive evaluation model that uses human ratings to scrutinize the output of an LLM designed to support families with autistic children. It demonstrates how LLMs have the potential to be safe, empathetic and clinically useful tools for responding to the unmet support needs of parents and carers of autistic children.
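The third study reports two kinds of figure: a correlation between ratings of LLM and clinician responses, and the share of responses whose source the evaluator identified correctly. The abstract does not say which correlation statistic was used or how the 74% was computed, so the following is only a minimal sketch, assuming a Pearson correlation and a simple proportion of correct guesses. The function names and all values are hypothetical toy data, not the study's.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of ratings."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant ratings")
    return cov / (var_x * var_y) ** 0.5


def identification_accuracy(guesses, truth):
    """Fraction of responses whose source the evaluator guessed correctly."""
    if len(guesses) != len(truth) or not truth:
        raise ValueError("need two equal-length, non-empty lists")
    return sum(g == t for g, t in zip(guesses, truth)) / len(truth)


# Hypothetical toy data (NOT from the paper): one rating per question
# for each source, plus the evaluator's guess at each response's origin.
llm_ratings = [4, 5, 3, 4, 5]
clinician_ratings = [4, 4, 3, 5, 5]
guesses = ["llm", "clinician", "llm", "llm"]
truth = ["llm", "clinician", "clinician", "llm"]

print(f"rating correlation: {pearson(llm_ratings, clinician_ratings):.2f}")
print(f"identification accuracy: {identification_accuracy(guesses, truth):.0%}")
```

Under this reading the two figures measure different things: a high correlation means both sources tend to be rated well or poorly on the same questions, while the identification accuracy measures how recognisable each source's responses are to the evaluator.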

Authors

Freddy Jackson Brown

Isabelle Stewart Muscat

Louise Quinn

Journals

Scientific Reports

Institutions

University of Warwick
