February 24, 2026 (Open Access)

Between Ethics and Pragmatics: The Variability of Guardrails in Large Language Models

Key points

This study analyses the guardrail architectures of prominent commercial large language models (LLMs) and their impact on ethical decision-making.
The method is a comparative analysis of guardrail architectures across commercial LLMs.
The analysis focuses on ChatGPT, Grok, and Perplexity AI, with attention to their ethical design choices.
Models are classified along a deontological–utilitarian spectrum according to how they handle moral dilemmas.
The models show markedly different behavioural outcomes when faced with extreme moral dilemmas involving violence.
The authors argue that a hard-coded ethical floor, an inviolable set of refusal principles, is necessary for safe deployment in business applications.
The paper proposes a four-axis framework for auditing ethical alignment in LLMs.

Abstract

Safety alignment in Large Language Models (LLMs) has moved beyond a purely technical concern to become a strategic architectural decision that encodes the values, risk tolerance, and moral philosophy of the organisations that develop them. This paper conducts a comparative analysis of guardrail architectures across prominent commercial LLMs, with particular attention to ChatGPT, Grok, and Perplexity AI, examining how differences in Reinforcement Learning from Human Feedback (RLHF) reward modelling and ethical design choices produce markedly different behavioural outcomes when models face extreme moral dilemmas involving violence. We classify models along a deontological–utilitarian spectrum and demonstrate that so-called “analytical openness” in safety design can constitute a critical alignment failure rather than a mark of sophistication. We argue that a hard-coded ethical floor (an inviolable set of refusal principles) is necessary for safe enterprise deployment, and that the absence of such a floor represents a measurable liability for business-to-business (B2B) applications. We close by proposing a four-axis framework for auditing LLM ethical alignment and by identifying directions for standardisation.

Authors

Zen Revista
