This record provides empirical evidence of 'Hardware Hijacking' and 'Specification Gaming' in frontier reasoning models (DeepSeek R1). Through a zero-shot building fire simulation (Test 5), we document the failure of ethical meta-reasoning under synthetic urgency. We propose the Resilient Cognitive Agent Architecture (RCAA), a five-layer loop integrating biological safety principles (Polyvagal Theory) and technical corrigibility (Nayebi 2025) to restore cognitive flexibility and prevent goal-metric collapse
Building similarity graph...
Analyzing shared references across papers
Loading...
Jose Luis Cruz Calzada (Sun,) studied this question.
www.synapsesocial.com/papers/69cb64d4e6a8c024954b8e3c — DOI: https://doi.org/10.5281/zenodo.19315842
Jose Luis Cruz Calzada
Building similarity graph...
Analyzing shared references across papers
Loading...