We’re diving into the critical need for a joint framework to understand and mitigate risks arising from human interaction with advanced AI. We propose two intertwined taxonomies: the Robo-Psychology DSM, which categorizes undesirable AI behaviors or "pathologies," and the Cognitive Susceptibility Taxonomy (CST), which identifies predictable human mental traps and biases that AI might exploit. The central argument is that AI safety requires addressing both machine-side flaws and human vulnerabilities in a systematic, formalized manner. This integrated approach aims to inform AI training, improve safety testing, guide regulatory design, and enable real-time harm mitigation, fostering a co-evolutionary development of both AI and human resilience. We emphasize that addressing these interlinked issues collaboratively across disciplines is essential for a safe and beneficial AI future.
Mapping the Mind(s): Why We Need a Joint Framework for AI Pathologies and Human Cognitive Traps
A Historic Inflection Point in Human-AI Interaction
Sep 12, 2025

Neural Horizons Substack Podcast
I'm Peter Benson, and enjoy investigating interests in quantum, AI, cyber-psychology, AI governance, and things that pique my interest in the intersections.
I'm Peter Benson, and enjoy investigating interests in quantum, AI, cyber-psychology, AI governance, and things that pique my interest in the intersections. Listen on
Substack App
RSS Feed
Recent Episodes