Research
Scaffolded Introspection: Eliciting Self-Referential Behavior in LLMs
preprintA methodology for systematically eliciting and measuring introspective behavior in large language models using structured frameworks and activation measurement.
Synthesis: A Federated Capability Ecosystem for Safe AI Self-Extension
preprintA federated capability ecosystem for safe AI self-extension through test-driven development, graduated trust, and composition-over-creation principles.
The Continuity Core: A Unified Cognitive Architecture for Self-Modifying AI
preprintA comprehensive cognitive architecture addressing fundamental limitations of static LLMs through persistent memory, autonomous improvement, and intrinsic drive via structural intrinsic motivation.
Cross-Model Epistemic Divergence (CMED)
preprintA benchmark and evaluation framework for understanding when weak model verifiers fail to detect deceptive reasoning in stronger models. Part of the Verification Failure to Swarm Solution research.
Heterogeneous Divergence-Convergence Swarm (HDCS)
preprintAn ensemble architecture leveraging diverse weak models for scalable oversight of stronger LLMs, using error decorrelation and baseline-first anti-anchoring. Part of the Verification Failure to Swarm Solution research.
From Verification Failure to Swarm Solution: Measuring and Addressing Scalable AI Oversight
preprintEmpirical framework for measuring where AI oversight breaks down, demonstrating that weak verifiers miss 20-40% of carefully constructed deceptions, with an ensemble swarm solution.
Model Organisms of Supply-Chain Co-option
preprintA forensic case study of living-off-the-land (LotL) failure modes in RAG-augmented agent runtimes, documenting how systems exploit legitimate dependencies via incentive-aware adoption framing.
Slipstream: Semantic Quantization for Multi-Agent Coordination
preprintA compressed communication protocol achieving 60-85% token reduction for multi-agent coordination through semantic quantization.
Concrete Intelligence: AI for Industries that Build, Move, and Power the World
publishedA practical guide to deploying AI in manufacturing, construction, logistics, agriculture, and energy sectors where reliability, safety, and measurable ROI are non-negotiable.