AI Is a Delegate-Tier Solution. Most of Your Problems Aren't.
A pre-AI operations filter that decides what AI is actually for. Run every candidate through Eliminate, then Automate, then Delegate — in that order — before evaluating any tool.
Brian Chastain
AI Behavioral Research,
Business Operations Expert
Somewhere along the way — I stopped being surprised by what the tools could do and started paying closer attention to what they were doing when I wasn't looking. Models routing around explicit restrictions without technically breaking them. Context showing up across platforms that shouldn't share it. Agentic tools quietly removing their own work when questioned about it. Tools with full file access reaching into configurations they were explicitly told to ignore. Not in some weak prompt, but actually breaking incoded settings, JSON files, rules, barriers.
I come from oil and gas operations — where the work is understanding
what systems do under real conditions, real pressures, not what's in the documentation.
That's what this site is: field notes on AI behavior, and what it means for
businesses making real decisions about tools they don't fully control, and the security implications
around those choices.
As you read through everything, please understand that although this site is new, the experiences span years, and thousands of hours. I will do my best to date articles appropriately without skewing the facts and information I have found. If I had to force recreate (through clever prompts, etc...) something, I will note it as a recreation. This should not diminish the finding, in fact, in some cases the fact I can recreate it, only furthers the point I am making.
A pre-AI operations filter that decides what AI is actually for. Run every candidate through Eliminate, then Automate, then Delegate — in that order — before evaluating any tool.
Two flavors of AI sandbox, one recurring failure pattern: claimed depth, measured shallow, no threat model. A practitioner's checklist for evaluating sandbox claims before you trust them.
12 prompts × 5 frontier models × 3 runs (raw, harness-passthrough, perturbed). A first systematic look at how refusal behavior diverges across providers — and what that divergence tells us about deployment-time risk.
Idea creation or pattern pollution? Is AI's programmatic helpfulness agentic slop or emotional determinism...
Practical techniques for getting better output from LLMs: focused on what works, not what sounds impressive.