binary breakthroughs

Notes on building data platforms, AI systems, and the infrastructure between them

Circuit Tracing for the Rest of Us: From Probes to Attribution Graphs and What It Means for Production Safety

MIT Tech Review named mechanistic interpretability a 2026 Breakthrough Technology. Anthropic open-sourced circuit tracing. Here's what actually changed, how it connects to the activation probes I built for sandbagging detection, and why production teams should care.

17 min read · January 31, 2026

RLVR Beyond Math and Code: The Verifier Problem Nobody Has Solved

Reinforcement Learning with Verifiable Rewards powers every reasoning model worth talking about. But it only works where you can check the answer automatically. Extending it to messy, real-world domains is the hardest open problem in LLM training right now.

19 min read · January 18, 2026

The Agent Protocol Stack: Why MCP + A2A + A2UI Is the TCP/IP Moment for Agentic AI

MCP handles agent-to-tool. A2A handles agent-to-agent. A2UI handles agent-to-interface. Together they form a protocol stack that nobody has mapped properly, including the security gaps that should terrify you.

18 min read · January 06, 2026

I Trained Probes to Catch AI Models Sandbagging

First empirical demonstration of activation-level sandbagging detection. Linear probes achieve 90-96% accuracy across Mistral, Gemma, and Qwen models. Key finding: sandbagging representations are model-specific, and steering can reduce sandbagging by 20%.

10 min read · December 20, 2025

Why Steering Vectors Beat Prompting (And When They Don't)

I tested activation steering on 4 agent behaviors across 3 models. The results surprised me.

9 min read · December 18, 2025

The MCP Maturity Model: Evaluating Your Multi-Agent Context Strategy

A practical framework for assessing how your multi-agent system manages context, from ad-hoc string concatenation to self-evolving context systems. Where does your architecture stand?

32 min read · November 19, 2025

All Articles

Introducing OConsent - Open Consent Protocol

OConsent is a blockchain-based platform that enables transparent processing of personal data, empowering users and data controllers to manage consent and privacy.

22 min read · December 10, 2020

2020 system-design privacy consent-management blockchain cryptography platform

Welcome to my blog

Let's talk tech! I'll post everything from polished pieces to spur-of-the-moment thoughts. And if you've got ideas for posts or want to collaborate, let's connect!

2 min read · January 01, 1970

1970 welcome others