about
blog
now
publications
projects
cv
bets
more
repositories reading list contact
ARTEMIS UPIR AI Metacognition SMPP Gateway ISO8583
symmetry

Interpretability

an archive of posts in this category

Dec 20, 2025	I Trained Probes to Catch AI Models Sandbagging

Subhadip Mitra

Technical Leader, Inventor, and Researcher building the future of Data & Applied AI.
Leading Google Cloud's D&A practice across Southeast Asia.

contact@subhadipmitra.com

Explore

About
What I'm Doing Now
Blog
Publications
Repositories

Connect

Contact
Email
Schedule a Call
LinkedIn
GitHub

© 2026 Subhadip Mitra. Some Rights Reserved. Last updated: January 10, 2026. Privacy · Licenses · Sitemap · RSS · LLMs