Subhadip
Mitra
Toggle navigation
about
blog
now
publications
projects
cv
more
repositories
archive
contact
licenses
privacy
docs: UPIR
docs: AI Metacognition
ctrl k
activation-probing
an archive of posts with this tag
Dec 20, 2025
I Trained Probes to Catch AI Models Sandbagging