blog - page 5 | Subhadip Mitra

All Articles

Making LLMs Faster: My Deep Dive into Speculative Decoding

A deep dive into implementing speculative decoding from scratch, with benchmarks on GPT-2 and extensions to diffusion models.

13 min read · March 20, 2025

2025 machine-learning llm inference optimization AI Infrastructure Research
Engineering Autonomous Multi-Agent Systems - A Technical Deep Dive into Telecom Customer Service

Dive into the world of autonomous AI agents with practical implementations, code examples, and real-world scenarios. Learn how to build intelligent systems with advanced memory management, dynamic prompt evolution, and sophisticated monitoring capabilities in telecom customer service.

52 min read · January 05, 2025

2025 genai architecture casestudy system-design architecture genai casetudy system-design
Why I Built a Modern Java SMPP Library in 2025

The story behind smpp-core - a clean-room Java 21 implementation of the SMPP protocol. Why I replaced Cloudhopper, what went into it, and actual benchmark numbers.

10 min read · January 03, 2025

2025 java smpp telecom open-source performance java open-source telecom
Engineering Multi-Agent Systems - A Retail Banking Case Study

Explore a detailed technical implementation of a multi-agent system for retail banking credit assessment. Learn about agent architecture, distributed systems patterns, error handling, compliance requirements, and performance optimization through actual code examples and system diagrams. Ideal for software architects and engineers building scalable financial systems.

36 min read · December 28, 2024

2024 architecture casestudy architecture casetudy
ETLC 2.0 - Building Context-Aware Data Pipelines

Think your data pipelines could do more than just process information? ETLC 2.0 takes data engineering to the next level with Adaptive Context, Contextual Joins, and a scalable Context Store. It's not just about moving data—it's about making it intelligent. Ready to unlock the future of data pipelines? Read on.

10 min read · December 07, 2024

2024 platform genai etlc platform genai

binary breakthroughs

Notes on building data platforms, AI systems, and the infrastructure between them

Circuit Tracing for the Rest of Us: From Probes to Attribution Graphs and What It Means for Production Safety

RLVR Beyond Math and Code: The Verifier Problem Nobody Has Solved

The Agent Protocol Stack: Why MCP + A2A + A2UI Is the TCP/IP Moment for Agentic AI

I Trained Probes to Catch AI Models Sandbagging

Why Steering Vectors Beat Prompting (And When They Don't)

The MCP Maturity Model: Evaluating Your Multi-Agent Context Strategy

All Articles

Making LLMs Faster: My Deep Dive into Speculative Decoding

Engineering Autonomous Multi-Agent Systems - A Technical Deep Dive into Telecom Customer Service

Why I Built a Modern Java SMPP Library in 2025

Engineering Multi-Agent Systems - A Retail Banking Case Study

ETLC 2.0 - Building Context-Aware Data Pipelines