Welcome to Gen-AI

👋 I’m Sai Boorlagadda, a Principal Engineer and ASF Member building agentic platforms. I write to fix the impedance mismatch between AI research and software engineering—providing the missing context and systems thinking that builders need to turn papers into production. More about me →

How LLMs Read?

Everyone talks about the Neural Network, but the Tokenizer is the unsung hero of LLMs. This post explains what a Tokenizer actually does, why we use Byte Pair Encoding (BPE), and how these tokens bridge the gap between rigid integers and meaningful vector embeddings in models like GPT-4.

January 16, 2026 Â· Sai

The Thesis – Why Dictation is the New Interface

High-bandwidth input is the bottleneck of modern computing. Voice agents are for delegation; Voice dictation is for creation. This post explores why we need ‘Agentic Dictation’ to match the speed of our thoughts.

December 29, 2025 Â· Sai

Deep Dive: Keyword Search

Conventional keyword search matches user query words to document words using an inverted index data structure for efficient matching and ranking by relevancy.

Mar 15, 2024 → Updated: Jan 22, 2026 Â· Sai

Building RAG: All things retrieval

Retrieval is the backbone of RAG. We explore the critical steps often missed by developers: proper chunking strategies, the ‘Librarian’ analogy for vector vs. keyword search, and solving the math problem of Hybrid Search using Reciprocal Rank Fusion (RRF).

Mar 2, 2024 → Updated: Jan 21, 2026 Â· Sai

Revolutionizing Question-and-Answer Systems

LLMs revolutionize question-and-answer systems with exceptional language understanding and creative writing skills. Lossy compression during training may make retrieving information challenging. Leveraging LLMs’ language expertise transforms building question-and-answer systems into reading comprehension using their ability to comprehend text, forming the basis for RAG systems that shift question answering to efficient knowledge base searches.

Feb 25, 2024 → Updated: Jan 20, 2026 Â· Sai