November 2023 – C4: Container, Code, Cloud & Context

Building Production RAG Applications with LangChain: From Document Ingestion to Conversational AI

Posted on November 28, 2023 by Nithin Mohan TK 13 min read

Introduction: LangChain has emerged as the dominant framework for building production Retrieval-Augmented Generation (RAG) applications, providing abstractions for document loading, text splitting, embedding, vector storage, and retrieval chains. By late 2023, LangChain reached production maturity with improved stability, better documentation, and enterprise-ready features. After deploying LangChain-based RAG systems across multiple organizations, I’ve found that its […]

Read more →

.NET 8 and C# 12: A Deep Dive into Native AOT, Primary Constructors, and Blazor United

Posted on November 20, 2023 by Nithin Mohan TK 9 min read

Introduction: .NET 8 represents a landmark release in Microsoft’s development platform evolution, bringing Native AOT to mainstream scenarios, unifying Blazor’s rendering models, and introducing C# 12’s powerful new features. Released in November 2023, this Long-Term Support version delivers significant performance improvements, reduced memory footprint, and enhanced developer productivity. After migrating several enterprise applications to .NET […]

Read more →

OpenAI Assistants API: Building Stateful AI Agents with Code Interpreter and File Search

Posted on November 15, 2023 by Nithin Mohan TK 8 min read

Introduction: OpenAI’s Assistants API, launched at DevDay 2023, represents a significant evolution in how developers build AI-powered applications. Unlike the stateless Chat Completions API, Assistants provides a managed, stateful runtime for building sophisticated AI agents with built-in tools like Code Interpreter and File Search. The API handles conversation threading, file management, and tool execution, allowing […]

Read more →

GPT-4 Turbo and the OpenAI Assistants API: Building Production Conversational AI Systems

Posted on November 15, 2023 by Nithin Mohan TK 12 min read

Introduction: OpenAI’s DevDay 2023 marked a pivotal moment in AI development with the announcement of GPT-4 Turbo and the Assistants API. These releases fundamentally changed how developers build AI-powered applications, offering 128K context windows, native JSON mode, improved function calling, and persistent conversation threads. After integrating these capabilities into production systems, I’ve found that the […]

Read more →

Harnessing AWS CDK for Python: Streamlining Infrastructure as Code

Posted on November 11, 2023 by Nithin Mohan TK 6 min read

After two decades of managing cloud infrastructure across enterprises of all sizes, I’ve witnessed the evolution of Infrastructure as Code from simple shell scripts to sophisticated declarative frameworks. AWS Cloud Development Kit (CDK) represents a paradigm shift that fundamentally changes how we think about infrastructure provisioning. Rather than wrestling with YAML or JSON templates, CDK […]

Read more →

LLM Observability: Tracing, Cost Tracking, and Quality Monitoring for Production AI

Posted on November 10, 2023 by Nithin Mohan TK 11 min read

Introduction: You can’t improve what you can’t measure. LLM applications are notoriously difficult to debug—prompts are opaque, responses are non-deterministic, and failures often manifest as subtle quality degradation rather than crashes. Observability gives you visibility into every LLM call: what prompts were sent, what responses came back, how long it took, how much it cost, […]

Read more →

Searching in

Month: November 2023

Building Production RAG Applications with LangChain: From Document Ingestion to Conversational AI

.NET 8 and C# 12: A Deep Dive into Native AOT, Primary Constructors, and Blazor United

OpenAI Assistants API: Building Stateful AI Agents with Code Interpreter and File Search

GPT-4 Turbo and the OpenAI Assistants API: Building Production Conversational AI Systems

Harnessing AWS CDK for Python: Streamlining Infrastructure as Code

LLM Observability: Tracing, Cost Tracking, and Quality Monitoring for Production AI