In my previous article I wrote an introductory about NDepend and how it will be useful for Agile Team to ensure code quality. In that article we found how we can use NDepend in a developer machine. Now with this article we will familiarize ourselves in using NDepend in your build automation pipeline in your […]
Read more →Search Results for: name
Async LLM Patterns: Building High-Throughput AI Applications
Introduction: LLM APIs are inherently slow—even fast models take hundreds of milliseconds per request. When you need to process multiple prompts, make parallel API calls, or handle high-throughput workloads, synchronous code becomes a bottleneck. Async patterns let you overlap I/O wait times, dramatically improving throughput without adding complexity. This guide covers practical async patterns for […]
Read more →LLM Rate Limiting: Maximizing API Throughput Without Getting Throttled
Introduction: LLM APIs have strict rate limits—requests per minute, tokens per minute, and concurrent request limits. Hit these limits and your application grinds to a halt with 429 errors. Effective rate limiting isn’t just about staying under limits; it’s about maximizing throughput while maintaining reliability. This guide covers practical rate limiting patterns: token bucket algorithms […]
Read more →Embedding Fine-Tuning: Training Custom Embeddings for Domain-Specific Retrieval
Introduction: Off-the-shelf embedding models work well for general text, but domain-specific applications often need better performance. Fine-tuning embeddings on your data can dramatically improve retrieval quality—turning a 70% recall into 90%+ for your specific use case. The key is creating high-quality training data that teaches the model what “similar” means in your domain. This guide […]
Read more →PowerShell: Check a parameter/variable value is null
While you are writing PowerShell modules, with lot with parameters and you might want to verify these parameters are not ‘null’ to validate some business cases. In normal powershell inline scripting context, $variablename -eq “$null” would work : if ($varibalename -eq $null) { Write-Host "variable is null.Please supply the values for variablename." } RECOMMENDED APPROACH: […]
Read more →Query Routing: Intelligent Request Distribution for Cost-Efficient AI Systems
Introduction: Not all queries are equal—some need fast, cheap responses while others require deep reasoning. Query routing intelligently directs requests to the right model, index, or processing pipeline based on query characteristics. Route simple factual questions to smaller models, complex reasoning to GPT-4, and domain-specific queries to specialized indexes. This approach optimizes both cost and […]
Read more →