LLM Caching Strategies: Reducing Costs and Latency at Scale

Introduction: LLM API calls are expensive and slow. A single GPT-4 request can cost cents and take seconds—multiply that by thousands of users and costs spiral quickly. Caching is the most effective way to reduce both cost and latency. But LLM caching is different from traditional caching: exact string matches are rare, and semantically similar […]


Azure DevOps Server 2019 RC1 – Available/Download Now

Microsoft has announced the availability of the first release candidate (RC) of Azure DevOps Server 2019. Azure DevOps Server (previously TFS/Team Foundation Server) delivers Azure DevOps Services optimized for customers who prefer to self-host these DevOps services on-premises. Key features included/improved: Branding Changes Azure DevOps Server includes support for Azure SQL in addition to […]


Visual Studio 2017–Version 15.9.0 released

Microsoft has today released Visual Studio 2017 – Update 15.9.0 with lots of bug fixes and improvements to the IDE for stability and performance. Release Notes: Visual Studio 2017 version 15.9 Minor Release Download the latest update from: visualstudio.com/downloads Latest News: There is a new service update released on November 15, 2018 — Visual Studio […]


Prompt Compression Techniques: Fitting More Context in Fewer Tokens

Introduction: Context windows are limited and tokens are expensive. Long prompts with extensive context, examples, or retrieved documents quickly hit limits and drive up costs. Prompt compression techniques reduce token count while preserving the information LLMs need to generate quality responses. This guide covers practical compression strategies: token pruning to remove low-information tokens, extractive summarization […]


Global Office 365 Developer Bootcamp–Letterkenny-Nov’10 2018–Register Now

We have the opportunity to host the Global Office 365 Developer Bootcamp in Letterkenny as part of the Letterkenny DotNet Azure User Group (LK-MUG). Global Office 365 Developer Bootcamp – Overview: Following the success of last year, the Global Office 365 Developer Bootcamp now becomes an annual event. It is a free, one-day, hands-on training event led […]


Azure Cosmos DB – TTL (Time to Live) – Reference Usecase

The TTL capability within Azure Cosmos DB is a lifesaver, as it takes the necessary steps to purge redundant data based on your configuration. Consider an Industrial IoT scenario: devices can produce vast amounts of telemetry information, logs and user session information that is only useful until we […]
