Tag: LLM

  • Agentic AI Tools

    Agentic refers to a type of artificial intelligence system that can operate autonomously to achieve specific goals. Unlike traditional AI, which typically follows pre-programmed instructions, an agentic system can perceive its environment, reason about complex situations, make decisions, and take actions with limited or no direct human intervention. These systems often leverage large language models (LLMs) and other AI capabilities to understand context, develop plans, and execute multi-step tasks.
    An agentic AI toolset comprises the various software, frameworks, and platforms that enable developers and businesses to build and deploy these autonomous AI systems. These toolsets often include components that facilitate:

    • Agent Creation and Configuration: Tools for defining the goals, instructions, and capabilities of individual AI agents. This might involve specifying the LLM to be used, providing initial prompts, and defining the agent’s role and responsibilities. Examples include the “Agents” feature in OpenAI’s new tools for building agents.
    • Task Planning and Execution: Frameworks that allow agents to break down complex goals into smaller, manageable steps and execute them autonomously. This often involves reasoning, decision-making, and the ability to adapt plans based on the environment and feedback.
    • Tool Integration: Mechanisms for AI agents to interact with external tools, APIs, and services to gather information, perform actions, and achieve their objectives. This can include accessing databases, sending emails, interacting with web applications, or controlling physical devices. Examples include the tool-use capabilities in OpenAI’s Assistants and the integration capabilities of platforms like Moveworks.
    • Multi-Agent Collaboration: Features that enable multiple AI agents to work together to solve complex problems. These frameworks facilitate communication, coordination, and the intelligent transfer of control between agents. Examples include Microsoft AutoGen and CrewAI.
    • State Management and Workflows: Tools for managing the state of interactions and defining complex, stateful workflows. LangGraph is specifically designed for mastering such workflows.
    • Safety and Control: Features for implementing guardrails and safety checks to ensure that AI agents operate responsibly and ethically. This includes input and output validation mechanisms.
    • Monitoring and Observability: Tools for visualizing the execution of AI agents, debugging issues, and optimizing their performance. OpenAI’s new tools include tracing and observability features.
      Examples of Agentic AI Toolsets and Platforms (as of April 2025):
    • Microsoft AutoGen: A framework designed for building applications that involve multiple AI agents that can converse and collaborate to solve tasks.
    • LangChain: A popular framework for building AI-powered applications, offering components to create sophisticated AI agents with memory, tool use, and planning capabilities.
    • LangGraph: Extends LangChain to build stateful, multi-actor AI workflows.
    • Microsoft Semantic Kernel: A framework for integrating intelligent reasoning into software applications, enabling the creation of AI agents that can leverage plugins and skills.
    • CrewAI: A framework focused on enabling AI teamwork, allowing developers to create teams of AI agents with specific roles and objectives.
    • Moveworks: An enterprise-grade AI Assistant platform that uses agentic AI to automate employee support and complex workflows across various organizational systems.
    • OpenAI Tools for Building Agents: A new set of APIs and tools, including the Responses API, Agents, Handoffs, and Guardrails, designed to simplify the development of agentic applications.
    • Adept: Focuses on building AI agents capable of interacting with and automating tasks across various software applications through UI understanding and control.
    • AutoGPT: An open-source AI platform that aims to create continuous AI agents capable of handling a wide range of tasks autonomously.
    • AskUI: Provides tools for building AI agents that can interact with and automate tasks based on understanding user interfaces across different applications.
      These toolsets are rapidly evolving as the field of agentic AI advances, offering increasingly sophisticated capabilities for building autonomous and intelligent systems. They hold the potential to significantly impact various industries by automating complex tasks, enhancing productivity, and enabling new forms of human-AI collaboration.
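
    To make the components above concrete, here is a minimal, framework-agnostic sketch of an agent loop in Python. It is illustrative only: call_llm, search_web, and the JSON action format are hypothetical stand-ins for whatever model API and tools a real framework (AutoGen, LangChain, CrewAI, etc.) would provide.

    Python

    import json

    def call_llm(prompt: str) -> str:
        """Hypothetical stand-in for a real model call; here it always decides to finish."""
        return '{"action": "finish", "answer": "stub answer - replace call_llm with a real model call"}'

    def search_web(query: str) -> str:
        """Hypothetical tool the agent may invoke."""
        return f"Search results for: {query}"

    TOOLS = {"search_web": search_web}

    def run_agent(goal: str, max_steps: int = 5) -> str:
        """Plan-act-observe loop: the model either calls a tool or returns a final answer."""
        history = f"Goal: {goal}\n"
        instruction = ('Respond with JSON: {"action": "search_web", "input": "..."} '
                       'or {"action": "finish", "answer": "..."}')
        for _ in range(max_steps):
            decision = json.loads(call_llm(history + instruction))
            if decision["action"] == "finish":
                return decision["answer"]
            observation = TOOLS[decision["action"]](decision["input"])  # act on the chosen tool
            history += f"Observation: {observation}\n"                  # observe, then re-plan
        return "Step limit reached without a final answer."

    print(run_agent("Find the current mortgage rates and summarize them."))
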
  • Comparing various Time Series Databases

    A time series database (TSDB) is a type of database specifically designed to handle sequences of data points indexed by time. This is in contrast to traditional relational databases that are optimized for transactional data and may not efficiently handle the unique characteristics of time-stamped data.

    Here’s a comparison of key aspects of Time Series Databases:

    Key Features of Time Series Databases:

    • Optimized for Time-Stamped Data: TSDBs are architectured with time as a primary index, allowing for fast and efficient storage and retrieval of data based on time ranges.
    • High Ingestion Rates: They are built to handle continuous and high-volume data streams from various sources like sensors, applications, and infrastructure.
    • Efficient Time-Range Queries: TSDBs excel at querying data within specific time intervals, a common operation in time series analysis.
    • Data Retention Policies: They often include mechanisms to automatically manage data lifecycle by defining how long data is stored and when it should be expired or downsampled.
    • Data Compression: TSDBs employ specialized compression techniques to reduce storage space and improve query performance over large datasets.
    • Downsampling and Aggregation: They often provide built-in functions to aggregate data over different time windows (e.g., average hourly, daily summaries) to facilitate analysis at various granularities.
    • Real-time Analytics: Many TSDBs support real-time querying and analysis, enabling immediate insights from streaming data.
    • Scalability: Modern TSDBs are designed to scale horizontally (adding more nodes) to handle growing data volumes and query loads.

    Comparison of Popular Time Series Databases:

    Here’s a comparison of some well-known time series databases based on various criteria:

    | Feature | TimescaleDB | InfluxDB | Prometheus | ClickHouse |
    | --- | --- | --- | --- | --- |
    | Database Model | Relational (PostgreSQL extension) | Custom NoSQL, Columnar | Pull-based metrics system | Columnar |
    | Query Language | SQL | InfluxQL, Flux, SQL | PromQL | SQL-like |
    | Data Model | Tables with time-based partitioning | Measurements, Tags, Fields | Metrics with labels | Tables with time-based organization |
    | Scalability | Vertical, Horizontal (read replicas) | Horizontal (clustering in enterprise) | Vertical, Horizontal (via federation) | Horizontal |
    | Data Ingestion | Push | Push | Pull (scraping) | Push (various methods) |
    | Data Retention | SQL-based management | Retention policies per database/bucket | Configurable retention time | SQL-based management |
    | Use Cases | DevOps, IoT, Financial, General TS | DevOps, IoT, Analytics | Monitoring, Alerting, Kubernetes | Analytics, Logging, IoT |
    | Community | Strong PostgreSQL community | Active InfluxData community | Large, active, cloud-native focused | Growing, strong for analytics |
    | Licensing | Open Source (Timescale License) | Open Source (MIT), Enterprise | Open Source (Apache 2.0) | Open Source (Apache 2.0) |
    | Cloud Offering | Timescale Cloud | InfluxDB Cloud | Various managed Prometheus services | ClickHouse Cloud, various providers |

    Key Differences Highlighted:

    • Query Language: SQL compatibility in TimescaleDB and ClickHouse can be advantageous for users familiar with relational databases, while InfluxDB and Prometheus have their own specialized query languages (InfluxQL/Flux and PromQL, respectively); see the example after this list.
    • Data Model: The way data is organized and tagged differs significantly, impacting query syntax and flexibility.
    • Data Collection: Prometheus uses a pull-based model where it scrapes metrics from targets, while InfluxDB and TimescaleDB typically use a push model where data is sent to the database.
    • Scalability Approach: While all aim for scalability, the methods (clustering, federation, partitioning) and ease of implementation can vary.
    • Focus: Prometheus is heavily geared towards monitoring and alerting in cloud-native environments, while InfluxDB and TimescaleDB have broader applicability in IoT, analytics, and general time series data storage.
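
    To make the query-language contrast concrete, the sketch below runs the same hourly-average aggregation against TimescaleDB (SQL with time_bucket) and Prometheus (PromQL over its HTTP API). The connection string, the metrics hypertable, and the cpu_usage metric are hypothetical placeholders; adjust them to your own schema.

    Python

    import psycopg2   # TimescaleDB speaks standard PostgreSQL
    import requests   # Prometheus exposes an HTTP query API

    # --- TimescaleDB: SQL with the time_bucket() aggregation function ---
    conn = psycopg2.connect("dbname=metrics user=postgres host=localhost")  # hypothetical DSN
    with conn.cursor() as cur:
        cur.execute("""
            SELECT time_bucket('1 hour', time) AS bucket, avg(cpu_usage)
            FROM metrics                       -- hypothetical hypertable
            WHERE time > now() - interval '1 day'
            GROUP BY bucket
            ORDER BY bucket;
        """)
        for bucket, avg_cpu in cur.fetchall():
            print(bucket, avg_cpu)

    # --- Prometheus: the same idea expressed in PromQL ---
    resp = requests.get(
        "http://localhost:9090/api/v1/query",              # default Prometheus endpoint
        params={"query": "avg_over_time(cpu_usage[1h])"},
    )
    print(resp.json()["data"]["result"])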

    Choosing the Right TSDB:

    The best time series database for a particular use case depends on several factors:

    • Data Volume and Ingestion Rate: Consider how much data you’ll be ingesting and how frequently.
    • Query Patterns and Complexity: What types of queries will you be running? Do you need complex joins or aggregations?
    • Scalability Requirements: How much data do you anticipate storing and querying in the future?
    • Existing Infrastructure and Skills: Consider your team’s familiarity with different database types and query languages.
    • Monitoring and Alerting Needs: If monitoring is a primary use case, Prometheus might be a strong contender.
    • Long-Term Storage Requirements: Some TSDBs are better suited for long-term historical data storage and analysis.
    • Cost: Consider the costs associated with self-managed vs. cloud-managed options and any enterprise licensing fees.

    By carefully evaluating these factors against the strengths and weaknesses of different time series databases, you can choose the one that best fits your specific needs.

  • Building a Personalized Banking Chat Agent with React.js, RAG, LLM, and Redis with sample code

    Here we outline a more detailed structure with conceptual sample code snippets for each layer of a conceptual personalized bank FAQ chat agent. Keep in mind that this is a simplified illustration, and a production-ready system would involve more robust error handling, security measures, and integration logic.

    I. Knowledge Base Preparation:

    Step 1: Data Collection & Structuring

    Assume you have your bank’s FAQs in a structured format, perhaps JSON files where each entry has a question and an answer, or markdown files.

    JSON

    [
      {
        "question": "What are your current mortgage rates?",
        "answer": "Our current mortgage rates vary depending on the loan type and your credit score. Please visit our mortgage page or contact a loan officer for personalized rates."
      },
      {
        "question": "How do I reset my online banking password?",
        "answer": "To reset your online banking password, please click on the 'Forgot Password' link on the login page and follow the instructions."
      },
      // ... more FAQs
    ]
    

    Step 2: Chunking

    For larger documents (like policy documents), you’ll need to break them into smaller chunks. A simple approach is to split the text into fixed-size character windows with some overlap, as in the helper below, so context isn’t lost at chunk boundaries.

    def chunk_text(text, chunk_size=512, overlap=50):
        chunks = []
        stride = chunk_size - overlap
        for i in range(0, len(text), stride):
            chunk = text[i:i + chunk_size]
            chunks.append(chunk)
        return chunks
    
    # Example for a policy document
    policy_text = """
    This is a long banking policy document... It contains important information about accounts... and transaction limits...
    Another paragraph discussing security measures... and fraud prevention...
    """
    policy_chunks = chunk_text(policy_text)
    print(f"Number of policy chunks: {len(policy_chunks)}")
    

    Step 3: Embedding Generation

    You’ll use an embedding model (e.g., from OpenAI, Sentence Transformers) to convert your FAQ answers and document chunks into vector embeddings.

    Python

    from sentence_transformers import SentenceTransformer
    import numpy as np
    
    embedding_model = SentenceTransformer('all-MiniLM-L6-v2')
    
    faq_data = [
        {"question": "...", "answer": "Answer 1"},
        {"question": "...", "answer": "Answer 2"},
        # ...
    ]
    
    faq_embeddings = embedding_model.encode([item["answer"] for item in faq_data])
    print(f"Shape of FAQ embeddings: {faq_embeddings.shape}")
    
    policy_chunks = ["chunk 1 of policy", "chunk 2 of policy"]
    policy_embeddings = embedding_model.encode(policy_chunks)
    print(f"Shape of policy embeddings: {policy_embeddings.shape}")
    

    Step 4: Storing Embeddings in Redis

    You’ll use Redis with a vector search module (like Redis Stack) to store and index these embeddings.

    Python

    import redis
    from redis.commands.search.field import TextField, VectorField
    from redis.commands.search.indexDefinition import IndexDefinition, IndexType
    
    REDIS_HOST = "localhost"
    REDIS_PORT = 6379
    REDIS_PASSWORD = None
    INDEX_NAME = "bank_faq_embeddings"
    VECTOR_DIM = 384  # Dimension of all-MiniLM-L6-v2 embeddings
    NUM_VECTORS = len(faq_data) + len(policy_chunks)
    
    r = redis.Redis(host=REDIS_HOST, port=REDIS_PORT, password=REDIS_PASSWORD)
    
    # Define the schema for the Redis index
    schema = (
        TextField("content"),  # Store the original text chunk
        VectorField("embedding", "FLAT", {"TYPE": "FLOAT32", "DIM": VECTOR_DIM, "DISTANCE_METRIC": "COSINE"})
    )
    
    # Define the index
    definition = IndexDefinition(prefix=["faq:", "policy:"], index_type=IndexType.HASH)
    
    try:
        r.ft(INDEX_NAME).info()
        print(f"Index '{INDEX_NAME}' already exists.")
    except Exception:  # the index does not exist yet, so create it below
        r.ft(INDEX_NAME).create_index(fields=schema, definition=definition)
        print(f"Index '{INDEX_NAME}' created.")
    
    # Store FAQ embeddings
    for i, item in enumerate(faq_data):
        key = f"faq:{i}"
        embedding = faq_embeddings[i].astype(np.float32).tobytes()
        r.hset(key, mapping={"content": item["answer"], "embedding": embedding})
    
    # Store policy chunk embeddings
    for i, chunk in enumerate(policy_chunks):
        key = f"policy:{i}"
        embedding = policy_embeddings[i].astype(np.float32).tobytes()
        r.hset(key, mapping={"content": chunk, "embedding": embedding})
    
    print(f"Stored {r.ft(INDEX_NAME).info()['num_docs']} vectors in Redis.")
    

    II. Implementation (Backend – Python/Node.js with a Framework like Flask/Express):

    Python

    from flask import Flask, request, jsonify
    from sentence_transformers import SentenceTransformer
    import numpy as np
    import redis
    from redis.commands.search.query import Query
    
    # Connection settings (must match the values used in the knowledge base preparation step)
    REDIS_HOST = "localhost"
    REDIS_PORT = 6379
    REDIS_PASSWORD = None
    
    app = Flask(__name__)
    embedding_model = SentenceTransformer('all-MiniLM-L6-v2')
    r = redis.Redis(host=REDIS_HOST, port=REDIS_PORT, password=REDIS_PASSWORD)
    INDEX_NAME = "bank_faq_embeddings"
    VECTOR_DIM = 384
    LLM_API_KEY = "YOUR_LLM_API_KEY"  # Replace with your actual LLM API key
    
    def retrieve_relevant_documents(query, top_n=3):
        query_embedding = embedding_model.encode(query).astype(np.float32).tobytes()
        redis_query = (
            Query("*=>[KNN $topK @embedding $query_vector AS score]")
            .sort_by("score")
            .return_fields("content", "score")
            .dialect(2)
        )
        results = r.ft(INDEX_NAME).search(
            redis_query,
            query_params={"query_vector": query_embedding, "topK": top_n}
        )
        return [{"content": doc.content, "score": doc.score} for doc in results.docs]
    
    def generate_response(query, context_documents):
        context = "\n".join([doc["content"] for doc in context_documents])
        prompt = f"""You are a helpful bank assistant. Use the following information to answer the user's question.
        If you cannot find the answer in the provided context, truthfully say "I'm sorry, I don't have the information to answer that question."
    
        Context:
        {context}
    
        Question: {query}
        Answer:"""
    
        import openai
        openai.api_key = LLM_API_KEY
        try:
            response = openai.Completion.create(
                model="gpt-3.5-turbo-instruct", # Choose an appropriate 
                prompt=prompt,
                max_tokens=200,
                temperature=0.2,
                n=1,
                stop=None
            )
            return response.choices[0].text.strip()
        except Exception as e:
            print(f"Error calling LLM: {e}")
            return "An error occurred while generating the response."
    
    @app.route('/chat', methods=['POST'])
    def chat():
        user_query = request.json.get('query')
        if not user_query:
            return jsonify({"error": "Missing query"}), 400
    
        # --- Personalization Layer (Conceptual) ---
        user_profile = get_user_profile(request.headers.get('Authorization')) # Example: Fetch user data
        personalized_context = get_personalized_context(user_profile) # Example: Fetch relevant account info
    
        # Augment query with personalized context (optional)
        augmented_query = f"{user_query} Regarding my {personalized_context}." if personalized_context else user_query
    
        relevant_documents = retrieve_relevant_documents(augmented_query)
        response = generate_response(user_query, relevant_documents)
    
        return jsonify({"response": response})
    
    def get_user_profile(auth_token):
        # In a real application, you would authenticate the token and fetch user data
        # from your bank's user database.
        # For this example, let's return a mock profile.
        if auth_token == "Bearer valid_token":
            return {"account_type": "checking", "recent_transactions": [...] }
        return None
    
    def get_personalized_context(user_profile):
        if user_profile and user_profile.get("account_type"):
            return f"my {user_profile['account_type']} account"
        return None
    
    if __name__ == '__main__':
        app.run(debug=True)
    

    III. LLM Integration (within the Backend):

    The generate_response function in the backend code snippet demonstrates the integration with an LLM (using OpenAI’s API as an example). You would replace "gpt-3.5-turbo-instruct" with your chosen model and handle the API interactions accordingly.
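
    If you are using version 1.0 or later of the openai Python library, the legacy Completion endpoint shown above is superseded by the chat completions interface. The following is a minimal sketch of the same call in that style; the model name is an assumption, and LLM_API_KEY refers to the configuration value defined earlier.

    Python

    from openai import OpenAI

    client = OpenAI(api_key=LLM_API_KEY)  # or rely on the OPENAI_API_KEY environment variable

    def generate_response_chat(query, context_documents):
        context = "\n".join(doc["content"] for doc in context_documents)
        completion = client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model name; any chat-capable model works
            messages=[
                {"role": "system", "content": "You are a helpful bank assistant. Answer only from the provided context."},
                {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {query}"},
            ],
            max_tokens=200,
            temperature=0.2,
        )
        return completion.choices[0].message.content.strip()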

    IV. Redis Integration (within the Backend):

    The backend code shows how Redis is used for:

    • Storing Embeddings: Step 4 of the Knowledge Base Preparation stores the FAQ and policy-chunk embeddings as Redis hashes under the faq: and policy: prefixes covered by the index.
    • Retrieving Relevant Documents: The retrieve_relevant_documents function uses Redis’s vector search capabilities to find the most similar document embeddings to the user’s query embedding.

    V. React.js Front-End Development:

    JavaScript

    import React, { useState } from 'react';
    
    function ChatAgent() {
      const [messages, setMessages] = useState([]);
      const [inputText, setInputText] = useState('');
      const [isLoading, setIsLoading] = useState(false);
    
      const sendMessage = async () => {
        if (!inputText.trim()) return;
    
        const userMessage = { text: inputText, sender: 'user' };
        setMessages([...messages, userMessage]);
        setInputText('');
        setIsLoading(true);
    
        try {
          const response = await fetch('/chat', {
            method: 'POST',
            headers: {
              'Content-Type': 'application/json',
              'Authorization': 'Bearer valid_token' // Example: Pass user token if authenticated
            },
            body: JSON.stringify({ query: inputText }),
          });
    
          if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
          }
    
          const data = await response.json();
          const botMessage = { text: data.response, sender: 'bot' };
          setMessages((prev) => [...prev, botMessage]);  // functional update avoids stale state
        } catch (error) {
          console.error("Error sending message:", error);
          const errorMessage = { text: "Sorry, I encountered an error.", sender: 'bot' };
          setMessages((prev) => [...prev, errorMessage]);
        } finally {
          setIsLoading(false);
        }
      };
    
      return (
        <div className="chat-container">
          <div className="message-list">
            {messages.map((msg, index) => (
              <div key={index} className={`message ${msg.sender}`}>
                {msg.text}
              </div>
            ))}
            {isLoading && <div className="message bot">Loading...</div>}
          </div>
          <div className="input-area">
            <input
              type="text"
              value={inputText}
              onChange={(e) => setInputText(e.target.value)}
              placeholder="Ask a question..."
              onKeyPress={(e) => e.key === 'Enter' && sendMessage()}
            />
            <button onClick={sendMessage} disabled={isLoading}>Send</button>
          </div>
        </div>
      );
    }
    
    export default ChatAgent;
    

    VI. Personalization Layer:

    The personalization aspect is touched upon in the backend (/chat route and the get_user_profile, get_personalized_context functions). In a real-world scenario, this layer would involve:

    • User Authentication: Securely identifying the user.
    • Data Fetching: Retrieving relevant user data from your bank’s systems based on their identity (e.g., account details, transaction history, past interactions).
    • Contextualization: Using the fetched data to:
      • Filter/Boost Knowledge Base Results: Prioritize FAQs or document sections relevant to the user’s situation.
      • Augment the Query: Add context to the user’s query before retrieval (as shown in the backend example).
      • Tailor the Prompt: Include personalized information in the prompt sent to the LLM.
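
    As a rough sketch of the “Filter/Boost” and “Tailor the Prompt” ideas above, building on the retrieve_relevant_documents function from the backend snippet: the audience tag, helper names, and profile shape are all hypothetical assumptions, not part of the code shown earlier.

    Python

    def retrieve_for_user(query, user_profile, top_n=3):
        """Retrieve documents, then keep only those tagged for the user's account type (if tagged at all)."""
        docs = retrieve_relevant_documents(query, top_n=top_n * 2)  # over-fetch, then filter
        account_type = (user_profile or {}).get("account_type")
        if account_type:
            # Hypothetical: assumes each stored document carries an optional 'audience' tag such as "checking".
            docs = [d for d in docs if d.get("audience") in (None, account_type)]
        return docs[:top_n]

    def build_personalized_prompt(query, docs, user_profile):
        """Prepend a one-line profile summary so the LLM can tailor its wording."""
        profile_line = f"The customer has a {user_profile['account_type']} account.\n" if user_profile else ""
        context = "\n".join(d["content"] for d in docs)
        return (
            "You are a helpful bank assistant.\n"
            f"{profile_line}"
            f"Context:\n{context}\n\n"
            f"Question: {query}\nAnswer:"
        )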

    VII. Evaluation and Improvement:

    This is an ongoing process that involves:

    • Tracking Metrics: Monitor user engagement, satisfaction, and the accuracy of the chatbot’s responses.
    • User Feedback Collection: Implement mechanisms for users to provide feedback on the chatbot’s answers.
    • Analysis: Analyze the data and feedback to identify areas where the chatbot can be improved (e.g., gaps in the knowledge base, poor-performing prompts).
    • Iteration: Continuously update the knowledge base, refine the RAG pipeline, and adjust the LLM prompts based on the evaluation results.

    Important Considerations:

    • Security: Implement robust security measures at every layer, especially when handling user data and API keys (see the sketch after this list).
    • Error Handling: Implement comprehensive error handling to gracefully manage unexpected issues.
    • Scalability: Design your system to handle a growing number of users and data.
    • Cost Management: Be mindful of the costs associated with LLM API usage and Redis hosting.
    • User Experience: Focus on creating a smooth and intuitive chat interface.
    • Compliance: Ensure your chatbot complies with all relevant banking regulations and privacy policies.
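
    For the Security point above, one immediate improvement to the earlier snippets is to load secrets from the environment rather than hard-coding them (a minimal sketch; the variable names are assumptions):

    Python

    import os

    # Read secrets from the environment (set via your deployment platform or a .env loader).
    LLM_API_KEY = os.environ["LLM_API_KEY"]            # fail fast if the key is missing
    REDIS_PASSWORD = os.environ.get("REDIS_PASSWORD")  # optional; defaults to None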

    This detailed breakdown with sample code provides a solid foundation for building your personalized bank FAQ chat agent. Remember to adapt and expand upon this based on your specific requirements and the complexity of your bank’s information. Good luck!

  • Building a Personalized Bank FAQ Chat Agent with React.js, RAG, LLM, and Redis

    Providing efficient and informative customer support is crucial for any financial institution. A well-designed FAQ chat agent can significantly enhance the user experience by offering instant answers to common queries. This article provides a comprehensive guide to building a personalized bank FAQ chat agent using React.js for the frontend, Retrieval-Augmented Generation (RAG) and a Large Language Model (LLM) for intelligent responses, and Redis for robust session management and personalized chat history.

    I. The Power of Intelligent Chat for Bank FAQs

    Traditional FAQ pages can be cumbersome. An intelligent chat agent offers a more interactive and efficient way to find answers by understanding natural language queries and providing contextually relevant information drawn from the bank’s knowledge base. Leveraging Redis for session management allows for personalized interactions by remembering past conversations within a session.

    II. Core Components

    1. Frontend (React.js): User interface for interaction.
    2. Backend (Python with Flask): Orchestrates RAG, LLM, and session/chat history (Redis).
    3. Knowledge Source: Bank’s FAQ documents, policies, website content.
    4. Embedding Model: Converts text to vectors (e.g., OpenAI Embeddings).
    5. Vector Database: Stores and indexes vector embeddings (e.g., ChromaDB).
    6. Large Language Model (LLM): Generates responses (e.g., OpenAI’s GPT models).
    7. Redis: In-memory data store for sessions and chat history.
    8. Flask-Session: Flask extension for Redis-backed session management.
    9. LangChain: Framework for streamlining RAG and LLM interactions.

    III. Backend Implementation (Python with Flask, Redis, and RAG)

    Python

    from flask import Flask, request, jsonify, session
    from flask_session import Session
    from redis import Redis
    import uuid
    import json
    from flask_cors import CORS
    from langchain.embeddings import OpenAIEmbeddings
    from langchain.vectorstores import Chroma
    from langchain.chains import RetrievalQA
    from langchain.llms import OpenAI
    from langchain.document_loaders import DirectoryLoader, TextLoader
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    import os
    
    # --- Configuration ---
    OPENAI_API_KEY = os.environ.get("OPENAI_API_KEY")
    REDIS_HOST = 'localhost'
    REDIS_PORT = 6379
    REDIS_DB = 0
    VECTOR_DB_PATH = "./bank_faq_db"
    FAQ_DOCS_PATH = "./bank_faq_docs"
    
    app = Flask(__name__)
    CORS(app)
    app.config&lsqb;"SESSION_TYPE"] = "redis"
    app.config&lsqb;"SESSION_PERMANENT"] = True
    app.config&lsqb;"SESSION_REDIS"] = Redis(host=REDIS_HOST, port=REDIS_PORT, db=REDIS_DB)
    app.secret_key = "your_bank_faq_secret_key"  # Replace with a strong key
    sess = Session(app)
    
    # --- Initialize RAG Components ---
    embeddings = OpenAIEmbeddings(openai_api_key=OPENAI_API_KEY)
    if not os.path.exists(VECTOR_DB_PATH):
        # --- Data Ingestion (Run once to create the vector database) ---
        if not os.path.exists(FAQ_DOCS_PATH):
            os.makedirs(FAQ_DOCS_PATH)
            print(f"Please place your bank's FAQ documents (e.g., .txt files) in '{FAQ_DOCS_PATH}' and rerun the backend to process them.")
            vectordb = None
        else:
            loader = DirectoryLoader(FAQ_DOCS_PATH, glob="**/*.txt", loader_cls=TextLoader)
            documents = loader.load()
            text_splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
            chunks = text_splitter.split_documents(documents)
            vectordb = Chroma.from_documents(chunks, embeddings, persist_directory=VECTOR_DB_PATH)
            vectordb.persist()
    else:
        vectordb = Chroma(persist_directory=VECTOR_DB_PATH, embedding_function=embeddings)
    
    qa_chain = RetrievalQA.from_chain_type(llm=OpenAI(openai_api_key=OPENAI_API_KEY), chain_type="stuff", retriever=vectordb.as_retriever()) if vectordb else None
    
    # --- Redis Helper Functions ---
    def store_message(session_id, sender, text):
        redis_client = app.config["SESSION_REDIS"]
        key = f"bank_faq_chat:{session_id}"
        message = {"sender": sender, "text": text}
        redis_client.rpush(key, json.dumps(message))
    
    def get_history(session_id):
        redis_client = app.config["SESSION_REDIS"]
        key = f"bank_faq_chat:{session_id}"
        history_bytes = redis_client.lrange(key, 0, -1)
        return [json.loads(hb.decode('utf-8')) for hb in history_bytes]
    
    # --- API Endpoints ---
    @app.route('/create_session')
    def create_session():
        if 'bank_faq_session_id' not in session:
            session_id = str(uuid.uuid4())
            session['bank_faq_session_id'] = session_id
            return jsonify({"session_id": session_id})
        else:
            return jsonify({"session_id": session['bank_faq_session_id']})
    
    @app.route('/get_chat_history')
    def get_chat_history():
        if 'bank_faq_session_id' not in session:
            return jsonify({"history": &lsqb;]})
        session_id = session&lsqb;'bank_faq_session_id']
        history = get_history(session_id)
        return jsonify({"history": history})
    
    @app.route('/bank_faq/chat', methods=['POST'])
    def bank_faq_chat():
        if 'bank_faq_session_id' not in session:
            return jsonify({"error": "No active session."}), 401
    
        session_id = session['bank_faq_session_id']
        data = request.get_json()
        user_message = data.get('message')
    
        if not user_message:
            return jsonify({"error": "Message is required"}), 400
    
        store_message(session_id, "user", user_message)
    
        try:
            if qa_chain:
                response = qa_chain.run(user_message)
                store_message(session_id, "agent", response)
                return jsonify({"response": response})
            else:
                error_message = "Bank FAQ knowledge base not initialized. Please ensure FAQ documents are present and the backend is run to process them."
                store_message(session_id, "agent", error_message)
                return jsonify({"error": error_message}), 500
    
        except Exception as e:
            error_message = f"Sorry, I encountered an error: {str(e)}"
            store_message(session_id, "agent", error_message)
            return jsonify({"error": error_message}), 500
    
    if __name__ == '__main__':
        print("Make sure you have your OpenAI API key set as an environment variable (OPENAI_API_KEY).")
        print(f"Place bank FAQ documents in '{FAQ_DOCS_PATH}' for processing.")
        app.run(debug=True)
    

    IV. Frontend Implementation (React.js)

    JavaScript

    import React, { useState, useEffect, useRef } from 'react';
    
    function BankFAQChat() {
      const [messages, setMessages] = useState([]);
      const [inputValue, setInputValue] = useState('');
      const [isLoading, setIsLoading] = useState(false);
      const chatWindowRef = useRef(null);
      const [sessionId, setSessionId] = useState(null);
    
      useEffect(() => {
        const fetchSessionAndHistory = async () => {
          try {
            const sessionResponse = await fetch('/create_session');
            if (sessionResponse.ok) {
              const sessionData = await sessionResponse.json();
              setSessionId(sessionData.session_id);
              if (sessionData.session_id) {
                const historyResponse = await fetch('/get_chat_history');
                if (historyResponse.ok) {
                  const historyData = await historyResponse.json();
                  setMessages(historyData.history);
                } else {
                  console.error('Failed to fetch chat history:', historyResponse.status);
                }
              }
            } else {
              console.error('Failed to create/retrieve session:', sessionResponse.status);
            }
          } catch (error) {
            console.error('Error fetching session and history:', error);
          }
        };
    
        fetchSessionAndHistory();
      }, []);
    
      useEffect(() => {
        if (chatWindowRef.current) {
          chatWindowRef.current.scrollTop = chatWindowRef.current.scrollHeight;
        }
      }, [messages]);
    
      const sendMessage = async () => {
        if (inputValue.trim() && sessionId) {
          const newMessage = { sender: 'user', text: inputValue };
          setMessages([...messages, newMessage]);
          setInputValue('');
          setIsLoading(true);
    
          try {
            const response = await fetch('/bank_faq/chat', {
              method: 'POST',
              headers: { 'Content-Type': 'application/json' },
              body: JSON.stringify({ message: inputValue }),
            });
    
            if (response.ok) {
              const data = await response.json();
              const agentMessage = { sender: 'agent', text: data.response };
              setMessages([...messages, newMessage, agentMessage]);
            } else {
              console.error('Error sending message:', response.status);
              const errorMessage = { sender: 'agent', text: 'Sorry, I encountered an error.' };
              setMessages([...messages, newMessage, errorMessage]);
            }
          } catch (error) {
            console.error('Error sending message:', error);
            const errorMessage = { sender: 'agent', text: 'Sorry, I encountered an error.' };
            setMessages([...messages, newMessage, errorMessage]);
          } finally {
            setIsLoading(false);
          }
        }
      };
    
      return (
        <div className="chat-container" style={styles.chatContainer}>
          <div ref={chatWindowRef} className="message-list" style={styles.messageList}>
            {messages.map((msg, index) => (
              <div key={index} className={`message ${msg.sender}`} style={msg.sender === 'user' ? styles.userMessage : styles.agentMessage}>
                {msg.text}
              </div>
            ))}
            {isLoading && <div className="message agent" style={styles.agentMessage}>Thinking...</div>}
          </div>
          <div className="input-area" style={styles.inputArea}>
            <input
              type="text"
              value={inputValue}
              onChange={(e) => setInputValue(e.target.value)}
              onKeyPress={(event) => event.key === 'Enter' && sendMessage()}
              placeholder="Ask a bank FAQ..."
              style={styles.input}
            />
            <button onClick={sendMessage} disabled={isLoading} style={styles.button}>Send</button>
          </div>
        </div>
      );
    }
    
    const styles = {
      chatContainer: { width: '400px', margin: '20px auto', border: '1px solid #ccc', borderRadius: '5px', overflow: 'hidden', display: 'flex', flexDirection: 'column' },
      messageList: { flexGrow: 1, padding: '10px', overflowY: 'auto' },
      userMessage: { backgroundColor: '#e0f7fa', padding: '8px', borderRadius: '5px', marginBottom: '5px', alignSelf: 'flex-end', maxWidth: '70%', wordBreak: 'break-word' },
      agentMessage: { backgroundColor: '#f5f5f5', padding: '8px', borderRadius: '5px', marginBottom: '5px', alignSelf: 'flex-start', maxWidth: '70%', wordBreak: 'break-word' },
      inputArea: { padding: '10px', borderTop: '1px solid #eee', display: 'flex' },
      input: { flexGrow: 1, padding: '8px', borderRadius: '3px', border: '1px solid #ddd', marginRight: '10px' },
      button: { padding: '8px 15px', borderRadius: '3px', border: 'none', backgroundColor: '#00bcd4', color: 'white', cursor: 'pointer', fontWeight: 'bold' }, // note: pseudo-selectors like :disabled are not supported in inline styles
    };
    
    export default BankFAQChat;
    

    V. Running the Application

    1. Install Backend Dependencies: pip install Flask flask-session redis flask-cors langchain openai chromadb
    2. Set Up OpenAI API Key: Ensure you have an OpenAI API key and set it as an environment variable named OPENAI_API_KEY.
    3. Prepare Bank FAQ Documents: Create a directory ./bank_faq_docs and place your bank’s FAQ documents (as .txt files) inside.
    4. Run Backend (Initial Data Ingestion): Run the backend script once. It will attempt to create the vector database if it doesn’t exist. Ensure your FAQ documents are in the specified directory.
    5. Ensure Redis is Running: Start your Redis server.
    6. Run the Backend: Execute the backend script.
    7. Run the React Frontend: Follow the detailed instructions below.

    Running the React Frontend

    Here are the instructions to get the React frontend of the Bank FAQ Chat Agent running:
    Navigate to your React project directory in your terminal. If you haven’t created a React project yet, you can do so using Create React App or a similar tool:
    Bash
    npx create-react-app bank-faq-frontend
    cd bank-faq-frontend


    Install Dependencies: If you started with a fresh React project, you’ll need to install any necessary dependencies (though this example uses built-in React features like useState and useEffect). If you have a pre-existing project, ensure you have react and react-dom installed.
    Bash
    npm install  # Or yarn install


    Replace src/App.js (or your main component file): Open the src/App.js file (or the main component where you want to place the chat agent) and replace its entire content with the React code provided in the previous section. You might need to adjust the import path if your component is named differently or located in a different directory. For example, if you save the code in a file named BankFAQChat.js within a components folder, you would import it in App.js like this:
    JavaScript
    import BankFAQChat from './components/BankFAQChat';

    function App() {
      return (
        <div>
          <BankFAQChat />
        </div>
      );
    }

    export default App;


    Start the Development Server: Run the React development server from your terminal within the React project directory:
    Bash
    npm start  # Or yarn start

    This command will typically open your React application in a new tab in your web browser, usually at http://localhost:3000.


    Interact with the Chat Agent: Once the frontend is running, you should see the chat interface. You can type your bank-related questions in the input field and click the “Send” button (or press Enter) to send them to the backend. The agent’s responses and the conversation history will be displayed in the chat window.


    Important Notes for the Frontend:
    Backend URL: Ensure that the fetch calls in the BankFAQChat component (/create_session and /bank_faq/chat) are pointing to the correct URL where your Flask backend is running. If your backend is running on a different host or port than http://localhost:5000, you’ll need to update these URLs accordingly.


    Styling: The provided styles object in the React component offers basic styling. You can customize this further or use a CSS-in-JS library (like Styled Components) or a CSS framework (like Tailwind CSS or Material UI) to enhance the visual appearance of the chat agent.


    Error Handling: The frontend includes basic console.error logging for API request failures. You might want to implement more user-friendly error messages within the UI.


    Session Management: The frontend automatically fetches or creates a session on mount. The sessionId is managed in the component’s state.
    By following these instructions, you should be able to run the React frontend and interact with the Bank FAQ Chat Agent, provided that your Flask backend is also running and correctly configured.

    This setup provides a functional bank FAQ chat agent with personalized history within a session, powered by RAG and an LLM. Remember to replace placeholders and configure API keys and file paths according to your specific environment and data.

  • Intelligent Chat Agent UI with Retrieval-Augmented Generation (RAG) and a Large Language Model (LLM) using Amazon OpenSearch

    In today’s digital age, providing efficient and accurate customer support is paramount. Intelligent chat agents, powered by the latest advancements in Natural Language Processing (NLP), offer a promising avenue for addressing user queries effectively. This comprehensive article will guide you through the process of building a sophisticated Chat Agent UI application that leverages the power of Retrieval-Augmented Generation (RAG) in conjunction with a Large Language Model (LLM), specifically tailored to answer questions based on product manuals stored and indexed using Amazon OpenSearch. We will explore the architecture, key components, and provide a practical implementation spanning from backend development with FastAPI and interaction with OpenSearch and Hugging Face Transformers, to a basic HTML/JavaScript frontend for user interaction.

    I. The Synergy of RAG and LLMs for Product Manual Queries

    Traditional chatbots often rely on predefined scripts or keyword matching, which can be limited in their ability to understand nuanced user queries and extract information from complex documents like product manuals. Retrieval-Augmented Generation offers a significant improvement by enabling the agent to:

    • Understand Natural Language: Leverage the semantic understanding capabilities of embedding models to grasp the intent behind user questions.
    • Retrieve Relevant Information: Search through product manuals stored in Amazon OpenSearch to find the most pertinent sections related to the query.
    • Generate Informed Answers: Utilize a Large Language Model to synthesize the retrieved information into a coherent and helpful natural language response.

    By grounding the LLM’s generation in the specific content of the product manuals, RAG ensures accuracy, reduces the risk of hallucinated information, and provides users with answers directly supported by the official documentation.

    +-------------------------------------+
    | 1. User Input: Question about a     |
    |    specific product manual.          |
    |    (e.g., "How do I troubleshoot    |
    |    the Widget Pro connection?")      |
    |                                     |
    |           Frontend (UI)             |
    |        (HTML/JavaScript)            |
    | +---------------------------------+ |
    | | - Input Field                   | |
    | | - Send Button                   | |
    | +---------------------------------+ |
    |               | (HTTP POST)         |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 2. Backend (API) receives the query |
    |    and the specific product name     |
    |    ("Widget Pro").                   |
    |                                     |
    |           Backend (API)             |
    |        (FastAPI - Python)           |
    | +---------------------------------+ |
    | | - Receives Request              | |
    | | - Generates Query Embedding     | |
    | |   using Hugging Face Embedding  | |
    | |   Model.                        | |
    | +---------------------------------+ |
    |               |                     |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 3. Backend queries Amazon           |
    |    OpenSearch with the product name  |
    |    and the generated query           |
    |    embedding to find relevant       |
    |    document chunks from the          |
    |    "product_manuals" index.          |
    |                                     |
    |   Amazon OpenSearch (Vector Store)  |
    | +---------------------------------+ |
    | | - Stores embedded product manual| |
    | |   chunks.                       | |
    | | - Performs k-NN (k-Nearest       | |
    | |   Neighbors) search based on      | |
    | |   embedding similarity.          | |
    | +---------------------------------+ |
    |               | (Relevant Document Chunks) |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 4. Backend receives the relevant    |
    |    document chunks from             |
    |    OpenSearch.                      |
    |                                     |
    |           Backend (API)             |
    |        (FastAPI - Python)           |
    | +---------------------------------+ |
    | | - Constructs a prompt for the    | |
    | |   Hugging Face LLM, including     | |
    | |   the retrieved context and the    | |
    | |   user's question.               | |
    | +---------------------------------+ |
    |               | (Prompt)            |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 5. Backend sends the prompt to the   |
    |    Hugging Face LLM for answer       |
    |    generation.                      |
    |                                     |
    |        Hugging Face LLM              |
    | +---------------------------------+ |
    | | - Processes the prompt and        | |
    | |   generates a natural language     | |
    | |   answer based on the context.   | |
    | +---------------------------------+ |
    |               | (Generated Answer)   |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 6. Backend receives the generated   |
    |    answer and the context snippets.  |
    |                                     |
    |           Backend (API)             |
    |        (FastAPI - Python)           |
    | +---------------------------------+ |
    | | - Formats the answer and context  | |
    | |   into a JSON response.          | |
    | +---------------------------------+ |
    |               | (HTTP Response)      |
    |               v                     |
    +-------------------------------------+
                   |
                   |
    +-------------------------------------+
    | 7. Frontend receives the JSON        |
    |    response containing the answer    |
    |    and the relevant context          |
    |    snippets.                        |
    |                                     |
    |           Frontend (UI)             |
    |        (HTML/JavaScript)            |
    | +---------------------------------+ |
    | | - Displays the AI's answer in     | |
    | |   the chat window.               | |
    | | - Optionally displays the         | |
    | |   retrieved context for user      | |
    | |   transparency.                  | |
    | +---------------------------------+ |
    +-------------------------------------+
    

    II. System Architecture

    Our intelligent chat agent application will follow a robust multi-tiered architecture:

    1. Frontend (UI): The user-facing interface for submitting queries and viewing responses.
    2. Backend (API): The core logic layer responsible for orchestrating the RAG pipeline, interacting with OpenSearch for retrieval, and calling the LLM for response generation.
    3. Amazon OpenSearch + Hugging Face LLM: The knowledge base (product manuals indexed in OpenSearch as vector embeddings) and the generative intelligence (LLM from Hugging Face Transformers).

    III. Key Components and Implementation Details

    Let’s delve into the implementation of each component:

    1. Backend (FastAPI – chatbot_opensearch_api.py):

    The backend API, built using FastAPI, will handle user requests and coordinate the RAG process.

    Python

    from fastapi import FastAPI, HTTPException
    from pydantic import BaseModel
    import boto3
    import json
    from opensearchpy import OpenSearch, RequestsHttpConnection
    from requests_aws4auth import AWS4Auth
    import os
    from typing import List
    from transformers import AutoTokenizer, AutoModel
    from transformers import AutoModelForSeq2SeqLM
    from fastapi.middleware.cors import CORSMiddleware
    
    # --- Configuration (Consider Environment Variables for Security) ---
    REGION_NAME = os.environ.get("AWS_REGION", "us-east-1")
    OPENSEARCH_DOMAIN_ENDPOINT = os.environ.get("OPENSEARCH_ENDPOINT", "your-opensearch-domain.us-east-1.es.amazonaws.com")
    OPENSEARCH_INDEX_NAME = os.environ.get("OPENSEARCH_INDEX", "product_manuals")
    EMBEDDING_MODEL_NAME = os.environ.get("EMBEDDING_MODEL", "sentence-transformers/all-mpnet-base-v2")
    LLM_MODEL_NAME = os.environ.get("LLM_MODEL", "google/flan-t5-large")
    
    # Initialize AWS credentials (Consider using IAM roles for better security)
    credentials = boto3.Session().get_credentials()
    awsauth = AWS4Auth(credentials.access_key, credentials.secret_key, REGION_NAME, 'es', session_token=credentials.token)
    
    # Initialize OpenSearch client
    os_client = OpenSearch(
        hosts=[{'host': OPENSEARCH_DOMAIN_ENDPOINT, 'port': 443}],
        http_auth=awsauth,
        use_ssl=True,
        verify_certs=True,
        ssl_assert_hostname=False,
        ssl_show_warn=False,
        connection_class=RequestsHttpConnection
    )
    
    # Initialize Hugging Face tokenizer and model for embeddings
    try:
        embedding_tokenizer = AutoTokenizer.from_pretrained(EMBEDDING_MODEL_NAME)
        embedding_model = AutoModel.from_pretrained(EMBEDDING_MODEL_NAME)
    except Exception as e:
        print(f"Error loading embedding model: {e}")
        embedding_tokenizer = None
        embedding_model = None
    
    # Initialize Hugging Face tokenizer and model for LLM
    try:
        llm_tokenizer = AutoTokenizer.from_pretrained(LLM_MODEL_NAME)
        llm_model = AutoModelForSeq2SeqLM.from_pretrained(LLM_MODEL_NAME)  # flan-t5 is an encoder-decoder (seq2seq) model
    except Exception as e:
        print(f"Error loading LLM model: {e}")
        llm_tokenizer = None
        llm_model = None
    
    app = FastAPI(title="Product Manual Chatbot API (OpenSearch - No Bedrock)")
    
    # Add CORS middleware to allow requests from your frontend
    app.add_middleware(
        CORSMiddleware,
        allow_origins=["*"],  # Adjust to your frontend's origin for production
        allow_credentials=True,
        allow_methods=["POST"],
        allow_headers=["*"],
    )
    
    class ChatRequest(BaseModel):
        product_name: str
        user_question: str
    
    class ChatResponse(BaseModel):
        answer: str
        context: List[str] = []
    
    def get_embedding(text, tokenizer, model):
        """Generates an embedding for the given text using Hugging Face Transformers."""
        if tokenizer and model:
            try:
                inputs = tokenizer(text, padding=True, truncation=True, return_tensors="pt")
                outputs = model(**inputs)
                return outputs.last_hidden_state.mean(dim=1).detach().numpy().tolist()[0]
            except Exception as e:
                print(f"Error generating embedding: {e}")
                return None
        return None
    
    def search_opensearch(index_name, product_name, query, tokenizer, embedding_model, k=3):
        """Searches OpenSearch for relevant documents."""
        embedding = get_embedding(query, tokenizer, embedding_model)
        if embedding:
            search_query = {
                "size": k,
                "query": {
                    "bool": {
                        "must": &lsqb;
                            {"match": {"product_name": product_name}}
                        ],
                        "should": &lsqb;
                            {
                                "knn": {
                                    "embedding": {
                                        "vector": embedding,
                                        "k": k
                                    }
                                }
                            },
                            {"match": {"content": query}} # Basic keyword matching as a fallback/boost
                        ]
                    }
                }
            }
            try:
                res = os_client.search(index=index_name, body=search_query)
                hits = res['hits']['hits']
                sources = [hit['_source']['content'] for hit in hits]
                return sources, [hit['_source']['content'][:100] + "..." for hit in hits]  # Return full content and snippets
            except Exception as e:
                print(f"Error searching OpenSearch: {e}")
                return [], []
        return [], []
    
    def generate_answer(prompt, tokenizer, model):
        """Generates an answer using the specified Hugging Face LLM."""
        if tokenizer and model:
            try:
                inputs = tokenizer(prompt, return_tensors="pt")
                outputs = model.generate(**inputs, max_length=500)
                return tokenizer.decode(outputs[0], skip_special_tokens=True)
            except Exception as e:
                print(f"Error generating answer: {e}")
                return "An error occurred while generating the answer."
        return "LLM model not loaded."
    
    @app.post("/chat/", response_model=ChatResponse)
    async def chat_with_manual(request: ChatRequest):
        """Endpoint for querying the product manuals."""
        context_snippets, context_display = search_opensearch(OPENSEARCH_INDEX_NAME, request.product_name, request.user_question, embedding_tokenizer, embedding_model)
    
        if context_snippets:
            context = "\n\n".join(context_snippets)
            prompt = f"""You are a helpful chatbot assistant for product manuals related to the product '{request.product_name}'. Use the following information from the manuals to answer the user's question. If the information doesn't directly answer the question, try to infer or provide related helpful information. Do not make up information.
    
            <context>
            {context}
            </context>
    
            User Question: {request.user_question}
            """
            answer = generate_answer(prompt, llm_tokenizer, llm_model)
            return {"answer": answer, "context": context_display}
        else:
            raise HTTPException(status_code=404, detail="No relevant information found in the product manuals for that product.")
    
    if __name__ == "__main__":
        import uvicorn
        uvicorn.run(app, host="0.0.0.0", port=8000)
    

    2. Frontend (frontend/templates/index.html and frontend/static/style.css):

    frontend/templates/index.html

    <!DOCTYPE html>
    <html>
    <head>
        <title>Chat Agent</title>
        <link rel="stylesheet" type="text/css" href="{{ url_for('static', path='style.css') }}">
    </head>
    <body>
        <div class="chat-container">
            <div class="chat-history" id="chat-history">
                <div class="bot-message">Welcome! Ask me anything.</div>
            </div>
            <div class="chat-input">
                <form id="chat-form">
                    <input type="text" id="product-input" placeholder="Product name...">
                    <input type="text" id="user-input" placeholder="Type your message...">
                    <button type="submit">Send</button>
                </form>
            </div>
            <div class="context-display" id="context-display">
                <strong>Retrieved Context:</strong>
                <ul id="context-list"></ul>
            </div>
        </div>
    
        <script>
            const chatForm = document.getElementById('chat-form');
            const productInput = document.getElementById('product-input');
            const userInput = document.getElementById('user-input');
            const chatHistory = document.getElementById('chat-history');
            const contextDisplay = document.getElementById('context-display');
            const contextList = document.getElementById('context-list');
    
            chatForm.addEventListener('submit', async (event) => {
                event.preventDefault();
                const message = userInput.value.trim();
                const productName = productInput.value.trim();
                if (message) {
                    appendMessage('user', message);
                    userInput.value = '';

                    // Send a JSON body matching the backend's ChatRequest model
                    // (product_name and user_question); the response contains answer and context.
                    const response = await fetch('/chat/', {
                        method: 'POST',
                        headers: {
                            'Content-Type': 'application/json',
                        },
                        body: JSON.stringify({ product_name: productName, user_question: message }),
                    });

                    if (response.ok) {
                        const data = await response.json();
                        appendMessage('bot', data.answer);
                        displayContext(data.context);
                    } else {
                        appendMessage('bot', 'Error processing your request.');
                    }
                }
            });
    
            function appendMessage(sender, text) {
                const messageDiv = document.createElement('div');
                messageDiv.classList.add(`${sender}-message`);
                messageDiv.textContent = text;
                chatHistory.appendChild(messageDiv);
                chatHistory.scrollTop = chatHistory.scrollHeight; // Scroll to bottom
            }
    
            function displayContext(context) {
                contextList.innerHTML = ''; // Clear previous context
                if (context && context.length > 0) {
                    contextDisplay.style.display = 'block';
                    context.forEach(doc => {
                        const listItem = document.createElement('li');
                        listItem.textContent = doc;
                        contextList.appendChild(listItem);
                    });
                } else {
                    contextDisplay.style.display = 'none';
                }
            }
        </script>
    </body>
    </html>

    frontend/static/style.css

    body {
        font-family: sans-serif;
        margin: 20px;
        background-color: #f4f4f4;
    }
    
    .chat-container {
        max-width: 600px;
        margin: 0 auto;
        background-color: #fff;
        border-radius: 8px;
        box-shadow: 0 2px 5px rgba(0, 0, 0, 0.1);
        padding: 20px;
    }
    
    .chat-history {
        height: 300px;
        overflow-y: auto;
        padding: 10px;
        margin-bottom: 10px;
        border: 1px solid #ddd;
        border-radius: 4px;
        background-color: #eee;
    }
    
    .user-message {
        background-color: #e2f7cb;
        color: #333;
        padding: 8px 12px;
        border-radius: 6px;
        margin-bottom: 8px;
        align-self: flex-end;
        width: fit-content;
        max-width: 80%;
    }
    
    .bot-message {
        background-color: #f0f0f0;
        color: #333;
        padding: 8px 12px;
        border-radius: 6px;
        margin-bottom: 8px;
        width: fit-content;
        max-width: 80%;
    }
    
    .chat-input {
        display: flex;
    }
    
    .chat-input input[type="text"] {
        flex-grow: 1;
        padding: 10px;
        border: 1px solid #ccc;
        border-radius: 4px 0 0 4px;
    }
    
    .chat-input button {
        padding: 10px 15px;
        border: none;
        background-color: #007bff;
        color: white;
        border-radius: 0 4px 4px 0;
        cursor: pointer;
    }
    
    .context-display {
        margin-top: 20px;
        padding: 10px;
        border: 1px solid #ddd;
        border-radius: 4px;
        background-color: #f9f9f9;
        display: none; /* Hidden by default */
    }
    
    .context-display ul {
        list-style-type: none;
        padding: 0;
    }
    
    .context-display li {
        margin-bottom: 5px;
    }

    3. Knowledge Base and Vector Database (Amazon OpenSearch):

    Before running the chat agent, you need to ingest your product manuals into Amazon OpenSearch. This involves the following steps, typically performed by an ingestion script (ingestion_opensearch.py):

    • Extract Text from Manuals: Read PDF files from a source (e.g., Amazon S3) and extract their text content.
    • Chunk the Text: Divide the extracted text into smaller, manageable chunks.
    • Generate Embeddings: Use the same embedding model (sentence-transformers/all-mpnet-base-v2 in our example) to generate vector embeddings for each text chunk.
    • Index into OpenSearch: Create an OpenSearch index with a knn_vector field and index each text chunk along with its embedding and associated metadata (e.g., product name).

    (The accompanying ingestion_opensearch.py script details this process; a minimal index-creation sketch follows.)
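
    To make the indexing step concrete, here is a minimal sketch of creating an OpenSearch index with a knn_vector field using the opensearch-py client; the endpoint, credentials, index name, and 768-dimension setting are assumptions that must match your own domain and embedding model (768 is the output size of all-mpnet-base-v2).

    Python

    from opensearchpy import OpenSearch

    # Assumed endpoint and credentials -- replace with your OpenSearch domain details.
    os_client = OpenSearch(
        hosts=[{"host": "your-opensearch-domain.us-east-1.es.amazonaws.com", "port": 443}],
        http_auth=("user", "password"),
        use_ssl=True,
        verify_certs=True,
    )

    index_body = {
        "settings": {"index": {"knn": True}},  # enable k-NN search on this index
        "mappings": {
            "properties": {
                "product_name": {"type": "keyword"},
                "content": {"type": "text"},
                "embedding": {"type": "knn_vector", "dimension": 768},  # must match the embedding model
            }
        },
    }

    if not os_client.indices.exists(index="product-manuals"):
        os_client.indices.create(index="product-manuals", body=index_body)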

    4. LLM (Hugging Face Transformers):

    The backend API utilizes a pre-trained LLM (google/flan-t5-large in the example) from Hugging Face Transformers to generate the final answer based on the retrieved context and the user’s question.

    IV. Running the Complete Application:

    1. Set up AWS and OpenSearch: Ensure you have an AWS account and an Amazon OpenSearch domain configured.
    2. Upload Manuals to S3: Place your product manual PDF files in an S3 bucket.
    3. Run Ingestion Script: Execute the ingestion_opensearch.py script (after configuring the AWS credentials, S3 bucket name, and OpenSearch endpoint) to process your manuals and index them into OpenSearch.
    4. Save Frontend Files: Create the frontend folder with the static/style.css and templates/index.html files.
    5. Install Backend Dependencies: Navigate to the directory containing chatbot_opensearch_api.py and install the required Python libraries:
      Bash
      pip install fastapi uvicorn boto3 opensearch-py requests-aws4auth transformers
    6. Run Backend API: Execute the FastAPI application:
      Bash
      python chatbot_opensearch_api.py
      The API will typically start at http://localhost:8000.
    7. Open Frontend: Open your web browser and navigate to http://localhost:8000. You should see the chat interface. Enter the product name and your question, and the application will query OpenSearch, retrieve relevant information, and generate an answer.

    V. Conclusion and Future Enhancements:

    This comprehensive guide has outlined the architecture and implementation of an intelligent Chat Agent UI application specifically designed to answer questions based on product manuals using the powerful combination of RAG, Amazon OpenSearch, and open-source LLMs from Hugging Face Transformers. By leveraging semantic search over indexed product manual content and employing a language model for natural language generation, this approach provides a robust and scalable solution for enhancing customer support and user experience.

    To further enhance this application, consider implementing the following:

    • More Sophisticated Chunking Strategies: Explore advanced techniques for splitting documents to improve retrieval relevance.
    • Metadata Filtering in OpenSearch: Allow users to filter searches by specific manual sections or product versions.
    • Improved Prompt Engineering: Experiment with different prompt structures to optimize the LLM’s answer quality and style.
    • User Feedback Mechanism: Integrate a way for users to provide feedback on the AI’s responses to facilitate continuous improvement.
    • More Advanced UI Features: Enhance the user interface with features like conversation history persistence, different response formats, and clearer display of retrieved context.
    • Integration with User Authentication: Secure the application and potentially personalize the experience based on user roles or product ownership.
    • Handling of Different Document Formats: Extend the ingestion pipeline to support other document types beyond PDF.

    By continuously refining these aspects, you can build a highly effective and user-friendly chat agent that significantly improves access to information within your product manuals.

  • Loading manuals into a vector database


    Here’s a breakdown of how to load manuals into a vector database, focusing on the key steps and considerations:

    1. Choose a Vector Database:

    Several vector databases are available, each with its own strengths and weaknesses. Some popular options include:

    • Pinecone: A fully managed, cloud-native vector database known for its ease of use and scalability.
    • Weaviate: An open-source, cloud-native vector database with built-in machine learning modules.
    • Milvus: An open-source vector database designed for large-scale vector data management.
    • Qdrant: A vector similarity search engine and database with a focus on ease of use and production readiness.
    • Chroma: An open-source vector database specifically designed for building AI applications.
    • pgvector: An open-source extension for PostgreSQL that adds a vector data type and indexing.

    Consider factors like scalability, ease of use, cost, integration with your existing stack, and specific features when making your choice.

    2. Extract Text from Manuals:

    Most manuals are in PDF format. You’ll need to extract the text content from these files. Python libraries like PyPDF2, pdfminer.six, or unstructured can be used for this purpose. Be mindful of complex layouts, tables, and images, which might require more sophisticated extraction techniques.

    3. Chunk the Text:

    Large documents like manuals need to be split into smaller, manageable chunks. This is crucial for several reasons:

    • LLM Context Window Limits: Language models have limitations on the amount of text they can process at once.
    • Relevance: Smaller chunks are more likely to contain focused and relevant information for a given query.
    • Vector Embeddings: Generating embeddings for very long sequences can be less effective.

    Common chunking strategies include:

    • Fixed-size chunking: Splitting text into chunks of a predefined number of tokens or characters. Overlapping chunks can help preserve context across boundaries.
    • Sentence-based chunking: Splitting text at sentence boundaries.
    • Paragraph-based chunking: Splitting text at paragraph breaks.
    • Semantic chunking: Using NLP techniques to identify semantically meaningful units.
    • Content-aware chunking: Tailoring chunking strategies based on the document structure (e.g., splitting by headings, subheadings).

    The optimal chunk size and strategy often depend on the specific characteristics of your manuals and the capabilities of your chosen embedding model and LLM. Experimentation is key.
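
    As a small illustration of one of these strategies, here is a minimal sketch of sentence-based chunking with a character budget and a one-sentence overlap; the regex sentence splitter and the 1,000-character budget are simplifying assumptions.

    Python

    import re

    def sentence_chunks(text, max_chars=1000, overlap_sentences=1):
        """Group sentences into chunks of at most max_chars, carrying a little overlap across boundaries."""
        # Naive splitter on end-of-sentence punctuation; real manuals may need a proper sentence tokenizer.
        sentences = re.split(r'(?<=[.!?])\s+', text)
        chunks, current = [], []
        for sentence in sentences:
            if current and len(" ".join(current)) + len(sentence) + 1 > max_chars:
                chunks.append(" ".join(current))
                current = current[-overlap_sentences:]  # keep the last sentence(s) for context
            current.append(sentence)
        if current:
            chunks.append(" ".join(current))
        return chunks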

    4. Generate Vector Embeddings:

    Once you have your text chunks, you need to convert them into vector embeddings. These embeddings are numerical representations of the semantic meaning of the text. You can use various embedding models for this, such as:

    • Sentence Transformers: Pre-trained models that produce high-quality sentence and paragraph embeddings.
    • OpenAI Embeddings API: Provides access to powerful embedding models.
    • Hugging Face Transformers: Offers a wide range of pre-trained models that you can use.

    Choose an embedding model that aligns with your desired level of semantic understanding and the language of your manuals.
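
    To make the idea concrete, the short sketch below (assuming the sentence-transformers library is installed) embeds two paraphrased chunks and compares them with cosine similarity; semantically similar texts score close to 1.0.

    Python

    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("sentence-transformers/all-mpnet-base-v2")

    chunk_a = "Press and hold the reset button for five seconds to restore factory settings."
    chunk_b = "Holding reset for a few seconds returns the device to its factory defaults."

    emb_a, emb_b = model.encode([chunk_a, chunk_b])  # each embedding is a 768-dimensional vector
    print(util.cos_sim(emb_a, emb_b))                # high score indicates similar meaning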

    5. Load Embeddings and Text into the Vector Database:

    Finally, you’ll load the generated vector embeddings along with the corresponding text chunks and any relevant metadata (e.g., manual name, page number, chunk number) into your chosen vector database. Each record in the database will typically contain:

    • Vector Embedding: The numerical representation of the text chunk.
    • Text Chunk: The original text segment.
    • Metadata: Additional information to help with filtering and context.

    Most vector databases offer client libraries (e.g., Python clients) that simplify the process of connecting to the database and inserting data. You’ll iterate through your processed manual chunks, generate embeddings, and then use the database’s API to add each embedding, text, and its associated metadata as a new entry.

    Example Workflow (Conceptual – Python with Pinecone and Sentence Transformers):

    Python

    from PyPDF2 import PdfReader
    from sentence_transformers import SentenceTransformer
    import pinecone
    
    # --- Configuration ---
    PDF_PATH = "path/to/your/manual.pdf"
    PINECONE_API_KEY = "YOUR_PINECONE_API_KEY"
    PINECONE_ENVIRONMENT = "YOUR_PINECONE_ENVIRONMENT"
    PINECONE_INDEX_NAME = "manual-index"
    EMBEDDING_MODEL_NAME = "sentence-transformers/all-mpnet-base-v2"
    CHUNK_SIZE = 512
    CHUNK_OVERLAP = 100
    
    # --- Initialize Pinecone and Embedding Model ---
    pinecone.init(api_key=PINECONE_API_KEY, environment=PINECONE_ENVIRONMENT)
    if PINECONE_INDEX_NAME not in pinecone.list_indexes():
        pinecone.create_index(PINECONE_INDEX_NAME, dimension=768) # Adjust dimension
    index = pinecone.Index(PINECONE_INDEX_NAME)
    embedding_model = SentenceTransformer(EMBEDDING_MODEL_NAME)
    
    # --- Function to Extract Text from PDF ---
    def extract_text_from_pdf(pdf_path):
        text = ""
        with open(pdf_path, 'rb') as file:
            pdf_reader = PdfReader(file)
            for page in pdf_reader.pages:
                text += page.extract_text()
        return text
    
    # --- Function to Chunk Text ---
    def chunk_text(text, chunk_size=CHUNK_SIZE, chunk_overlap=CHUNK_OVERLAP):
        chunks = []
        start = 0
        while start < len(text):
            end = min(start + chunk_size, len(text))
            chunk = text[start:end]
            chunks.append(chunk)
            start += chunk_size - chunk_overlap
        return chunks
    
    # --- Main Processing ---
    text = extract_text_from_pdf(PDF_PATH)
    chunks = chunk_text(text)
    embeddings = embedding_model.encode(chunks)
    
    # --- Load into Vector Database ---
    batch_size = 100
    for i in range(0, len(chunks), batch_size):
        i_end = min(len(chunks), i + batch_size)
        batch_chunks = chunks[i:i_end]
        batch_embeddings = embeddings[i:i_end]
        metadata = [{"text": chunk, "manual": "your_manual_name", "chunk_id": f"{i+j}"} for j, chunk in enumerate(batch_chunks)]
        # Pinecone expects (id, values, metadata) tuples with string IDs and plain float lists.
        vectors = [(str(idx), emb.tolist(), meta) for idx, emb, meta in zip(range(i, i_end), batch_embeddings, metadata)]
        index.upsert(vectors=vectors)
    
    print(f"Successfully loaded {len(chunks)} chunks into Pinecone.")
    

    Remember to replace the placeholder values with your actual API keys, environment details, file paths, and adjust chunking parameters and metadata as needed. You’ll also need to adapt this code to the specific client library of the vector database you choose.

  • Building a Product Manual Chatbot with Amazon OpenSearch and Open-Source LLMs

    This article guides you through building an intelligent chatbot that can answer questions based on your product manuals, leveraging the power of Amazon OpenSearch for semantic search and open-source Large Language Models (LLMs) for generating informative responses. This approach provides a cost-effective and customizable solution without relying on Amazon Bedrock.

    The Challenge:

    Navigating through lengthy product manuals can be time-consuming and frustrating for users. A chatbot that understands natural language queries and retrieves relevant information directly from these manuals can significantly improve user experience and support efficiency.

    Our Solution: OpenSearch and Open-Source LLMs

    This article demonstrates how to build such a chatbot using the following key components:

    1. Amazon OpenSearch Service: A scalable search and analytics service that we’ll use as a vector database to store document embeddings and perform semantic search.
    2. Hugging Face Transformers: A powerful library providing access to thousands of pre-trained language models, including those for generating text embeddings.
    3. Open-Source Large Language Model (LLM): We’ll outline how to integrate with an open-source LLM (running locally or via an API) to generate answers based on the retrieved information.
    4. FastAPI: A modern, high-performance web framework for building the chatbot API.
    5. AWS SDK for Python (Boto3): Used for interacting with Amazon S3 (where product manuals are stored) and OpenSearch.

    Architecture:

    The architecture consists of two main parts:

    1. Ingestion Pipeline:
    • Product manuals (in PDF format) are stored in an Amazon S3 bucket.
    • A Python script (ingestion_opensearch.py) extracts text content from these PDFs.
    • It uses a Hugging Face Transformer model to generate vector embeddings for the extracted text.
    • The text content, associated product name, and the generated embeddings are indexed into an Amazon OpenSearch cluster.
    2. Chatbot API:
    • A FastAPI application (chatbot_opensearch_api.py) exposes a /chat/ endpoint.
    • When a user sends a question (along with the product name), the API:
    • Uses the same Hugging Face Transformer model to generate an embedding for the user’s query.
    • Queries the Amazon OpenSearch index to find the most semantically similar document snippets for the given product.
    • Constructs a prompt containing the retrieved context and the user’s question.
    • Sends this prompt to an open-source LLM (you’ll need to integrate your chosen LLM here).
    • Returns the LLM’s generated answer to the user.

    Step-by-Step Implementation:

    1. Prerequisites:

    • AWS Account: You need an active AWS account.
    • Amazon OpenSearch Cluster: Set up an Amazon OpenSearch domain.
    • Amazon S3 Bucket: Create an S3 bucket and upload your product manuals (in PDF format) into it.
    • Python Environment: Ensure you have Python 3.6 or later installed, along with pip.
    • Install Necessary Libraries:
      Bash
      pip install fastapi uvicorn boto3 opensearch-py requests-aws4auth transformers PyPDF2 # Or your preferred PDF library

    2. Ingestion Script (ingestion_opensearch.py):

    Python

    # (See the accompanying ingestion_opensearch.py script for the full code.)

    Key points in the ingestion script:

    • OpenSearch Client Initialization: Configured to connect to your OpenSearch domain. Remember to replace the placeholder endpoint.
    • Hugging Face Model Loading: Loads a pre-trained sentence transformer model for generating embeddings.
    • OpenSearch Index Creation: Creates an index with a knn_vector field to store embeddings. The dimension of the vector field is determined by the chosen embedding model.
    • PDF Text Extraction: You need to implement the actual PDF parsing logic using a library like PyPDF2 or pdfminer.six within the ingest_pdfs_from_s3 function. The provided code has a placeholder; a minimal sketch follows this list.
    • Embedding Generation: Uses the Hugging Face model to create embeddings for the extracted text.
    • Indexing into OpenSearch: Stores the product name, content, and embedding in the OpenSearch index.
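
    As one possible way to fill in the PDF extraction placeholder, here is a minimal sketch that downloads a PDF object from S3 and extracts its text with PyPDF2; the bucket name, key, and the exact integration point inside ingest_pdfs_from_s3 are assumptions.

    Python

    import io

    import boto3
    from PyPDF2 import PdfReader

    s3_client = boto3.client("s3")

    def extract_pdf_text_from_s3(bucket, key):
        """Download a PDF from S3 and return its extracted text."""
        pdf_bytes = s3_client.get_object(Bucket=bucket, Key=key)["Body"].read()
        reader = PdfReader(io.BytesIO(pdf_bytes))
        return "\n".join(page.extract_text() or "" for page in reader.pages)

    # Example usage (placeholder bucket and key):
    # text = extract_pdf_text_from_s3("your-manuals-bucket", "manuals/product_manual.pdf")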

    3. Chatbot API (chatbot_opensearch_api.py):

    Key points in the API script:

    • OpenSearch Client Initialization: Configured to connect to your OpenSearch domain. Remember to replace the placeholder endpoint.
    • Hugging Face Model Loading: Loads the same embedding model as the ingestion script for generating query embeddings.
    • search_opensearch Function:
    • Generates an embedding for the user’s question.
    • Constructs an OpenSearch query that combines keyword matching (on product name and content) with a k-nearest neighbors (KNN) search on the embeddings to find semantically similar documents.
    • generate_answer Function: This is a placeholder. You need to integrate your chosen open-source LLM here. This could involve:
    • Running an LLM locally using Hugging Face Transformers (requires significant computational resources).
    • Using an API for an open-source LLM hosted elsewhere.
    • API Endpoint (/chat/): Retrieves relevant context from OpenSearch and then uses the generate_answer function to respond to the user’s query.

    4. Running the Application:

    1. Run the Ingestion Script: Execute python ingestion_opensearch.py to process your product manuals and index them into OpenSearch.
    2. Run the Chatbot API: Execute python chatbot_opensearch_api.py, or start the server directly with uvicorn:
      Bash
      uvicorn chatbot_opensearch_api:app --reload
      The API will be accessible at http://localhost:8000.

    5. Interacting with the Chatbot API:

    You can send POST requests to the /chat/ endpoint with the product_name and user_question in the JSON body. For example, using curl:
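
    (The product name and question below are placeholder values, and the API is assumed to be running locally on port 8000.)

    Bash

    curl -X POST http://localhost:8000/chat/ \
      -H "Content-Type: application/json" \
      -d '{"product_name": "WidgetPro", "user_question": "How do I reset the device to factory settings?"}'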


    Integrating an Open-Source LLM (Placeholder):

    The most crucial part to customize is the generate_answer function in chatbot_opensearch_api.py. Here are some potential approaches:

    • Hugging Face Transformers for Local LLM:
      Python
      from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

      # flan-t5 is an encoder-decoder (seq2seq) model, so use AutoModelForSeq2SeqLM rather than AutoModelForCausalLM.
      llm_model_name = "google/flan-t5-large"  # Example open-source LLM
      llm_tokenizer = AutoTokenizer.from_pretrained(llm_model_name)
      llm_model = AutoModelForSeq2SeqLM.from_pretrained(llm_model_name)

      def generate_answer(prompt):
          inputs = llm_tokenizer(prompt, return_tensors="pt")
          outputs = llm_model.generate(**inputs, max_length=500)
          return llm_tokenizer.decode(outputs[0], skip_special_tokens=True)

      Note: Running large LLMs locally can be very demanding on your hardware (CPU/GPU, RAM).
    • API for Hosted Open-Source LLMs: Explore services that provide APIs for open-source LLMs. You would make HTTP requests to their endpoints within the generate_answer function.
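
    As a sketch of the hosted-API approach, the snippet below calls the Hugging Face Inference API with the requests library; the model name, endpoint pattern, HF_API_TOKEN environment variable, and response shape are assumptions to adapt to whichever hosting service you choose.

    Python

    import os

    import requests

    HF_MODEL = "google/flan-t5-large"  # assumed hosted model
    HF_API_URL = f"https://api-inference.huggingface.co/models/{HF_MODEL}"
    HF_HEADERS = {"Authorization": f"Bearer {os.environ['HF_API_TOKEN']}"}  # assumed token variable

    def generate_answer(prompt):
        """Send the prompt to the hosted model and return the generated text."""
        response = requests.post(HF_API_URL, headers=HF_HEADERS, json={"inputs": prompt}, timeout=60)
        response.raise_for_status()
        return response.json()[0]["generated_text"]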

    Conclusion:

    Building a product manual chatbot with Amazon OpenSearch and open-source LLMs offers a powerful and flexible alternative to managed platforms. By leveraging OpenSearch for efficient semantic search and integrating with the growing ecosystem of open-source LLMs, you can create an intelligent and cost-effective solution to enhance user support and accessibility to your product documentation. Remember to carefully choose and integrate an LLM that meets your performance and resource constraints.

  • Integrating Documentum with an Amazon Bedrock Chatbot API for Product Manuals

    This article outlines the process of building a product manual chatbot API using Amazon Bedrock, with a specific focus on integrating content sourced from a Documentum repository. By leveraging the power of vector embeddings and Large Language Models (LLMs) within Bedrock, we can create an intelligent and accessible way for users to find information within extensive product documentation managed by Documentum.

    The Need for Integration:

    Many organizations manage their critical product documentation within enterprise content management systems like Documentum. To make this valuable information readily available to users through modern conversational interfaces, a seamless integration with AI-powered platforms like Amazon Bedrock is essential. This allows users to ask natural language questions and receive accurate, contextually relevant answers derived from the product manuals.

    Architecture Overview:

    The proposed architecture involves the following key components:

    1. Documentum Repository: The central content management system storing the product manuals.
    2. Document Extraction Service: A custom-built service responsible for accessing Documentum, retrieving relevant product manuals and their content, and potentially extracting associated metadata.
    3. Amazon S3: An object storage service used as an intermediary staging area for the extracted documents. Bedrock’s Knowledge Base can directly ingest data from S3.
    4. Amazon Bedrock Knowledge Base: A managed service that ingests and processes the documents from S3, creates vector embeddings, and enables efficient semantic search.
    5. Chatbot API (FastAPI): A Python-based API built using FastAPI, providing endpoints for users to query the product manuals. This API interacts with the Bedrock Knowledge Base for retrieval and an LLM for answer generation.
    6. Bedrock LLM: A Large Language Model (e.g., Anthropic Claude) within Amazon Bedrock used to generate human-like answers based on the retrieved context.

    Step-by-Step Implementation:

    1. Documentum Extraction Service:

    This is a crucial custom component. The implementation will depend on your Documentum environment and preferred programming language.

    • Accessing Documentum: Utilize the Documentum Content Server API (DFC) or the Documentum REST API to establish a connection. This will involve handling authentication and session management.
    • Document Retrieval: Implement logic to query and retrieve the specific product manuals intended for the chatbot. You might filter based on document types, metadata (e.g., product name, version), or other relevant criteria.
    • Content Extraction: Extract the actual textual content from the retrieved documents. This might involve handling various file formats (PDF, DOCX, etc.) and ensuring clean text extraction.
    • Metadata Extraction (Optional): Extract relevant metadata associated with the documents. While Bedrock primarily uses content for embeddings, this metadata could be useful for future enhancements or filtering within the extraction process.
    • Data Preparation: Structure the extracted content and potentially metadata. You can save each document as a separate file or create structured JSON files.
    • Uploading to S3: Use the AWS SDK for Python (boto3) to upload the prepared files to a designated S3 bucket in your AWS account. Organize the files logically within the bucket (e.g., by product).

    Conceptual Python Snippet (Illustrative – Replace with actual Documentum interaction):

    Python

    import os

    import boto3
    # Assuming you have a library or logic to interact with Documentum
    
    # AWS Configuration
    REGION_NAME = "us-east-1"
    S3_BUCKET_NAME = "your-bedrock-ingestion-bucket"
    s3_client = boto3.client('s3', region_name=REGION_NAME)
    
    def extract_and_upload_document(documentum_document_id, s3_prefix="documentum/"):
        """
        Conceptual function to extract content from Documentum and upload to S3.
        Replace with your actual Documentum interaction.
        """
        # --- Replace this with your actual Documentum API calls ---
        content = f"Content of Document {documentum_document_id} from Documentum."
        filename = f"{documentum_document_id}.txt"
        # --- End of Documentum interaction ---
    
        s3_key = os.path.join(s3_prefix, filename)
        try:
            s3_client.put_object(Bucket=S3_BUCKET_NAME, Key=s3_key, Body=content.encode('utf-8'))
            print(f"Uploaded {filename} to s3://{S3_BUCKET_NAME}/{s3_key}")
            return True
        except Exception as e:
            print(f"Error uploading {filename} to S3: {e}")
            return False
    
    if __name__ == "__main__":
        documentum_ids_to_ingest = ["product_manual_123", "installation_guide_456"]
        for doc_id in documentum_ids_to_ingest:
            extract_and_upload_document(doc_id)
    

    2. Amazon S3 Configuration:

    Ensure you have an S3 bucket created in your AWS account where the Documentum extraction service will upload the product manuals.

    3. Amazon Bedrock Knowledge Base Setup:

    • Navigate to the Amazon Bedrock service in the AWS Management Console.
    • Create a new Knowledge Base.
    • When configuring the data source, select “Amazon S3” as the source type.
    • Specify the S3 bucket and the prefix (e.g., documentum/) where the Documentum extraction service uploads the files.
    • Configure the synchronization settings for the data source. You can choose on-demand synchronization or set up a schedule for periodic updates (a programmatic example follows this list).
    • Bedrock will then process the documents in the S3 bucket, chunk them, generate vector embeddings, and build an index for efficient retrieval.
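
    If you prefer to trigger synchronization programmatically rather than from the console, a minimal sketch with boto3’s bedrock-agent client might look like the following; the knowledge base and data source IDs are placeholders.

    Python

    import boto3

    bedrock_agent = boto3.client("bedrock-agent", region_name="us-east-1")

    # Placeholder IDs -- copy these from your Knowledge Base and its S3 data source in the console.
    response = bedrock_agent.start_ingestion_job(
        knowledgeBaseId="kb-your-knowledge-base-id",
        dataSourceId="ds-your-data-source-id",
    )
    print(response["ingestionJob"]["status"])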

    4. Chatbot API (FastAPI):

    Create a Python-based API using FastAPI to handle user queries and interact with the Bedrock Knowledge Base.

    Python

    # chatbot_api.py
    
    from fastapi import FastAPI, HTTPException
    from pydantic import BaseModel
    import boto3
    import json
    import os
    
    # Configuration
    REGION_NAME = "us-east-1"  # Replace with your AWS region
    KNOWLEDGE_BASE_ID = "kb-your-knowledge-base-id"  # Replace with your Knowledge Base ID
    LLM_MODEL_ID = "anthropic.claude-v2"  # Replace with your desired Bedrock model ID (the prompt format below targets Claude's text-completions API)
    
    bedrock_runtime = boto3.client("bedrock-runtime", region_name=REGION_NAME)
    bedrock_knowledge = boto3.client("bedrock-agent-runtime", region_name=REGION_NAME)
    
    app = FastAPI(title="Product Manual Chatbot API")
    
    class ChatRequest(BaseModel):
        product_name: str  # Optional: If you have product-specific manuals
        user_question: str
    
    class ChatResponse(BaseModel):
        answer: str
    
    def retrieve_pdf_context(knowledge_base_id, product_name, user_question, max_results=3):
        """Retrieves relevant document snippets from the Knowledge Base."""
        query = user_question # The Knowledge Base handles semantic search across all ingested data
        if product_name:
            query = f"Information about {product_name} related to: {user_question}"
    
        try:
            response = bedrock_knowledge.retrieve(
                knowledgeBaseId=knowledge_base_id,
                retrievalQuery={"text": query},
                retrievalConfiguration={
                    "vectorSearchConfiguration": {
                        "numberOfResults": max_results
                    }
                }
            )
            results = response.get("retrievalResults", [])
            if results:
                context_texts = [result.get("content", {}).get("text", "") for result in results]
                return "\n\n".join(context_texts)
            else:
                return None
        except Exception as e:
            print(f"Error during retrieval: {e}")
            raise HTTPException(status_code=500, detail="Error retrieving context")
    
    def generate_answer(prompt, model_id=LLM_MODEL_ID):
        """Generates an answer using the specified Bedrock LLM."""
        try:
            if model_id.startswith("anthropic"):
                # Claude's text-completions API expects the Human/Assistant prompt format.
                body = json.dumps({"prompt": f"\n\nHuman: {prompt}\n\nAssistant:", "max_tokens_to_sample": 500, "temperature": 0.6, "top_p": 0.9})
                mime_type = "application/json"
            elif model_id.startswith("ai21"):
                body = json.dumps({"prompt": prompt, "maxTokens": 300, "temperature": 0.7, "topP": 1})
                mime_type = "application/json"
            elif model_id.startswith("cohere"):
                body = json.dumps({"prompt": prompt, "max_tokens": 300, "temperature": 0.7, "p": 0.7})
                mime_type = "application/json"
            else:
                raise HTTPException(status_code=400, detail=f"Model ID '{model_id}' not supported")
    
            response = bedrock_runtime.invoke_model(body=body, modelId=model_id, accept=mime_type, contentType=mime_type)
            response_body = json.loads(response.get("body").read())
    
            if model_id.startswith("anthropic"):
                return response_body.get("completion").strip()
            elif model_id.startswith("ai21"):
                return response_body.get("completions")&lsqb;0].get("data").get("text").strip()
            elif model_id.startswith("cohere"):
                return response_body.get("generations")&lsqb;0].get("text").strip()
            else:
                return None
    
        except Exception as e:
            print(f"Error generating answer with model '{model_id}': {e}")
            raise HTTPException(status_code=500, detail=f"Error generating answer with LLM")
    
    @app.post("/chat/", response_model=ChatResponse)
    async def chat_with_manual(request: ChatRequest):
        """Endpoint for querying the product manuals."""
        context = retrieve_pdf_context(KNOWLEDGE_BASE_ID, request.product_name, request.user_question)
    
        if context:
            prompt = f"""You are a helpful chatbot assistant for product manuals. Use the following information to answer the user's question. If the information doesn't directly answer, try to infer or provide related helpful information. Do not make up information.
    
            <context>
            {context}
            </context>
    
            User Question: {request.user_question}
            """
            answer = generate_answer(prompt)
            if answer:
                return {"answer": answer}
            else:
                raise HTTPException(status_code=500, detail="Could not generate an answer")
        else:
            raise HTTPException(status_code=404, detail="No relevant information found")
    
    if __name__ == "__main__":
        import uvicorn
        uvicorn.run(app, host="0.0.0.0", port=8000)
    

    5. Bedrock LLM for Answer Generation:

    The generate_answer function in the API interacts with a chosen LLM within Bedrock (e.g., Anthropic Claude) to formulate a response based on the retrieved context from the Knowledge Base and the user’s question.

    Deployment and Scheduling:

    • Document Extraction Service: This service can be deployed as a scheduled job (e.g., using AWS Lambda and CloudWatch Events) to periodically synchronize content from Documentum to S3, ensuring the Knowledge Base stays up-to-date (a minimal handler sketch follows this list).
    • Chatbot API: The FastAPI application can be deployed on various platforms like AWS ECS, AWS Lambda with API Gateway, or EC2 instances.
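
    As a rough sketch of the scheduled-job option, a Lambda handler could simply wrap the extraction function from the earlier snippet and be invoked by an EventBridge (CloudWatch Events) rule; the module name and document ID list below are assumptions.

    Python

    # lambda_documentum_sync.py (hypothetical module name)
    from documentum_extraction import extract_and_upload_document  # assumed import of the function shown earlier

    DOCUMENT_IDS = ["product_manual_123", "installation_guide_456"]  # assumed Documentum document IDs

    def lambda_handler(event, context):
        """Runs on a schedule; re-extracts each manual and uploads it to S3 so Bedrock can re-ingest it."""
        uploaded = [doc_id for doc_id in DOCUMENT_IDS if extract_and_upload_document(doc_id)]
        return {"uploaded": uploaded}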

    Conclusion:

    Integrating Documentum with an Amazon Bedrock chatbot API for product manuals offers a powerful way to unlock valuable information and provide users with an intuitive and efficient self-service experience. By building a custom extraction service to bridge the gap between Documentum and Bedrock’s data source requirements, organizations can leverage the advanced AI capabilities of Bedrock to create intelligent conversational interfaces for their product documentation. This approach enhances accessibility, improves user satisfaction, and reduces the reliance on manual document searching. Remember to carefully plan the Documentum extraction process, considering factors like scalability, incremental updates, and error handling to ensure a robust and reliable solution.

  • Distinguish the use cases for the primary vector database options on AWS:

    Here we try to distinguish the use cases for the primary vector database options on AWS:

    1. Amazon OpenSearch Service (with Vector Engine):

    • Core Strength: General-purpose, highly scalable, and performant vector database with strong integration across the AWS ecosystem. Offers a balance of flexibility and managed services.
    • Ideal Use Cases:
      • Large-Scale Semantic Search: When you have a significant volume of unstructured text or other data (documents, articles, product descriptions) and need users to find information based on meaning and context, not just keywords. This includes enterprise search, knowledge bases, and content discovery platforms.
      • Retrieval Augmented Generation (RAG) for Large Language Models (LLMs): Providing LLMs with relevant context from a vast knowledge base to improve the accuracy and factual grounding of their responses in chatbots, question answering systems, and content generation tools.
      • Recommendation Systems: Building sophisticated recommendation engines that suggest items (products, movies, music) based on semantic similarity to user preferences or previously interacted items. Can handle large catalogs and user bases.
      • Anomaly Detection: Identifying unusual patterns or outliers in high-dimensional data by measuring the distance between data points in the vector space. Useful for fraud detection, cybersecurity, and predictive maintenance.
      • Image and Video Similarity Search: Finding visually similar images or video frames based on their embedded feature vectors. Applications include content moderation, image recognition, and video analysis.
      • Multi-Modal Search: Combining text, images, audio, and other data types into a unified vector space to enable search across different modalities.

    2. Amazon Bedrock Knowledge Bases (with underlying vector store choices):

    • Core Strength: Fully managed service specifically designed to simplify the creation and management of knowledge bases for RAG applications with LLMs. Abstracts away much of the underlying infrastructure and integration complexities.
    • Ideal Use Cases:
      • Rapid Prototyping and Deployment of RAG Chatbots: Quickly building conversational agents that can answer questions and provide information based on your specific data.
      • Internal Knowledge Bases for Employees: Creating searchable repositories of company documents, policies, and procedures to improve employee productivity and access to information.
      • Customer Support Chatbots: Enabling chatbots to answer customer inquiries accurately by grounding their responses in relevant product documentation, FAQs, and support articles.
      • Building Generative AI Applications Requiring Context: Any application where an LLM needs access to external, up-to-date information to generate relevant and accurate content.
    • Considerations: While convenient, it might offer less granular control over the underlying vector store compared to directly using OpenSearch or other options. The choice of underlying vector store (Aurora with pgvector, Neptune Analytics, OpenSearch Serverless, Pinecone, Redis Enterprise Cloud) will further influence performance and cost characteristics for specific RAG workloads.

    3. Amazon Aurora PostgreSQL/RDS for PostgreSQL (with pgvector):

    • Core Strength: Integrates vector search capabilities within a familiar relational database. Suitable for applications that already rely heavily on PostgreSQL and have vector search as a secondary or tightly coupled requirement.
    • Ideal Use Cases:
      • Hybrid Search Applications: When you need to combine traditional SQL queries with vector similarity search on the same data. For example, filtering products by category and then ranking them by semantic similarity to a user’s query (see the sketch after this list).
      • Smaller to Medium-Scale Vector Search: Works well for datasets that fit comfortably within a PostgreSQL instance and don’t have extremely demanding low-latency requirements.
      • Applications with Existing PostgreSQL Infrastructure: Leveraging your existing database infrastructure to add vector search functionality without introducing a new dedicated vector database.
      • Geospatial Vector Search: pgvector has extensions that can efficiently handle both vector embeddings and geospatial data.
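
    To illustrate the hybrid pattern, here is a minimal sketch using psycopg2 against a PostgreSQL table with a pgvector column; the table name, columns, and use of the <=> cosine-distance operator are assumptions about your schema.

    Python

    import psycopg2

    def hybrid_search(conn, category, query_embedding, k=5):
        """Filter rows with ordinary SQL, then rank the survivors by vector similarity."""
        vector_literal = "[" + ",".join(str(x) for x in query_embedding) + "]"  # pgvector text format
        with conn.cursor() as cur:
            cur.execute(
                """
                SELECT id, content
                FROM products                     -- assumed table with a pgvector 'embedding' column
                WHERE category = %s               -- traditional relational filter
                ORDER BY embedding <=> %s::vector -- pgvector cosine distance
                LIMIT %s
                """,
                (category, vector_literal, k),
            )
            return cur.fetchall()

    # Usage sketch: conn = psycopg2.connect("dbname=... user=..."); hybrid_search(conn, "routers", query_embedding)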

    4. Amazon Neptune Analytics (with Vector Search):

    • Core Strength: Combines graph database capabilities with vector search, allowing you to perform semantic search on interconnected data and leverage relationships for more contextually rich results.
    • Ideal Use Cases:
      • Knowledge Graphs with Semantic Search: When your data is highly interconnected, and you want to search not only based on keywords or relationships but also on the semantic meaning of the nodes and edges.
      • Recommendation Systems Based on Connections and Similarity: Suggesting items based on both user interactions (graph relationships) and the semantic similarity of items.
      • Complex Information Retrieval on Linked Data: Navigating and querying intricate datasets where understanding the relationships between entities is crucial for effective search.
      • Drug Discovery and Biomedical Research: Analyzing relationships between genes, proteins, and diseases, combined with semantic similarity of research papers or biological entities.

    5. Vector Search for Amazon MemoryDB for Redis:

    • Core Strength: Provides extremely low-latency, in-memory vector search for real-time applications.
    • Ideal Use Cases:
      • Real-time Recommendation Engines: Generating immediate and personalized recommendations based on recent user behavior or context.
      • Low-Latency Semantic Caching: Caching semantically similar results to improve the speed of subsequent queries.
      • Real-time Anomaly Detection: Identifying unusual patterns in streaming data with very low latency requirements.
      • Feature Stores for Real-time ML Inference: Quickly retrieving semantically similar features for machine learning models during inference.
    • Considerations: In-memory nature can be more expensive for large datasets compared to disk-based options. Data durability might be a concern for some applications.

    6. Vector Search for Amazon DocumentDB:

    • Core Strength: Adds vector search capabilities to a flexible, JSON-based NoSQL database.
    • Ideal Use Cases:
      • Applications Already Using DocumentDB: Easily integrate semantic search into existing document-centric applications without migrating data.
      • Flexible Schema Semantic Search: When your data schema is evolving or semi-structured, and you need to perform semantic search across documents with varying fields.
      • Content Management Systems with Semantic Search: Enabling users to find articles, documents, or other content based on their meaning within a flexible document store.
      • Personalization and Recommendation within Document Databases: Recommending content or features based on the semantic similarity of user profiles or document content.

    By understanding these distinct use cases and the core strengths of each AWS vector database option, you can make a more informed decision about which service best fits your specific application requirements. Remember to also consider factors like scale, performance needs, existing infrastructure, and cost when making your final choice.

  • Language Models vs Embedding Models

    In the ever-evolving landscape of Artificial Intelligence, two types of models stand out as fundamental building blocks for a vast array of applications: Large Language Models (LLMs) and Embedding Models. While both deal with text, their core functions, outputs, and applications differ significantly. Understanding these distinctions is crucial for anyone venturing into the world of natural language processing and AI-powered solutions.

    At their heart, Large Language Models (LLMs) are designed to comprehend and produce human-like text. These sophisticated models operate by predicting the probability of a sequence of words, allowing them to engage in tasks that require both understanding and generation. Think of them as digital wordsmiths capable of: crafting essays, answering intricate questions, translating languages fluently, summarizing lengthy documents, completing partially written text coherently, and understanding context to respond appropriately. The magic behind their abilities lies in their training on massive datasets, allowing them to learn intricate patterns and relationships between words. Architectures like the Transformer enable them to weigh the importance of different words within a context. The primary output of an LLM is text.

    In contrast, Embedding Models focus on converting text into numerical representations known as vectors. These vectors act as a mathematical fingerprint of the text’s semantic meaning. A key principle is that semantically similar texts will have vectors located close together in a high-dimensional vector space. The primary output of an embedding model is a vector (a list of numbers). This numerical representation enables various applications: performing semantic search to find information based on meaning, measuring text similarity, enabling clustering of similar texts, and powering recommendation systems based on textual descriptions. These models are trained to map semantically related text to nearby points in the vector space, often leveraging techniques to understand contextual relationships.

    In frameworks like LangChain, both model types are crucial. LLMs are central for generating responses, reasoning, and decision-making within complex chains and agents. Meanwhile, embedding models are vital for understanding semantic relationships, particularly in tasks like Retrieval-Augmented Generation (RAG), where they retrieve relevant documents from a vector store to enhance the LLM’s knowledge.

    In essence, Language Models excel at understanding and generating human language, while Embedding Models are masters at representing the meaning of text numerically, allowing for sophisticated semantic operations. This powerful synergy drives much of the innovation in modern AI applications.
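
    As a small, concrete illustration of this contrast (a sketch assuming the sentence-transformers and transformers libraries, with flan-t5-small chosen purely for its size), note how one model returns a vector and the other returns text:

    Python

    from sentence_transformers import SentenceTransformer
    from transformers import pipeline

    text = "How do I reset my router to factory settings?"

    # Embedding model: the output is a vector representing the text's meaning.
    embedder = SentenceTransformer("sentence-transformers/all-mpnet-base-v2")
    vector = embedder.encode(text)
    print(vector.shape)  # e.g. (768,)

    # Language model: the output is text.
    generator = pipeline("text2text-generation", model="google/flan-t5-small")
    print(generator("Answer briefly: " + text)[0]["generated_text"])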