Category: indexing

  • Building a Product Manual Chatbot with Amazon OpenSearch and Open-Source LLMs

    This article guides you through building an intelligent chatbot that can answer questions based on your product manuals, leveraging the power of Amazon OpenSearch for semantic search and open-source Large Language Models (LLMs) for generating informative responses. This approach provides a cost-effective and customizable solution without relying on Amazon Bedrock. The Challenge: Navigating through lengthy Read more

  • Scaling a vector database

    Scaling a vector database is a crucial consideration as your data grows and your query demands increase. Here’s a breakdown of the common strategies and factors involved in scaling vector databases: Why Scaling is Important: Common Scaling Strategies: Techniques for Horizontal Scaling: Factors to Consider When Scaling: Choosing the Right Scaling Strategy: The best scaling Read more

  • Spring AI chatbot with RAG and FAQ

    Demonstrate the concepts of building a Spring AI chatbot with both general knowledge RAG and an FAQ section into a single comprehensive article.Building a Powerful Spring AI Chatbot with RAG and FAQLarge Language Models (LLMs) offer incredible potential for building intelligent chatbots. However, to create truly useful and context-aware chatbots, especially for specific domains, we Read more

  • Vector Database Internals

    Vector databases are specialized databases designed to store, manage, and efficiently query high-dimensional vectors. These vectors are numerical representations of data, often generated by machine learning models to capture the semantic meaning of the underlying data (text, images, audio, etc.). Here’s a breakdown of the key internal components and concepts: 1. Vector Embeddings: 2. Data Read more

  • Tensor

    PyTorch’s fundamental data structure is the Tensor. It’s the central object for numerical computation in PyTorch, analogous to NumPy’s ndarray but with added capabilities for GPU acceleration and automatic differentiation (crucial for deep learning). Here’s a breakdown of PyTorch’s data structure landscape, with the Tensor at the core: 1. Tensors (torch.Tensor) 2. NumPy Arrays (numpy.ndarray) Read more