Building a Documentation Agent with RAG¶
Current AI models are trained on vast datasets, making them powerful at generating general-purpose text. However, when asked about specific topics outside their training data (like your company's internal documentation), these models often hallucinate - generating plausible-sounding but incorrect information.
Thankfully, there is a solution to this problem: Retrieval-Augmented Generation (RAG). This technique consists on combining two key components:
- A retrieval system that finds relevant information from your custom dataset
- A language model that generates accurate responses using the retrieved information
In this tutorial, you'll learn how to build a RAG-powered agent that accurately answers questions about NEAR Protocol.
What You Will Need¶
To follow this tutorial you will need:
- NEAR AI CLI installed on your local machine → Installation Guide
- Basic understanding of creating a NEAR AI agent → Agents Quickstart Tutorial
Overview¶
This tutorial is divided in the following sections:
- The problem → Understanding AI hallucination from incorrect data
- Vector Stores → Getting started with vector stores
- RAG Agent → Building a NEAR Documentation Q&A agent
- Chunking → Dive deeper into how vector stores store documents
- Embeddings → Creating document embeddings manually