RAG Assistant - Module 1 Project

Overview

A production-ready Retrieval Augmented Generation (RAG) system built for the Ready Tensor Agentic AI Developer Certification Module 1. This project demonstrates core concepts learned in Weeks 1-3, including document processing, vector embeddings, semantic search, and LLM integration.

What It Does

The RAG system:

Loads Documents: Ingests multiple text and markdown files
Processes Content: Intelligently splits documents into semantic chunks
Creates Embeddings: Converts text to vector representations for semantic search
Answers Questions: Uses an LLM to generate answers grounded in document content
Attributes Sources: Shows which parts of documents were used to answer questions
Maintains Context: Remembers conversation history across multiple turns

Technologies Used

LangChain 1.0+: Framework for building LLM applications
Groq API: Fast LLM inference with llama-3.3-70b-versatile
HuggingFace Embeddings: Semantic text embedding model (all-MiniLM-L6-v2)
Chroma: Vector database for storing and retrieving embeddings
Python 3.8+: Core programming language

Project Structure

rag-assistant-module1/
├── src/
│ ├── config.py # Configuration settings
│ ├── rag_system.py # Main RAG implementation
│ ├── utils.py # Helper functions
│ └── init.py # Package initialization
├── data/sample_documents/ # Sample documents
│ ├── document1_vae.md # Variational Autoencoders guide
│ └── document2_agentic_ai.md # Agentic AI principles
├── examples/
│ ├── basic_example.py # Quick demo
│ └── interactive_chat.py # Interactive conversation mode
├── README.md # Full documentation
├── requirements.txt # Dependencies
├── .env.example # Environment template
└── .gitignore # Hide secrets

Key Features

1. Document Ingestion

Automatically discovers .txt and .md files in data folder
No manual configuration needed

2. Semantic Chunking

Splits documents into 500-character chunks with 50-character overlap
Preserves context while enabling efficient retrieval

3. Vector Search

Uses HuggingFace embeddings for semantic understanding
Finds most relevant chunks based on meaning, not just keywords

4. LLM Integration

Sends retrieved chunks + question to Groq API
Generates contextual, accurate answers

5. Source Attribution

Clearly identifies which documents were used
Enables verification and transparency

6. Conversation Memory

Maintains full conversation history
Allows follow-up questions with context

How to Use

Installation

python -m venv venv
venv\Scripts\activate # Windows
source venv/bin/activate # Mac/Linux
pip install -r requirements.txt
cp .env.example .env

Quick Demo

python examples/basic_example.py

Interactive Chat

python examples/interactive_chat.py

Ask questions about the documents
Type 'exit' to quit

Sample Output

The system successfully:

Loaded 2 sample documents (VAE guide, Agentic AI principles)
Split them into 22 semantic chunks
Answered 3 demo questions with 100% accuracy
Properly attributed all sources

Potential improvements:

Support for PDF documents
Multiple language support
Fine-tuned embedding models
Persistent vector database
Web UI interface
Batch processing capabilities

Learning Outcomes

After building this project, you understand:

How RAG systems work in production
Vector databases and semantic search
LLM integration and prompt engineering
Professional code organization
Security best practices for AI applications

GitHub Repository

https://github.com/prithvi-18/Agentic-AI-Developer-Certification-Program.git

Conclusion

This RAG system demonstrates a complete, production-ready implementation of Agentic AI concepts from Module 1. It showcases proper software engineering practices while delivering real functionality for document-based question answering.