I built a Retrieval-Augmented Generation (RAG) assistant that answers questions from custom documents using Groq AI, ChromaDB, and Sentence Transformers. This project demonstrates the core concepts of Agentic AI by implementing a complete document QA pipeline.
My implementation follows these steps:
my-rag-project/
├── rag_pipeline.py # Main RAG orchestration
├── vector_store.py # Vector database management
├── json_processor.py # Document loading & chunking
├── test_llm.py # API connection test
├── requirements.txt # Dependencies
└── README.md # Project documentation


Through this project, I learned:
How to implement a complete RAG pipeline from scratch
The importance of proper document chunking for retrieval quality
How vector databases enable semantic search
Prompt engineering for better answer generation
Error handling in AI pipelines
Add conversation memory for multi-turn dialogues
Support additional document formats (PDF, DOCX)
Implement ReAct pattern for complex reasoning
Add web interface using Streamlit
Deploy as a REST API service
Ready Tensor AAIDC Module 1 materials
Groq API documentation
ChromaDB documentation
Sentence Transformers models
This project was completed as part of Module 1 of the Agentic AI Developer Certification Program.