This project builds a Retrieval-Augmented Generation (RAG) assistant that answers developer questions about Docker using real documentation.
It combines document ingestion, semantic search, and LLM-based answering with source citations.
Docker documentation is large, fragmented, and hard to search.
Developers often struggle to find precise answers to questions like:
Traditional keyword search fails to capture semantic meaning or cite sources.
We built a modular RAG pipeline that:
.txt files from Docker docs.sentence-transformers/all-MiniLM-L6-v2llama-3-8b-instant via GroqQ: What is Docker volume?
A: A Docker volume persists data outside the lifecycle of a container. It can be shared across containers and remains even when a container is removed.
Sources: docker_volume_overview.txt, docker_bind_mounts.txt
Q: How does the VOLUME instruction in a Dockerfile work?
A: The VOLUME instruction creates a mount point inside the container and marks it for external volumes so data written there is persisted outside the container layer.
Sources: dockerfile_reference.txt
pip install -r requirements.txt python src/cli.py