This project implements a RAG-based (Retrieval-Augmented Generation)
AI assistant that answers questions based on Ready Tensor publications.
The system combines vector search with large language models to provide
accurate, context-aware responses grounded in real documents.
Traditional AI chatbots generate responses purely from training data,
which can lead to hallucinations or outdated information. This RAG-based
assistant solves that by first retrieving relevant content from a
document corpus, then using an LLM to generate answers based on that
retrieved context.
The pipeline works in the following steps: