This work introduces DataVortex, an intelligent chatbot system designed using the principles of retrieval-augmented generation (RAG) and agentic AI reasoning. Traditional large language models (LLMs) demonstrate impressive generative capabilities, yet frequently hallucinate or fail when faced with knowledge-intensive tasks requiring external, up-to-date context. DataVortex addresses these limitations by integrating a retrieval pipeline with an agentic decision layer that allows tool usage, multi-step reasoning, and grounded response generation.The system architecture consists of query preprocessing, embedding generation, semantic search through a vector database, context assembly, and a reasoning engine powered by an LLM. An optional agentic layer extends the modelβs capacity to call APIs, perform computations, or chain multiple steps for more complex queries.Experimental evaluations demonstrate that DataVortex significantly improves response grounding, citation transparency, and contextual accuracy compared to baseline LLM outputs. Example interactions highlight its ability to handle both factual and reasoning-heavy queries. This work showcases how agentic RAG frameworks can be deployed in scalable chatbot systems for education, research, and enterprise use cases.
Over the past few years, large language models (LLMs) have redefined human-computer interaction by enabling conversational AI that feels natural and responsive. Despite these advances, LLMs continue to suffer from a critical flaw: hallucination. When confronted with knowledge-intensive questions beyond their training data, they often produce fabricated or outdated information.
To mitigate these issues, retrieval-augmented generation (RAG) has emerged as a promising paradigm. By retrieving external knowledge sources and grounding model responses, RAG enables more accurate and trustworthy output. In parallel, the concept of agentic AI β systems that can autonomously select tools, reason across multiple steps, and adapt dynamically β has gained momentum.
This paper presents DataVortex, a chatbot that combines both paradigms. Unlike vanilla RAG bots, DataVortex incorporates an agentic decision layer, enabling it to not only recall knowledge but also act on queries requiring additional computation or API access. This approach enhances adaptability, reliability, and scalability in real-world applications.
RAG was popularized by Lewis et al. (2020), who demonstrated the value of combining retrieval mechanisms with generative modeling for knowledge-intensive tasks. Since then, frameworks such as LangChain and LlamaIndex have simplified the integration of retrieval pipelines with LLMs.
However, these systems often stop at retrieval and lack agency. Recent advances in autonomous agent frameworks have enabled AI systems to reason about tasks, use external tools, and engage in iterative planning. DataVortex merges these two streams of work β retrieval grounding and agentic reasoning β to create a chatbot capable of both factual reliability and flexible problem-solving.
The architecture of DataVortex follows a modular pipeline:
Optional Agentic Layer:
For complex tasks, the system can call external tools (e.g., web APIs, calculators, summarization modules) and incorporate the results into its reasoning pipeline.
DataVortex is built using Python, LangChain, FAISS, and a LLaMA-based LLM. Documents (text/PDF) are chunked into ~500 tokens with overlap, embedded, and stored in FAISS for fast similarity search. Queries go through a retrieval + context assembly pipeline, then passed to LLaMA for grounded responses.
Key optimizations:
This setup ensures efficient, accurate, and open-source RAG operations without reliance on closed APIs.
Code Modules Map (repo structure):
DataVortex was tested on mixed text and PDF datasets. It consistently retrieved relevant context and produced fact-grounded answers with minimal hallucinations. Response times were optimized with GPU inference, averaging 1.8s per query.
Compared to closed models, LLaMA offered greater control, transparency, and cost efficiency, making DataVortex practical for real-world use.
Discussion & Future Work
DataVortex demonstrates that combining retrieval with agentic reasoning creates a chatbot capable of handling both knowledge grounding and dynamic reasoning tasks. Its modular architecture makes it extensible across domains such as education, research assistance, and enterprise knowledge bases.
However, limitations remain:
Future work will expand the system with multimodal input support (images, audio), real-time web search, advanced reranking methods, and improved evaluation metrics for factual correctness.
This work presented DataVortex, a retrieval-augmented and agentic chatbot designed to address the weaknesses of standard LLMs. By grounding responses in external knowledge and incorporating an agentic decision layer, DataVortex demonstrates significant improvements in accuracy, transparency, and adaptability. The system highlights the potential of agentic RAG frameworks to power next-generation conversational AI.