The Ready Tensor Publication Explorer is an advanced AI-powered tool that utilizes Retrieval-Augmented Generation (RAG) techniques to automate the handling of a sample dataset that contains Ready Tensor technical documentation. By leveraging RAG models, the system delivers accurate and context-aware responses to (natural language) user queries. Integrating OpenAI embeddings, semantic search capabilities, and a user-friendly interface, this tools offers a scalable and efficient solution for Ready Tensor users, developers, researchers, and organizations searching streamlined access to documentation resources enclosed in the Ready Tensor platform by exploring its contents and asking questions.
This project uses a sample dataset and is structured on a modular LangChain-based pipeline.
Sample Dataset:
A collection of 35 Ready Tensor publications, each of them characterised by id
, username
, license
, title
, and publication description
. There are 6 types of licenses; 27 publications use βMITβ or βCCβ, and the rest are βnoneβ or missing. Under MIT/CC, reuse is permitted for open source projects.
Features & Modules:
This project is structured on a modular LangChain-based pipeline in which each feature is mapped to the specific tool or module implementing it:
Feature | Tool / Library / Module |
---|---|
Prompt formulation | LangChain PromptTemplate |
Vector store retrieval | Chroma Vector Database |
LLM-generated response | OpenAI GPT-3.5/4 via LangChain |
Document ingestion & embedding | LangChain DocumentLoader, OpenAI Embeddings, Chroma |
Minimal UI for interaction | Streamlit |
Example queries, retrieval, response eval | LangChain Chains & Evaluators |
Session-based memory/intermediate reasoning | LangChain ReAct, ConversationBuffer |
Workflow:
The LangChain-based pipeline is designed to:
The core workflow and system architecture of the application are illustrated in the following flowchart:
This pubblication has a GitHub code repository
attached under the "Code" section.
Prerequisites: Python 3.10+, pip, and access to the referenced dataset.
git clone https://github.com/Joshua-Abok/rag_apk cd rag_apk git checkout dev
python3 -m venv .venv source .venv/bin/activate # Linux / macOS .\.venv\Scripts\activate # Windows
export OPEN_API_KEY=your_open_api_key_here # Linux / macOS set OPEN_API_KEY=your_open_api_key_here # Windows
pip install -r requirements.txt
Note: The
sample dataset
is available in the "Datasets" section.
streamlit run app/main.py
You can now interact with the Ready Tensor Publication Explorer!
The main landing page of the Ready Tensor Publication Explorer, showing a search bar (selectbox for choosing a publication title and viewing its details) and the chat interface.
The search bar interface where users can view and search the list of publications by title.
Example of detailed view of the content of a publication, selected by title.
Examples of interactive chat interface for asking questions about publications using the RAG-powered assistant.
Note: When a user asks a question in the chat, the agent has access to the content of all publications and can retrieve information from any or all of them to answer the query. The chat input can be used to ask about any aspect of the dataset, including questions that span multiple publications. Therefore, the agent will use the full dataset to answer, not just the selected publication.
The assistant helps users explore and comprehend Ready Tensor publications. Example general queries might be:
Use Cases:
API keys are stored securely in environment variables for secure access. The API exposes endpoints for querying and interacting with Ready Tensor publications using the RAG pipeline.
We welcome contributions to improve the Ready Tensor Publication Explorer!
git checkout -b your-feature-name
Please follow our code style and guidelines. For questions or suggestions, open an issue.
We are actively seeking contributors who want to help implement and/or propose the following future features:
Feel free to suggest more ideas by opening an issue or starting a discussion! For bug reports or feature requests, open an issue. For general questions or share your thoughts, start a comment.
This publication is licensed under the MIT License.
chibueze.k.muoneke@gmail.com, michelaagostini73@gmail.com, nyajuaya.j.a@gmail.com
This project is part of the Agentic AI Developer Certification program by the Ready Tensor. We appreciate the contributions of the Ready Tensor developer community for their guidance and contributions.