Cloud cost governance has become a critical requirement as organizations increasingly adopt multi-cloud platforms. This work presents a stateless FinOps conversational agentic AI system that combines multi-agent reasoning, Retrieval-Augmented Generation (RAG), Text-to-SQL translation, and Groq-based LLM inference to automate FinOps analysis. The system supports cost queries, data insights, trend generation, and domain-grounded explanations using a modular agentic architecture. The agents include an Intent Router, Entity Extractor, Text-to-SQL Agent, Data Fetcher, Insight Agent, Visualizer Agent, Knowledge Agent, and Supervisor. The Knowledge Agent further enhances accuracy using FinOps domain documents stored as .txt files. The current version operates as a stateless chatbot, while future work will extend it into a stateful, memory-based FinOps assistant capable of long-term dialogue and contextual continuity.
Financial Operations (FinOps) aims to enhance accountability and cost efficiency in cloud environments. Traditional FinOps analysis depends heavily on manual dashboards, SQL queries, and multi-team collaboration. This increasingly becomes inefficient as cloud usage grows in volume and complexity.
Large Language Models (LLMs) provide an opportunity to automate these processes through natural language queries. However, naive LLM usage can lead to hallucinations and unreliable financial outputs. To address these issues, this work introduces a multi-agent FinOps conversational system powered by LangGraph. Through modular agents and RAG-driven knowledge grounding, the system reliably answers FinOps questions, performs SQL-based cost extraction, computes cloud insights, and generates expenditure visualizations.
The architecture is designed to be fully modular, making it extensible for future research and industry deployment. The present implementation is stateless, meaning each user query is treated independently. A future extension will introduce statefulness, enabling memory of previous conversations, iterative analysis, and long-term cost investigations.
The system handles FinOps queries end-to-end:
natural language ā intent routing ā SQL ā insights ā visualization ā RAG ā final summary.
Key Features
Installation
git clone https://github.com/Suchi-BITS/ReadyTensor_Project-2.git
python -m venv venv
venv\Scripts\activate
pip install --upgrade pip
pip install -r requirements.txt
Running the app
streamlit run integrations/app.py
Project Structure
A-Agent/
ā
āāā agents/
ā āāā agentic_tools/
ā ā āāā entity_extraction.py
ā ā āāā text2sql.py
ā ā āāā transfer_tools.py
ā ā āāā init.py
ā ā
ā āāā data_fetcher.py
ā āāā insightAgent.py
ā āāā intent_router.py
ā āāā knowledge.py
ā āāā small_talk.py
ā āāā supervisor.py
ā āāā visualizerAgent.py
ā
āāā prompts/
ā āāā intent_router.txt
ā āāā knowledge_agent.txt
ā āāā schema-context.txt
ā āāā supervisor.txt
ā āāā text2sql.txt
ā āāā visualizer.txt
ā āāā small_talk.txt
ā
āāā data/
ā āāā cloud_architecture_patterns.txt
ā āāā cloud_cost_analysis.txt
ā āāā cloud_cost_anomaly_detection.txt
ā āāā cloud_cost_governance.txt
ā āāā cloud_cost_optimisation_strategy.txt
ā āāā cloud_financial_forecasting.txt
ā āāā cloud_financial_operating_maturity_model.txt
ā āāā cloud_predictive_analysis.txt
ā āāā finops_operating_model.txt
ā āāā finops.txt
ā
āāā results/
ā āāā data_cleaned.csv
ā āāā query_results.csv
ā āāā auto_visualization.png
ā āāā trend_chart.png
ā āāā trend_chart_modern.png
ā
āāā utils/
ā āāā prompt_loader.py
ā āāā logger_setup.py
ā āāā log_utils.py
ā āāā sample.py
ā
āāā integrations/
ā āāā app.py
ā āāā main.py
ā
āāā .env
āāā requirements.txt
āāā secrets.toml
āāā README.md
2.1 System Overview
The methodology employs an agentic pipeline consisting of specialized agents, each responsible for a distinct FinOps task. LangGraph is used as the orchestration layer that manages routing, control flow, and inter-agent message exchange.
2.2 Agents
a. Intent Router Agent: Classifies user queries into SQL, insights, knowledge, or small talk.
b. Entity Extraction Agent: Extracts resources, cost fields, dates, and filters using schema context.
c. Text-to-SQL Agent: Converts natural language queries into Snowflake SQL.
d. Data Fetcher Agent: Executes SQL, cleans and stores results.
e. Insight Agent: Generates KPIs, cost trends, anomalies, and summaries using Python REPL execution.
f. Visualizer Agent: Produces cost trend charts and comparative visualizations.
g. Knowledge Agent (RAG): Loads all FinOps domain text files, consolidates them, and generates grounded explanations.
h. Supervisor Agent: Manages workflow ordering, merges outputs, and produces final responses.
2.3 RAG Pipeline
A set of domain-specific .txt files related to cloud architecture, cost governance, FinOps principles, predictive analysis, and optimization strategies are loaded dynamically. These documents serve as grounding material for the Knowledge Agent. This ensures explanations remain aligned to FinOps standards rather than relying solely on LLM inference.
2.4 Stateless Architecture
The system is currently stateless, meaning:
This helps evaluate agent performance independently, but future work aims to develop a stateful version with user memory, context persistence, and session-level FinOps tracking.
High-Level System Architecture

Agent-to-Agent Communication (Sequence Diagram)

Multi-Agent Workflow (LangGraph Execution)

4.1 SQL Generation Accuracy
The Text-to-SQL agent produced syntactically correct and semantically meaningful queries for most user inputs. Errors were eliminated through schema-context grounding.
4.2 Insight Generation
The Insight Agent correctly produced monthly spend trends, summaries, and KPIs using Python REPL, enabling reproducible computation instead of hallucinated numbers.
4.3 Visualization
Visualizer Agent generated clear PNG charts showing trends, seasonality, and anomalies.
4.4 Knowledge Agent Performance
RAG significantly improved factual correctness:
20-30% reduction in hallucinations
Outputs aligned closely with FinOps standards
Domain-specific definitions became consistent
4.5 End-to-End Execution
The LangGraph supervisor reliably orchestrated all agents, producing complete answers across diverse FinOps queries.
UI
This work presents a functional and robust stateless multi-agent FinOps conversational system that integrates LLMs, LangGraph, text-based RAG, SQL generation, data insights, and visualization. The combination of agent specialization and knowledge grounding produces accurate and reliable FinOps responses suitable for operational usage.
Future Work
The next phase will introduce a stateful FinOps assistant capable of:
Maintaining long-term memory
Tracking cost over multiple user sessions
Performing iterative analysis
Asking clarifying questions
Supporting FinOps decision-making through multi-query reasoning
This will move the system from a single-turn analytical tool to a full conversational FinOps co-pilot.