A smart, AI-powered assistant that understands text, voice, images, and PDFs.
Built using Mistral LLM + Streamlit.
Check out a quick walkthrough of the app below:
| Tool | Role |
|---|---|
| Python | Backend logic |
| Streamlit | UI & Web App Framework |
| Mistral API | LLM for response generation |
| SpeechRecognition | Voice input support |
| PyTesseract | OCR from images |
| PyPDF2 | PDF text extraction |
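
As a rough illustration of how the tools in the table could fit together, here is a minimal sketch that routes an uploaded file to the matching extractor. The function names, the extension mapping, and the use of Google's web recognizer for audio are assumptions made for this example, not necessarily how `app.py` does it. The third-party imports are kept inside each branch so only the library actually needed has to be installed.

```python
from pathlib import Path

# Hypothetical mapping from file extension to input kind (an assumption,
# not taken from the project's source).
KIND_BY_EXTENSION = {
    ".pdf": "pdf",
    ".png": "image",
    ".jpg": "image",
    ".jpeg": "image",
    ".wav": "audio",
    ".txt": "text",
}


def detect_input_kind(filename: str) -> str:
    """Classify a file by its extension; returns "unknown" if unrecognized."""
    return KIND_BY_EXTENSION.get(Path(filename).suffix.lower(), "unknown")


def extract_text(path: str) -> str:
    """Turn a supported input file into plain text for the LLM prompt."""
    kind = detect_input_kind(path)
    if kind == "pdf":
        from PyPDF2 import PdfReader  # PDF text extraction

        reader = PdfReader(path)
        # extract_text() may return None for pages without a text layer.
        return "\n".join(page.extract_text() or "" for page in reader.pages)
    if kind == "image":
        from PIL import Image
        import pytesseract  # OCR; needs the Tesseract binary installed

        return pytesseract.image_to_string(Image.open(path))
    if kind == "audio":
        import speech_recognition as sr  # the SpeechRecognition package

        recognizer = sr.Recognizer()
        with sr.AudioFile(path) as source:
            audio = recognizer.record(source)
        # Assumption: Google's free web recognizer; other backends exist.
        return recognizer.recognize_google(audio)
    if kind == "text":
        return Path(path).read_text(encoding="utf-8")
    raise ValueError(f"Unsupported input type: {path}")
```

The extracted text can then be passed to the Mistral API as part of the prompt. Keeping detection separate from extraction makes the routing easy to test without any of the heavier dependencies.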

    # 1. Clone the repository
    git clone https://github.com/yourusername/multimodal-chatbot.git
    cd multimodal-chatbot

    # 2. Create virtual environment
    python -m venv venv
    source venv/bin/activate  # Windows: venv\Scripts\activate

    # 3. Install dependencies
    pip install -r requirements.txt

    # 4. Add MISTRAL API key to `.streamlit/secrets.toml`
    [mistral]
    api_key = "YOUR_MISTRAL_API_KEY"

    # 5. Run the application
    streamlit run app.py

You can easily deploy this app to any modern cloud platform:
Note: Always store sensitive credentials securely using environment variables or `.streamlit/secrets.toml`.
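
One way to follow this advice is sketched below: look for the key in an environment variable first, then fall back to a secrets mapping shaped like the `[mistral]` section of `.streamlit/secrets.toml`. The helper name `load_mistral_api_key` and the environment variable name `MISTRAL_API_KEY` are assumptions for illustration; the project itself may read the key differently.

```python
import os
from typing import Any, Mapping, Optional


def load_mistral_api_key(
    secrets: Optional[Mapping[str, Any]] = None,
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Return the Mistral API key without hard-coding it in source.

    `secrets` is any mapping shaped like the secrets file, for example
    {"mistral": {"api_key": "..."}}. `env` defaults to os.environ.
    """
    env = os.environ if env is None else env

    # Assumed variable name; handy on cloud platforms that inject env vars.
    key = env.get("MISTRAL_API_KEY")
    if key:
        return key

    # Fall back to the [mistral] section of the secrets mapping.
    if secrets is not None:
        key = secrets.get("mistral", {}).get("api_key")
        if key:
            return key

    raise RuntimeError(
        "Mistral API key not found: set MISTRAL_API_KEY or add it to "
        ".streamlit/secrets.toml"
    )
```

Inside the Streamlit app this could be called as `load_mistral_api_key(st.secrets)`, since `st.secrets` behaves like a mapping. Whichever source is used, keep `.streamlit/secrets.toml` out of version control.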
Here's what's planned:
Licensed under the MIT License; see the LICENSE file.
Adyaprana Pradhan