A set of services for prototyping a voice chatbot: the user speaks to the bot, and the bot speaks its answer back.
All code is available here.
This repository is designed to build an application for a personal computer (with one GPU that has at least 6 GB of VRAM). Further modifications are required to serve it to multiple users in the cloud.
Tutorials will be published on my blog (https://website.vincent-roger.fr/blog/).
The announcement for the first version of this project is now live. Watch the video on YouTube.
A Streamlit service that interacts with the user (to record speech and play the answer). It calls the other services in the order in which they are listed here.
This service is responsible for converting spoken language into text. It uses the Whisper tiny model.
A chatbot (based on Qwen 2.5.1) that generates a text answer to the user's request.
This service converts the generated text response back into spoken language. It uses the Parler-TTS Mini model.
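The order in which the interface calls the other services can be sketched as a simple three-stage pipeline. This is only an illustration: the function names and the stand-in stages below are hypothetical and are not taken from the repository. In the real application, each stage would be a call to the corresponding service.

```python
from typing import Callable

# Hypothetical stage signatures (assumptions for this sketch only).
SpeechToText = Callable[[bytes], str]
Chatbot = Callable[[str], str]
TextToSpeech = Callable[[str], bytes]


def answer_spoken_request(
    audio: bytes,
    transcribe: SpeechToText,
    chat: Chatbot,
    synthesize: TextToSpeech,
) -> bytes:
    """Run one request through the stages in the order listed above."""
    text = transcribe(audio)   # speech-to-text stage
    reply = chat(text)         # chatbot stage
    return synthesize(reply)   # text-to-speech stage


# Stand-in stages so the sketch runs without any model or service.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_chat(text: str) -> str:
    return f"You said: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")
```

Passing the stages as arguments keeps the orchestration independent of how each service is reached, so the stand-ins can be swapped for real clients without changing the pipeline function.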
Before setting up the project, ensure you have the following installed:
Tested on Linux using Fedora 41. Feedback on other platforms is welcome.
To set up the project, clone the repository and navigate to the project directory:
git clone https://github.com/yourusername/voice-chatbot.git
cd voice-chatbot
Build and launch the Docker containers using the command:
docker compose up --build
Open your web browser and go to http://localhost:8501/.
The web interface waits for the other services to be up and running before you can speak with the bot.
Use the interface to record your speech and get a spoken response from the chatbot.
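Waiting for the other services before enabling the conversation can be sketched as a polling loop. This is a minimal sketch under assumptions, not the repository's implementation: the function name `wait_for_services`, the probe callables, and the timeout values are all hypothetical.

```python
import time
from typing import Callable, Dict, List


def wait_for_services(
    probes: Dict[str, Callable[[], bool]],
    timeout: float = 60.0,
    interval: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Poll every probe until all return True or the timeout expires.

    Returns the sorted names of the services that are still not ready
    (an empty list means everything is up).
    """
    deadline = clock() + timeout
    pending = dict(probes)
    while pending:
        for name, is_ready in list(pending.items()):
            try:
                if is_ready():
                    del pending[name]
            except Exception:
                # A probe that raises (for example, connection refused)
                # is treated as "not ready yet".
                pass
        if not pending or clock() >= deadline:
            break
        sleep(interval)
    return sorted(pending)
```

In practice, each probe could be a small function that sends a request to one service and returns True on success. Which endpoint to query depends on how each service is exposed, which this sketch does not assume.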
Contributions are welcome! Please follow these steps to contribute:
Create a branch (git checkout -b feature-branch).
Commit your changes (git commit -m 'Add some feature').
Push the branch (git push origin feature-branch).
This project is licensed under the MIT License.