Welcome to the NaoAI-Assistant repository! This project was developed as part of the Cognitive Computing subject and is a collaborative effort by the following team members:
This project leverages the Nao robot, utilizing its SDK and artificial intelligence capabilities to create a personalized assistant. The robot can engage in interactive conversations, recognize speech, answer questions using GPT-3.5 API, generate text-to-speech responses, and even detect facial emotions.
nao_script.pyThe nao_script.py file enables the Nao robot to initiate a conversation by asking if you have any questions. It records your answer for further processing.
speech_recognition_code.pyThe speech_recognition_code.py file utilizes the SpeechRecognition library to extract text from an audio file. This is a crucial step in understanding the user's input.
gpt3_code.pyThe gpt3_code.py file interacts with the GPT-3.5 API to generate intelligent responses based on the user's input. It plays a key role in providing informative and contextually relevant answers.
nao_tts_code.pyThe nao_tts_code.py file uses a text-to-speech (TTS) library to convert the generated text responses into audio. This enables the Nao robot to articulate its answers in a human-like manner.
emo_detection.pyThe emo_detection.py file receives an image from the Nao robot and utilizes a facial emotion detection model. This model, trained using OpenCV CascadeClassifier and TensorFlow, allows the robot to be aware of the user's facial expressions.
my_wave.pyThe my_wave.py file adds a friendly touch to the interaction. When the user says "hi" to the Nao robot, it responds by saying hello while waving.
To get started, follow these steps:
nao_script.py to initiate the conversation with Nao.speech_recognition_code.py to extract text from audio files.gpt3_code.py for intelligent responses based on the input.nao_tts_code.py for a natural interaction.emo_detection.py.my_wave.py.nao_script.py: Initiates conversation with the Nao robot.speech_recognition_code.py: Extracts text from audio files using SpeechRecognition.gpt3_code.py: Interacts with the GPT-3.5 API for intelligent responses.nao_tts_code.py: Converts text to speech for the Nao robot.emo_detection.py: Detects facial emotions using a trained model.my_wave.py: Nao robot's friendly wave response to the user.