Multimodal AI Assistant Using Whisper and LLaVA