Deepfake image classification with Multimodal