White-Box Interpretability and Adversarial Testing of the MedGemma Language Model