Towards Scalable Grammar Scoring from Spoken Language: Using Faster-Whisper and Language Models