This project presents an end-to-end OCR system for Vietnamese scene text recognition. It processes the MC_OCR dataset and implements a two-stage architecture:
1. Open the notebook in Google Colab.
2. Connect to a GPU runtime.
3. Mount Google Drive and install the required libraries.
4. Run the notebook cells sequentially:
| Metric | Score |
|---|---|
| Test Loss | 0.4042 |
| Character Error Rate (CER) | 0.1502 |
| Sequence Accuracy | 59.03% |
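
For readers who want to check metrics like those in the table, the sketch below shows one common way to compute them. It is not taken from the notebook. It assumes that CER is the total Levenshtein edit distance divided by the total number of reference characters (micro-averaged over the test set), that sequence accuracy is the fraction of exact matches, and that strings are NFC-normalized so each Vietnamese diacritic character counts as one symbol. The function names `edit_distance`, `character_error_rate`, and `sequence_accuracy` are hypothetical.

```python
import unicodedata


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def _nfc(text: str) -> str:
    # Assumption: compare in NFC form so composed and decomposed
    # diacritics are treated as the same character.
    return unicodedata.normalize("NFC", text)


def character_error_rate(refs: list[str], hyps: list[str]) -> float:
    """Micro-averaged CER: total edits / total reference characters."""
    refs = [_nfc(r) for r in refs]
    hyps = [_nfc(h) for h in hyps]
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_chars = sum(len(r) for r in refs)
    return total_edits / total_chars if total_chars else 0.0


def sequence_accuracy(refs: list[str], hyps: list[str]) -> float:
    """Fraction of predictions that match the reference exactly."""
    if not refs:
        return 0.0
    matches = sum(_nfc(r) == _nfc(h) for r, h in zip(refs, hyps))
    return matches / len(refs)
```

Under these definitions a single wrong character makes the whole sequence count as incorrect, so sequence accuracy can stay well below `1 - CER`. If the notebook averages CER per sample instead of over all characters, its numbers will differ from what this sketch produces.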