Multimodal AI model based on Vision Transformers, implemented in approximately 500 lines of code